Neoadjuvant therapy (NAT) is increasingly used in breast cancer management [
1–
3]. NAT aims to reduce the volume of inoperable breast lesions, such as locally advanced or inflammatory carcinomas, to facilitate surgery. More recently, NAT has been extended to patients with operable disease, reflecting data showing improved disease-free survival and overall survival in such patients if a pathologic complete response (pCR) is achieved [
4,
5]. Accumulating evidence supports dynamic contrast-enhanced MRI (DCE-MRI) for monitoring early response to NAT [
6] and assessing for pCR [
7,
8]; professional society guidelines currently recommend performing MRI both before and after NAT for response evaluation, recognizing the test's high diagnostic performance [
7,
9,
10]. However, the use of MRI continues to be limited by contraindications, patient preferences, high cost, incomplete access, and potential treatment delays.
Contrast-enhanced mammography (CEM) is an emerging breast imaging technique that combines digital dual-energy mammography and IV administration of an iodinated contrast agent [
11]. CEM entails acquisition of a two-view low-energy image, equivalent to a standard digital mammogram, and a high-energy image using an x-ray spectrum above the 33.2-keV K-edge of iodine. The high-energy images are not suitable for diagnostic purposes and are postprocessed to obtain recombined images showing areas of contrast enhancement [
12]. Numerous studies have shown substantially higher diagnostic performance in breast cancer detection for CEM than for conventional morphologic techniques, such as digital mammography and breast tomosynthesis [
12–
14]. Studies have also reported comparable performance of CEM and MRI in breast cancer detection and staging [
15,
16], indicating that CEM could serve as a credible alternative to MRI for such purposes [
17–
23]. Supporting the emergence of CEM, the American College of Radiology BI-RADS recently released a supplement with a CEM lexicon [
24].
Findings on both CEM and DCE-MRI reflect tumor neoangiogenesis, which is essential for breast cancer growth, progression, and dissemination. Hypoxia, along with other physiologic and pathologic stimuli, leads to the formation of mural defects within the vasculature that allow contrast medium to extrava-sate and temporarily accumulate in the tumor interstitium. This extravasated contrast medium may be visualized on gadolinium-enhanced T1-weighted MR images and on recombined CEM images. Systemic therapy alters breast vascularity, diminishing enhancement of both fibroglandular tissue and tumors on MRI [
25]. After systemic therapy, residual viable breast cancer may exhibit slow initial and persistent late enhancement that is typically associated with benign tumors. A similar pattern after NAT may be observed on CEM, such that any residual enhancement at the site of a previously known tumor, even in a delayed phase of acquisition, would warrant suspicion for residual disease.
The aim of this study was to compare CEM and MRI in the evaluation of NAT response in patients with breast cancer, with additional attention to the utility of a delayed CEM acquisition.
Methods
Patients
The institutional review board approved this prospective single-center study. All patients provided signed written informed consent.
From May 2015 to April 2018, consecutive women with biopsy-proven breast cancer who were candidates for NAT were screened for potential study eligibility. On the basis of this screening process, women were considered ineligible if they were under age 18 because a history of adverse reaction to iodinated and/or gadolinium-based contrast agents or severe allergiclike reaction to drugs and/or foods, had renal function impairment, were unable to undergo MRI, were pregnant, or were breastfeeding. The number of women deemed ineligible on the basis of these screening criteria was not recorded. Eligible patients were approached for study enrollment. Enrolled patients underwent evaluation by both MRI and CEM at three time points: pre-NAT (before treatment), mid-NAT (within 1–3 months after treatment start), and post-NAT (2 weeks after the last treatment cycle). Enrolled patients who did not complete imaging at all time points were excluded from the final analysis.
The MRI and CEM examinations were performed on separate days at each time point. In premenopausal women, DCE-MRI was obtained during the follicular phase between the 7th and 14th days of the menstrual cycle to minimize background parenchymal enhancement [
26]. No attempts were made to perform CEM examinations at a certain menstrual phase. Adverse reactions to the contrast material administrations for CEM and MRI were recorded.
A medical oncologist selected the NAT regimen for each patient according to the cancer's biologic characteristics. The NAT selection and administration were performed as part of patients' clinical care, unrelated to this investigation. No patient received neoadjuvant endocrine therapy. Before the start of NAT, each lesion was marked with a charcoal suspension [
27] to facilitate later identification of the original tumor location, particularly in patients with pCR. Charcoal is routinely used for this purpose because it does not yield MRI artifact, has low cost, does not entail radioisotope injection, and has an excellent safety profile with very rare complications.
Contrast-Enhanced Mammography Protocol
CEM examinations were performed using a Selenia Dimensions mammography system (Hologic) equipped with software for dual-energy imaging (I-View, Hologic). Before image acquisition, 1.5 mL/kg body weight of iodinated contrast agent (350 mg I/mL ioversol, Optiray, Mallinckrodt Pharmaceuticals) was administered IV using a power injector at a flow rate of 3 mL/s; 2 minutes after contrast material injection, the breast was compressed, and a set of low-energy and high-energy images were acquired.
For pre-NAT CEM examinations, two standard craniocaudal and mediolateral oblique views were acquired in each breast. For mid-NAT and post-NAT examinations, CEM was performed only in the affected breast to reduce radiation exposure for study participants. In addition, for post-NAT examinations, a delayed CEM acquisition of the affected breast was obtained at 6 minutes after contrast material injection. A 6-minute interval was used for the delayed CEM acquisition to mirror the timing of the final contrast-enhanced MRI acquisition (as described in the next section) and to obtain images before a time point at which most benign and malignant lesions would be expected to show physiologic washout.
MRI Protocol
MRI examinations were performed with the patient in the prone position using a 1.5-T unit (Signa, GE Healthcare) equipped with an 8-channel phased-array breast coil. In each patient, the three examinations used the same protocol, which included an axial T2-weighted STIR sequence (TR/TE, 5362/50; inversion time, 150 ms; flip angle, 160°; slice thickness, 2 mm; matrix, 356 × 356), an axial DWI echo-planar sequence (TR/TE, 8883/70; slice thickness, 3 mm; matrix, 356 × 356; b values, 50 and 800 s/mm2), and an axial T1-weighted gradient-echo 3D Vibrant-Flex (GE Health-care) sequence (TR/TE, 6.6/4.5; slice thickness, 1.6 mm; matrix, 356 × 356) that was acquired once before and five times after IV administration of 0.1 mmol/kg of gadobenate dimeglumine (Multi-Hance, Bracco Imaging) using a power injector at a flow rate of 2 mL/s. Postprocessed subtraction, multiplanar reconstruction, and maximum-intensity-projection images were generated.
Image Analysis
One radiologist (D.B., with 20 years of posttraining experience in breast imaging including 3 years in CEM) reviewed the CEM examinations, and a different radiologist (C.F., with 15 years of post-training experience in breast imaging including 15 years in breast MRI) reviewed the MRI examinations. The readers were blinded to the results of the other imaging modality and to other clinical details. For each examination, the readers recorded whether an enhancing lesion was identified; the presence of an enhancing lesion was considered to indicate residual disease. If an enhancing lesion was identified, then the readers recorded the lesion's largest dimension. For CEM, this measurement was performed on the recombined images; for MRI, this measurement was performed on the subtracted postcontrast images. For multifocal lesions, the largest lesion was measured. For post-NAT CEM examinations, during the same interpretation session, the reader also recorded whether an enhancing lesion was identified on delayed CEM and, if so, measured the lesion's size on delayed CEM. For all modalities, if no enhancing lesion was identified, then a lesion size of 0 mm was recorded. The readers reviewed the examinations from all three time points for each patient in a single session. Before the readings, to help standardize tumor size measurements, the readers jointly reviewed the imaging in 10 patients with breast cancer who underwent both CEM and MRI but who were not included in the study analysis.
Treatment response was determined between pre-NAT and mid-NAT imaging and between pre-NAT and post-NAT imaging using the lesion size measurements and response assessment categories based on RECIST version 1.1 (v1.1) [
28]. Baseline and follow-up examinations at the given time points were compared separately for CEM and MRI. Response was classified as a complete response if the lesion was no longer visualized, partial response if the lesion showed at least a 30% diameter decrease, progressive disease if the lesion showed at least a 20% diameter increase, and stable disease if the lesion showed neither a diameter decrease of at least 30% or a diameter increase of at least 20% [
28].
Histopathologic Reference Standard
All patients underwent surgical resection of the tumor after completion of NAT. One of two breast pathologists (nonauthors, both with 20 years of posttraining experience in breast pathology) performed the histologic assessment of the surgical specimen in each patient to determine whether pCR was achieved and to measure the size of residual tumor. For the purpose of the study, pCR was defined as the absence of both invasive cancer and ductal carcinoma in situ (DCIS). The presence of DCIS as the only residual tumor (i.e., T category of ypTis) was recorded. The tumor size was recorded as 0 mm in patients with pCR and in patients with only DCIS.
Post Hoc Analysis
Patients with a size discrepancy of more than 10 mm between imaging modalities at any time point or between any post-NAT imaging modality and final pathology were considered outliers in terms of size measurement. For these patients, the two study readers performed a joint post hoc assessment to identify reasons for the discrepancy. This assessment included evaluation of all imaging tests, with attention to additional lesion characteristics and imaging findings that may have contributed to the discrepancy, and review of the pathologic findings. The two study readers also performed a joint post analysis of all patients with a false-positive or false-negative interpretation for pCR on any post-NAT imaging test.
Statistical Analysis
Continuous variables were summarized as mean ± SD or as median and interquartile range depending on the given variable's distribution as assessed by the Shapiro-Wilk test. Bland-Altman analysis was used to assess agreement in measurements of lesion size among combinations of CEM, MRI, delayed CEM, and final pathology; results of the Bland-Altman analyses were reported as the bias and limits of agreement. Pearson correlation coefficients were used to assess agreement of size measurements among imaging tests and pathology and were classified as follows: 0.00–0.09, negligible correlation; 0.10–0.39, weak correlation; 0.40–0.69, moderate correlation; 0.70–0.89, strong correlation; 0.90–1.00, very strong correlation [
29]. Unweighted Cohen kappa coefficients were used to assess agreement of RECIST response categories between mid-NAT CEM and MRI and between post-NAT CEM and MRI. Overall results were classified as follows using the scale created by Landis and Koch [
30]: 0.00–0.20, slight agreement; 0.21–0.40, fair agreement; 0.41–0.60, moderate agreement; 0.61–0.80, substantial agreement; and 0.81–1.00, almost perfect agreement. The diagnostic performance of post-NAT CEM, MRI, and delayed CEM for the detection of pCR was calculated using presence of pCR on final pathology as the reference standard. Findings were considered true-positive when imaging showed no enhancement and pathologic assessment showed pCR, true-negative when imaging showed enhancement and pathology showed residual disease, false-positive when imaging showed no enhancement and pathology showed residual disease, and false-negative when imaging showed enhancement and pathology showed pCR. Diagnostic performance was compared between imaging tests using the McNemar test. A
p value less than .05 was considered statistically significant. MedCalc for Windows (version 19.2, MedCalc Software) was used for all statistical analyses.
Discussion
This prospective single-center study compared CEM, MRI, and delayed CEM in the evaluation of tumor response to NAT in patients with breast cancer. Lesion size was consistently lower on CEM than on MRI across the study time points, with a maximal systematic bias of –0.7 mm. On post-NAT imaging, both CEM and MRI systematically overestimated lesion size with respect to final pathology (by 1.2 mm for MRI and 0.8 mm for CEM). Size on post-NAT imaging was within 10 mm of the size on final pathology in 86–88% of patients for both modalities. RECIST v1.1 response assessment categories on CEM and MRI showed substantial agreement on mid-NAT imaging and almost perfect agreement on post-NAT imaging. Finally, on post-NAT imaging, sensitivity for pCR was significantly higher for MRI than for CEM, although specificity and accuracy were not significantly different. Though MRI remains the preferred imaging test for post-NAT response assessment, the findings indicate potential utility of CEM in evaluating NAT response when MRI cannot be performed (e.g., because of unavailability, contraindication, or patient preference).
In early studies after the advent of CEM, Fallenberg et al. [
31] and Lobbes et al. [
32] found no significant differences in breast cancer size measurements among CEM, MRI, and pathology, with good correlation among the techniques. However, in a prospective study of 33 patients who underwent NAT, Barra et al. [
20] found that CEM and MRI exhibited mean differences in residual tumor size of 8 mm and 18 mm, respectively, in comparison with pathology. The size discrepancies with respect to pathology are substantially larger in the study by Barra et al. than in the current study. One possible explanation relates to the assessment of multifocal residual cancer, which was observed in the study by Barra et al. but not in the current study. For determination of histopathologic size in such cases, Barra et al. calculated the sum of the diameters for all residual lesions. The authors did not explicitly indicate whether the same approach was used for determining the size of multifocal lesions on imaging. In comparison, the largest dimension of the largest lesion was measured for multifocal lesions on imaging in the current study. Additionally, the prone position used for breast MRI results in a median lesion displacement of approximately 3–6 cm in all three orthogonal directions in comparison with the supine position used during surgery, possibly yielding greater distances among multiple enhancing foci [
33]. Similar geometric issues apply, albeit to a lesser extent, as a result of breast compression during CEM. Finally, for both imaging and pathology, mean post-NAT sizes were larger for Barra et al. (1.6 cm for pathology, 2.4 cm for CEM, 3.6 cm for MRI) than in the current study (8.1 mm for pathology, 10.3 mm for CEM, 10.6 mm for MRI, 11.5 mm for delayed CEM), indicating the presence of additional underlying differences between studies in terms of patients and/or cancers.
Size discrepancies of more than 10 mm among various combinations of imaging modalities and pathology were observed in a small number of patients. One factor contributing to size discrepancy was background parenchymal enhancement on CEM. This finding supports scheduling CEM on day 7–14 of the menstrual cycle in trials evaluating NAT response. In a recent meta-analysis of the diagnostic performance of CEM that included over 10,000 patients [
15], the timing of CEM examinations according to menstrual cycle phase was not significantly associated with the sensitivity or specificity of CEM, although caution was advised in interpreting these results given the fragmented nature in which the timing of the CEM examinations was reported across studies.
The definition of pCR itself is a central factor affecting the performance of imaging tests for pCR detection, though this definition varies across centers [
7]. A key aspect of the variation in the definition of pCR is the classification of residual DCIS. In our study, pCR was defined as the absence of invasive cancer and DCIS. Residual DCIS was not considered pCR because accurate detection and size estimation of residual DCIS by imaging could guide tailored surgical plans, helping to achieve negative margins and thereby successful radical surgery [
34]. Although DCIS was considered to represent residual disease, the pathologist did not record the exact size of residual DCIS, resulting in any residual enhancement correctly identified on imaging being deemed an overestimate of pathologic size. These considerations indicate ongoing challenges presented by residual DCIS for both pathology and imaging interpretation.
Past studies have also compared the performance of CEM and MRI in detecting pCR after NAT [
18,
20–
22]. Our study aligns with those by Iotti et al. [
21] and Patel et al. [
18] in terms of the definition of pCR and in terms of defining the tests' diagnostic performance with respect to detection of pCR (vs detection of residual disease). Iotti et al. evaluated 46 women with breast cancer who underwent NAT and reported sensitivities of 100% (95% CI, 63–100%) and 87% (95% CI, 47–100%) and specificities of 84% (95% CI, 69–94%) and 60% (95% CI, 43–76%) for CEM and MRI, respectively. Patel et al. evaluated 65 women with breast cancer who underwent NAT and reported a sensitivity of 95% (95% CI, 73–100%) for both CEM and MRI and a specificity of 66.7% (95% CI, 51–80%) for CEM and 68.9% (95% CI, 53–81%) for MRI. Contrary to those two studies, our study showed that MRI had significantly higher sensitivity than CEM, achieving a sensitivity for pCR of 100%. Variation across studies in the rates of pCR (31% for the current study vs 17% for Iotti et al. and 31% for Patel et al.) and DCIS (24% for the current study vs 7.8% for Patel et al.; rate not directly reported by Iotti et al.) may contribute to the variation in performance. The performance of MRI in the current study may also in part relate to the use of gadobenate dimeglumine, a high-relaxivity contrast agent that has shown superiority to other agents [
35,
36]. The imaging criterion of an absence of enhancement for detecting pCR in the present study and the two earlier studies also impacts the overall observed levels of performance. A prior meta-analysis noted significant variation in the performance of MRI for detecting pCR after NAT on the basis of the selected threshold, with studies variably defining response as no enhancement or enhancement equal to or less than that of normal breast tissue [
9]. However, enhancement equal to or less than that of normal breast tissue cannot be used for CEM because normal fibroglandular tissue is not recognizable on CEM images.
Several earlier studies explored the adjunct of delayed CEM acquired within 10 minutes after contrast material administration in clinical settings other than NAT monitoring, yielding variable results [
37–
39]. One study reported that the delayed acquisition had higher specificity for diagnosing mass lesions in dense breasts [
37], whereas two other studies did not report significant performance improvements [
38,
39]. To our knowledge, this is the first study to evaluate a delayed CEM acquisition for the assessment of response to NAT. At post-NAT imaging, delayed CEM yielded a systematically larger size with respect to CEM (by 1.1 mm) and detected additional cases of residual DCIS that were not detected by CEM or MRI. This resulted in delayed CEM having the fewest number of false-positive results and having a higher specificity for pCR in comparison with CEM, although this difference in specificity was not statistically significant. The detection of in situ disease may be the primary advantage of delayed CEM, facilitating individualized surgical plans [
40]. On the basis of the findings, we suggest that a delayed acquisition be included if performing CEM rather than MRI after NAT.
The use of NAT has increased in the last decade, evolving from a therapy intended to enable breast conservation to an option for improving outcomes by achieving pCR [
41]. In current practice, all patients undergoing NAT then undergo surgery as part of a standard treatment plan. However, omission of surgery in patients with imaging findings of pCR, supported by percutaneous needle biopsy, remains a future goal [
42]. To adopt such an approach, highly reliable detection of residual disease by imaging is crucial. Though MRI remains the preferred test, the modality has a range of barriers, including high cost and heterogeneous availability. CEM may represent an accessible and affordable alternative. An advantage of CEM is its acceptance by patients. In a study by Hobbs et al. [
43], 49 interviewed women overall preferred CEM to MRI because CEM was faster, more comfortable, and less noisy. A concern relating to use of CEM is the potential for adverse reactions to the iodinated contrast agent. In a systematic review of 84 studies and 14,012 patients, Zanardo et al. [
44] reported a pooled rate of adverse reactions of 0.82%, with most reactions being mild.
Limitations of our study must be considered. First, the study was conducted at a single center and had a small sample size. Second, results were not stratified by tumor subtypes. Third, only a single reader reviewed images for each modality. Fourth, the reader for CEM had substantially less experience than the reader for MRI, reflecting experience levels for the two modalities that are likely to be encountered in clinical practice. Indeed, breast MRI was described as early as 1986 [
45], whereas CEM was initially described in trials from 2003 [
13,
46,
47]. Finally, the additional glandular dose from the deferred CEM acquisitions was not recorded. According to a prior systematic review, the mean additional glandular dose derived from a delayed CEM acquisition ranges from 0.43 to 2.65 mGy per view [
44]. In addition, Jeukens et al. [
48] found that radiation exposure from a unilateral single CEM exposure posed a small health risk, considering not only the average glandular dose but also the age-dependent lifetime attributable risk for breast cancer incidence and mortality.
In conclusion, CEM and MRI yielded comparable lesion measurements and RECIST v1.1 response assessments after NAT for breast cancer and no significant difference in specificity or accuracy for pCR. Nonetheless, MRI had a sensitivity of 100% for detecting pCR, which was significantly higher than the sensitivity of CEM. Though MRI remains the preferred test when available, the findings support CEM as a useful alternative when MRI is unavailable. If CEM is performed after NAT, then inclusion of a delayed CEM acquisition may help detect residual DCIS.