|
|
||||||||
1
Department of Radiology, Box 1667, University of California Medical Center,
San Francisco, CA 94143-1667.
2
Department of Radiology, Box 357115, University of Washington Medical Center,
1959 N.E. Pacific St., Seattle, WA 98195.
Received June 28, 2001;
accepted after revision September 6, 2001.
Presented in part at the annual meeting of the American Roentgen Ray
Society, Seattle, AprilMay 2001.
Abstract
|
|
|---|
MATERIALS AND METHODS. We analyzed outcomes from 51,805 consecutive mammography examinations. Screening and diagnostic examinations were audited separately. Diagnostic examinations were audited by indication for examination. Extrapolating from our known mix of screening (79%) and diagnostic (21%) examinations, we determined expected combined outcomes for various mixes that might be encountered in clinical practice. Similarly, we determined the expected overall diagnostic mammography outcomes for various clinically relevant mixes of indications for examination.
RESULTS. Outcomes vary substantially depending on the mix of screening and diagnostic examinations performed. For example, expected outcomes for practices with screeningdiagnostic mixes of 90-10% and 50-50% are, respectively: rate of abnormal findings, 6% versus 11%; rate of positive biopsy findings, 38% versus 42%; cancer detection rate, 10 per 1,000 versus 30 per 1,000; mean invasive cancer size, 14.4 mm versus 16.0 mm; nodal metastasis rate, 8% versus 11%; and rate of stage 0 and stage I cancers, 87% versus 82%. Diagnostic outcomes also vary substantially according to indication for examination, with a higher rate of abnormal findings, a higher rate of positive biopsy findings, and a larger mean invasive cancer size expected for mixes involving a high percentage of workups for palpable lesions.
CONCLUSION. When screening and diagnostic mammography outcomes are not segregated during auditing, and when diagnostic outcomes are not segregated by indication for examination, analysis of combined audit data should be based on extrapolations from known outcomes.
|
|
|---|
On the basis of published mammography outcomes data derived from large screening practices and population-based screening programs [3,4,5, 7,8,9,10,11,12,13], desirable goals have been put forth for the detection of breast cancer in asymptomatic women [2, 6, 9, 14]. However, substantial differences in mammography outcomes are found when auditing screening versus diagnostic examinations [15, 16], and some of these differences have been shown to be statistically significant [16]. The typical mammography practice involves a mixture of screening and diagnostic examinations, but the percentage of screening versus diagnostic examinations may vary considerably. Therefore, a mammography practice should expect its own outcomes data to vary depending on its specific mix of cases.
Diagnostic mammography examinations are performed for a variety of problem-solving indications ranging from the presence of breast implants in asymptomatic women to the evaluation of the extent of a known breast cancer. Insofar as the probability of malignancy is vastly different in these clinical scenarios, substantial differences in mammography outcomes are found when diagnostic examinations themselves are segregated by indication for examination [16]. Again, a mammography practice should expect its own diagnostic outcomes data to vary depending on its specific mix of indications for examination.
Although subset analyses of auditing data (e.g., screening versus diagnostic examinations, diagnostic examinations as a function of indication for examination) produce more meaningful results, federal regulations do not mandate this type of auditing, and the time and cost involved are substantial. These factors limit the widespread use of comprehensive auditing. In this article, we present data to aid mammography facilities that do not segregate mammography outcomes during auditing. By developing mathematic models based on outcomes data from our own comprehensively audited practice, we have constructed tables of expected outcomes for the full range of case mixes encountered in mainstream clinical practice.
|
|
|---|
We segregated our diagnostic examinations according to indication for examination using the following categories: workup of a screening examination with abnormal findings (20% of cases), short-interval follow-up of a probably benign lesion (6%), periodic surveillance of a cancer patient treated with breast preservation surgery (lumpectomy) (20%), workup of a palpable mass (15%), and other (39%). The "other" category was used for examinations that did not fit into any of the previous categories and included some asymptomatic women with breast implants or a history of previous mastectomy for carcinoma, women with a history of focal breast pain or palpable breast thickening, and women with known breast cancer being examined either for extent of disease before surgery or for response to neoadjuvant chemotherapy.
At the time of image interpretation, the radiologist entered in our computer database the indication for the examination, a BI-RADS assessment category for each breast, and management recommendations. Subsequently, we recorded a benign or malignant diagnosis for all cases in which biopsy was performed. For biopsies showing malignancy (defined as ductal carcinoma in situ or any invasive carcinoma), we also recorded the size, nodal status, and stage of the cancer, based on the American Joint Committee on Cancer staging system [18].
On the basis of these data, we performed a medical audit of our practice, segregating screening from diagnostic examinations and subdividing the diagnostic examinations into categories according to indication for examination. In this manner, we were able to perform subset analyses that we then used as the source for spreadsheet-based mathematic (simple linear) extrapolations of outcomes data involving a series of case mixes of screening versus diagnostic examinations, and a series of case mixes involving different percentages of indications for diagnostic examinations.
|
|
|---|
Among the diagnostic examinations, 1,779 were interpreted as having abnormal findings (BI-RADS assessment category 4 or 5). Recommended biopsies were performed for 1,301 of these examinations, yielding a diagnosis of malignancy in 600 cases. Invasive cancer was found in 416 cases, 84 of which were axillary nodal metastasis. For all cancers detected at diagnostic mammography, 450 were stage 0 or stage I.
Table 1 summarizes the outcomes data for screening and diagnostic examinations in our practice. Based on mathematic extrapolation using these observed data, Table 2 displays combined screening plus diagnostic outcomes data derived for a range of case mixes of screening and diagnostic examinations that may be encountered in clinical practice.
|
|
We designed the data in Table 2 to be used by mammography practices that do not segregate screening and diagnostic examinations during auditing. If a given practice can determine (or estimate) its own case mix of screening and diagnostic examinations, it can compare its observed outcomes data with those presented in the table. For example, a practice that involves 30% screening and 70% diagnostic mammography should expect to observe different outcomes from those of a practice that involves 90% screening and 10% diagnostic mammography. For these two respective practices, the estimated rates of abnormal findings are 13% versus 6%, the estimated rates of positive biopsy findings are 44% versus 38%, the estimated cancer detection rates are 39 per 1,000 versus 10 per 1,000, the estimated rates of axillary nodal metastasis are 12% versus 8%, the estimated rates of stage 0 and stage I cancers are 79% versus 87%, and the estimated mean sizes of invasive cancer are 16.8 mm versus 14.4 mm.
Diagnostic Outcomes as a Function of Indication for Examination
This analysis involved 11,114 diagnostic examinations. Of these cases,
2,214 were performed for the workup of abnormal screening examinations, 673
for short-interval follow-up of probably benign lesions, 2,253 for
short-interval follow-up of lumpectomy patients, 1,692 for the evaluation of
palpable masses, and 4,282 for other (miscellaneous) reasons. Among the
diagnostic examinations performed to work up abnormal screening examinations,
814 were interpreted as abnormal (biopsy recommended). Biopsies eventually
were performed for 640 of these examinations, yielding a diagnosis of
malignancy in 248 cases. Invasive cancer was found in 147 cases, 13 of which
were axillary nodal metastasis. For all the detected cancers, 217 were stage 0
or stage I. Among the diagnostic examinations performed for short-interval
follow-up of probably benign lesions, 43 were interpreted as abnormal (biopsy
recommended). Biopsies were performed for 30, yielding a diagnosis of
malignancy in three cases. Invasive cancer was found in two cases, neither of
which was axillary nodal metastasis. All three cancers were stage 0 or stage
I. Among the diagnostic examinations performed for short-interval follow-up of
lumpectomy patients, 63 were interpreted as abnormal (biopsy recommended).
Biopsies were performed for 49, yielding a diagnosis of malignancy in 36
cases. Invasive cancer was found in 21 cases, one of which had axillary nodal
metastasis. For all the detected cancers, 32 were stage 0 or stage I. Among
the diagnostic examinations performed to evaluate a palpable mass, 385 were
interpreted as abnormal (biopsy recommended), and biopsies were performed for
337, yielding a diagnosis of malignancy in 209 cases. Invasive cancer was
found in 179 cases, 58 of which were axillary nodal metastasis. For all the
detected cancers, 101 were stage 0 or stage I. Among the diagnostic
examinations performed for other reasons, 474 were interpreted as abnormal
(biopsy recommended), and biopsies were performed for 298, yielding a
diagnosis of malignancy in 126 cases. Invasive cancer was found in 67 cases,
12 of which were axillary nodal metastasis. For all the detected cancers, 98
were stage 0 or stage I.
Table 3 summarizes the outcomes data obtained for diagnostic examinations in our practice according to indication for examination. As has been reported previously, outcomes data for the probably benign category, the lumpectomy category, and the "other" (miscellaneous) category were generally similar [16]. The most obvious exception was the high rate of positive biopsy findings for lumpectomy cases (73% in our practice; reported in other studies to be approximately 50%) [19, 20]). For simplicity, we combined the three similar categories in producing Tables 4,5,6,7, which display combined diagnostic mammography outcomes data for many ranges of case mixes of indications for examination. We designed the data in these tables to be used by mammography practices that audit screening and diagnostic examinations separately but do not perform subset audits of their diagnostic mammography cases. Those practices that can determine (or estimate) their own case mixes of indications for examination can compare their observed diagnostic outcomes data with those presented in the appropriate table. Tables 4,5,6,7 should be used by practices for which the workup of abnormal screening examinations constitutes 10%, 20%, 30%, and 40% (respectively) of all diagnostic mammography examinations. Review of Tables 4,5,6,7 shows that combined diagnostic outcomes data are influenced primarily by the percentage of examinations performed to evaluate palpable masses.
|
|
|
|
|
Palpable Masses
As shown in Table 8,
separate analysis of our diagnostic examinations performed for evaluation of
palpable masses showed striking differences in clinical outcomes between this
subset of examinations and the combination of all other diagnostic
examinations, as well as the combination of all other diagnostic and screening
examinations. Because it may be easier for a mammography practice to simply
determine (or estimate) the percentage of its patients who have palpable
masses, we produced Tables 9
and 10, which display
estimates of combined outcomes data for many ranges of case mixes of palpable
mass cases. We designed Table 9
to be used by mammography practices that audit screening and diagnostic
examinations separately and Table
10 to be used by practices that do not segregate screening and
diagnostic examinations during auditing.
|
|
|
As Table 9 indicates, a practice performing 5% of its diagnostic mammography examinations to evaluate palpable masses can expect to have different outcomes from a practice performing 70% of its examinations to evaluate palpable masses. For these two respective practices, the estimated rate of abnormal findings in 15% versus 20%, the estimated rate of positive biopsy results is 42% versus 56%, and the estimated cancer detection rate is 48 per 1,000 versus 100 per 1,000.
As Table 10 indicates, a practice performing 2% of its screening plus diagnostic mammography examinations to evaluate palpable masses can expect to have different outcomes from a practice performing 30% of its cases to evaluate palpable masses. For these two practices, respectively, the estimated rate of abnormal findings is 7% versus 12%, the estimated rate of positive biopsy findings is 40% versus 46%, and the estimated cancer detection rate is 15 per 1,000 versus 46 per 1,000.
|
|
|---|
Unfortunately, many practices have neither the time nor the financial resources to perform comprehensive segregated audits. In this situation, analysis of combined data should be based on known differences between diagnostic and screening outcomes. By using the data reported in this article, which involve concurrently collected, consecutive screening and diagnostic mammography examinations, we derived outcomes data for a wide range of case mixes of screening-to-diagnostic examinations (Table 2), so that a practice that can determine (or estimate) its own case mix can then use our derived data as a benchmark with which to compare their observed outcomes.
Diagnostic mammography examinations are performed for a variety of problem-solving indications. Substantial differences in mammography outcomes are found when diagnostic examinations themselves are segregated by indication for examination [16]. Again, a mammography practice should expect its own diagnostic outcomes data to vary depending on its specific mix of indications for examinations. Therefore, we provide Tables 4,5,6,7 to be used by the diagnostic mammography practice that can determine (or estimate) its percentage of examinations done for each indication, so that its medical audit may be more meaningful.
Our results support data found in other smaller series of patients undergoing diagnostic mammography for palpable masses [21,22,23]. As shown in Table 8, the effect of palpable masses on audit outcomes data is so substantial that we suggest that a practice can simply derive expected outcome data if it can determine (or estimate) the percentage of examinations it performed for the evaluation of palpable masses. Tables 9 and 10 serve this purpose.
The data reported herein come from an academic practice in which most cases are interpreted by full-time breast imaging specialists. These data, therefore, may not be representative of the performance of general diagnostic radiologists in community practice. However, our series of concurrent screening and diagnostic mammography examinations is now the only published set of data with a sufficiently large number of cases to use for estimating outcomes. As other large consecutive series of comprehensively audited combinations of screening plus diagnostic cases are reported, especially if some come from community practices, a more meaningful data set can be used to derive the outcomes estimates displayed in the tables of this article.
|
|
|---|

16 and 900.
Washington, DC: United States Department of Health and Human Services,
1997
This article has been cited by other articles:
![]() |
E. A. Sickles, D. L. Miglioretti, R. Ballard-Barbash, B. M. Geller, J. W. T. Leung, R. D. Rosenberg, R. Smith-Bindman, and B. C. Yankaskas Performance Benchmarks for Diagnostic Mammography Radiology, June 1, 2005; 235(3): 775 - 790. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Smith-Bindman, P. W. Chu, D. L. Miglioretti, E. A. Sickles, R. Blanks, R. Ballard-Barbash, J. K. Bobo, N. C. Lee, M. G. Wallis, J. Patnick, et al. Comparison of Screening Mammography in the United States and the United Kingdom JAMA, October 22, 2003; 290(16): 2129 - 2137. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. A. Berg Rationale for a Trial of Screening Breast Ultrasound: American College of Radiology Imaging Network (ACRIN) 6666 Am. J. Roentgenol., May 1, 2003; 180(5): 1225 - 1228. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |