|
|
||||||||
Original Research |
1 Département de Radiologie, Assistance Publique des Hôpitaux de
Paris, 20 rue Leblanc, Paris, F-75015 France.
2 Université Paris-Descartes, Faculté de Médecine, Paris,
F-75015 France.
3 Département de Radiologie, Hôpital Européen Georges
Pompidou, Paris, F-75015 France.
4 Centre d'Investigations Cliniques, Hôpital Européen George
Pompidou, Paris, F-75015 France.
5 Unité de Recherché Clinique, Hôpital Européen
George Pompidou, Paris, F-75015 France.
Received July 15, 2005;
accepted after revision October 10, 2005.
Address correspondence to M.-P. Revel
(marie-pierre.revel{at}egp.aphp.fr).
Abstract
|
|
|---|
MATERIALS AND METHODS. CT lung analysis volumetric software was used to retrospectively calculate the doubling times of 63 solid noncalcified nodules by comparing nodule volumes on baseline and follow-up CT scans obtained a median of 3.7 months apart. A final diagnosis based on validated criteria was available for all 63 nodules. All CT examinations were performed with 1.25-mm-thick slices on a four-detector unit. Taking 500 days as the upper value for malignancies, we evaluated whether the software-calculated doubling times could be used to distinguish malignant from benign solid nodules. We also examined whether the relative volume variation of benign nodules correlated with initial nodule size, interscan interval, or differences in contrast administration or exposure parameters between baseline and follow-up CT.
RESULTS. There were 52 benign and 11 malignant nodules. Benign nodules had a median doubling time of 947 days and a mean relative volume variation of -4.4% (range, -50% to 38%). Malignant nodules had a median doubling time of 117 days and a mean relative volume variation of 102% (22-462%). The sensitivity, specificity, and negative and positive predictive values of the volumetric software for diagnosing malignancy were 91% (95% confidence interval [CI], 0.59-1.00), 90% (95% CI, 0.79-0.97), 98% (95% CI, 0.89-1.00), and 67% (95% CI, 0.38-0.88), respectively. No correlation was found between the relative volume variation of benign nodules and their initial size, the interscan interval, or differences in contrast administration or exposure parameters between the two CT examinations.
CONCLUSION. Software-calculated pulmonary nodule doubling times of more than 500 days have a 98% negative predictive value for the diagnosis of solid malignant pulmonary nodules. This method may be useful for diagnosing malignant pulmonary nodules on follow-up CT.
Keywords: chest comparative studies computer applications3D CT lung lung cancer MDCT oncologic imaging
|
|
|---|
Most solid nodules identified incidentally or during screening are benign [1-3, 5], but this diagnosis is rarely evident on the basis of morphologic criteria alone. Although certain features on thin-section CT images can help to distinguish between small malignant and benign nodules [6], internal nodule characteristics show considerable overlap [7]. The presence of intranodular fat is one reliable sign of a benign lesion [7], but few nodules have that characteristic.
In theory, contrast-enhanced CT is applicable for the diagnosis of nodules larger than 5 mm, but its specificity is only 58% [8]. Fluorine-18 FDG PET is inaccurate for the diagnosis of nodules smaller than 10 mm in diameter [9, 10], which represent most nodules detected on CT [1-3, 5]. Because the probability of malignancy is relatively low in this size range [11], invasive management is not warranted. For all these reasons, nodules with a diameter of less than 10 mm are generally monitored by means of serial CT, with the aim of detecting a size increase suggestive of malignancy [12]. The maximal transverse diameter generally is used to monitor pulmonary nodules and to calculate their volume. However, volume changes estimated from 2D images may miss asymmetric growth [13]. Furthermore, manual measurements are not sufficiently reproducible to detect small variations in nodule diameter. The repeatability coefficient was 1.3 mm for the best observer in a study of intraobserver agreement for 2D CT measurements [14], suggesting that smaller variations cannot be considered clinically significant.
It is estimated that the doubling time of malignant tumors is generally between 30 and 500 days, with a mean of 100 days [15].
In this study, we evaluated software permitting automatic volume segmentation of pulmonary nodules and calculation of their doubling time. Using an upper cutoff of 500 days for the malignant doubling time, we calculated sensitivity, specificity, and positive and negative predictive values for the diagnosis of malignancy. We also analyzed the influence of factors such as initial nodule size, interscan interval, and differences between initial and follow-up CT examinations (contrast administration and exposure parameters) on the relative volume variation of benign noninfectious nodules.
|
|
|---|
Nodule Selection and Imaging
Solid noncalcified nodules with a diameter smaller than 2 cm were selected
for this study if they were detected on CT performed with a 1.25-mm
collimation and a standard reconstruction algorithm and if a second CT scan
obtained with these same two parameters was available, together with the final
diagnosis. A keyword search on our department database for the years 2000-2003
yielded 63 nodules meeting these criteria. These 63 nodules had been found in
45 patients who ranged in age from 35 to 82 years (mean, 59 years) and had
been examined with a CT unit (LightSpeed, GE Healthcare) for various clinical
indications. The patients consisted of 15 women ranging in age from 41 to 82
years (median, 65 years) and 30 men ranging in age from 35 to 78 years
(median, 54 years). The age distribution of the men and women was not
significantly different (p = 0.08, Mann-Whitney test).
Twenty-seven of the 45 patients had a solitary nodule. The two largest nodules were selected in the 18 patients who had more than one nodule. CT had been performed for follow-up of thoracic or extrathoracic malignancy in 17 patients and to characterize a pulmonary nodule that had been discovered incidentally on chest radiography in 15 patients. In the remaining 13 patients, the nodules had been found incidentally on CT scans acquired for suspected pulmonary embolism in seven cases; hemoptysis in three cases; and pulmonary fibrosis, cardiomyopathy, and pleuro-pneumonia in one case each.
The initial CT acquisitions were performed with (20 cases) or without (43 cases) contrast medium, depending on the indication.
The CT acquisition that served to calculate the nodule doubling time was generally performed 3 months after the first; in our institution, this CT follow-up time is standard for patients with indeterminate nodules measuring between 5 and 10 mm. For patients with nodules smaller than 5 mm in diameter, the interval was between 6 and 12 months. Nodules with evidence of malignancy on the baseline examination were reevaluated 3-8 weeks later, at the time of core biopsy or surgical resection.
Software Description and Doubling Time Calculation
The nodule volume doubling time was calculated on a workstation (Advantage
Workstation 4.0, GE Healthcare Europe) running software (CT Advanced Lung
Analysis, GE Healthcare Europe). This software segments pulmonary nodules with
a combination of watershed segmentation and shape-analysis techniques. It is
dedicated to the evaluation of small lesions, not large masses. Different
shape-analysis algorithms are used depending on the type of nodule, which is
recognized automaticallythat is, juxtavascular (strong vascular
connections); juxtapleural (nodule connected to the pleural surface); and well
circumscribed (few vascular connections and no pleural contact). CT
acquisitions with a slice thickness of no more than 1.25 mm and a standard
reconstruction algorithm are recommended for accurate segmentation. The
software compares nodule volumes between baseline and follow-up CT and
calculates the doubling time if growth is detected. The same CT parameters
(field of view, exposure parameters) should be used for volume comparison and
accurate doubling-time estimation. The relative volume variation
(RVV) is obtained with the following equation:
![]() |
On the basis of the literature [15, 16], we assumed that a doubling time of less than 500 days corresponded to a malignant lesion for solid nodules. Using this cutoff value, we calculated the sensitivity, specificity, and positive and negative predictive values of the software for diagnosing malignancy.
Noninfectious benign nodules normally remain stable over time, and their relative volume variation between two consecutive CT scans should therefore be close to zero. Two of the 52 benign nodules were found to be infectious, and both fully resolved on late control examinations. These two nodules were thus excluded from further analysis. For the remaining 50 benign nodules, we analyzed the influence on the relative volume variation of the following factors: initial nodule size, nodule type (juxtapleural, juxtavascular, or well circumscribed), interscan interval, and differences in the use of contrast material and in the exposure parameters between the two CT examinations.
Final Diagnosis
The final diagnosis of malignancy for the 11 malignant lesions was based on
pathologic examination of core biopsies, surgical specimens, or both. Other
lesions were confirmed to be benign if they did not grow for at least 2 years
on CT (42 nodules), showed no 18F-FDG uptake and measured at least
10 mm (two nodules), or had morphologic features typical of a benign lesion
(seven nodules contained fat). The last benign nodule in this series was
confirmed to be benign at surgery.
Statistical Analysis
Statistical calculations were performed using statistics software (SAS
version 8.2, SAS Institute). The performance of the CT Advanced Lung Analysis
software was assessed in terms of its sensitivity, specificity, and positive
and negative predictive values for malignancy; the associated 95% confidence
intervals (CIs) were calculated with the exact binomial method. Age
distribution was compared between men and women using the Mann-Whitney
test.
Because the relative volume of variation was normally distributed, the correlation between relative volume variation and the studied parameters was tested with parametric methodsnamely, Fisher's exact test (analysis of variance or covariance) for quantitative variables and qualitative variables with more than two techniques and the Student's t test for qualitative variables with two categories.
Regarding the absolute value of the relative volume of variation (AVRVV), we considered two groups of benign nodules: group A comprised benign nodules for which AVRVV was less than 10%, and group B comprised benign nodules with 10% or higher AVRVV values. The effects of potential factors of variability were compared in these two groups of nodules using the Student's t test for normally distributed quantitative variables such as the initial nodule size, the Mann-Whitney test for other quantitative variables such as the interscan interval, and the chi-square test for categoric variables such as the nodule type.
|
|
|
|
|
|---|
Malignant Nodules
Seven of the 11 malignant nodules corresponded to primary lung carcinomas
and four to metastases. The interscan interval ranged from 0.8 to 6 months
(median, 1.9 months). The relative volume variation of the malignant lesions
ranged from 22% to 462% (mean, 102%; median, 55%). The diameter variation,
measured with electronic calipers on a PACS screen, was more than 2 mm for six
of the 11 nodules and less than 1 mm (not significant [NS]) for the other five
nodules (Figs. 1A,
1B,
1C,
2A,
2B, and
2C). These five nodules were
rescanned within 8 weeks after the baseline CT examination, at the time of
core biopsy or surgical resection. On this basis, the sensitivity of the
software-calculated doubling time for malignancy was 91% (95% CI, 0.59-1.00),
whereas the sensitivity of manual diameter-change measurement was 54% (95% CI,
0.23-0.83).
|
|
|
|
|
|
The doubling time was less than 500 days in five false-positive cases (range, 138-396 days). Thus, the specificity of the software-calculated doubling time was 90% (95% CI, 0.79-0.97). Two of these cases were due to erroneous automated nodule segmentation because of pleural or mediastinal connections that were evident on visual analysis. The remaining three cases were due to partial volume averaging at the top of the nodule modifying the segmentation from the first CT examination to the second. Except for two infectious nodules that shrank, the other 50 benign nodules remained stable over time (Figs. 3A, 3B, 3C, 4A, and 4B) and their relative volume variation did not correspond to a real volume change. Twenty-nine of these 50 nodules had a difference in tube current (mAs) exposure of more than 50 mAs between the two scans. In 15 cases, only one scan involved contrast administration.
|
|
The relative volume variation values of these 50 benign nodules ranged from -50% to 38% (mean, -4.4%) and were normally distributed. No correlation was found between the relative volume variation and the initial nodule size (p = 0.84, F test, NS) (Fig. 5A), the interscan interval (p = 0.59, F test, NS), the nodule type (p = 0.20, F test, NS) (Fig. 6), differences in contrast administration (p = 0.12; Student's t test, NS), or differences in tube current settings (p = 0.58, Student's t test, NS).
|
|
|
|
|
|
|
|---|
Automated volumetric measurement methods are more reproducible [17], a key factor in serial comparisons, than manual measurements. However, volume measurements may be influenced by the level of inspiration or the patient's position and by differences in successive CT protocols, even though protocols are, in principle, standardized. In our institution, the CT protocol for nodule follow-up consists of 1.25-mm-thick helical acquisitions performed without contrast administration, with the exposure parameters depending on the patient's weight (70 mAs for a 70-kg patient). However, only 21 of the 50 benign nodules in this study were examined with a less than a 50-mAs difference between the two scans, and contrast administration status was identical between the two scans in only 35 cases.
Here, we evaluated the capacity of a volumetric software program to distinguish between malignant and benign solid nodules on the basis of their doubling times. Assuming a malignant doubling time cutoff of 500 days, we obtained a sensitivity and specificity of 91% and 90%, respectively. This time of 500 days is generally accepted as the upper limit of the doubling time for malignant pulmonary lesions, even though some such lesions grow more slowly (especially some adenocarcinomas in elderly patients) [18]. One nodule in our study, corresponding to an adenocarcinoma, had a software-calculated doubling time of 646 days. The use of a higher malignant doubling time would have increased the number of false-positives in our study. With a 700-day cutoff for malignant doubling, there would have been no false-negatives but three additional false-positives, reducing the specificity from 90% to 85%. The mean doubling time for the 11 malignant cases in our study was 164 days, a value close to the 149 days reported by Hasegawa et al. [16] for malignant solid nodules. Our series did not include any partially solid nodules or nodules with ground-glass opacities (likely bronchioloalveolar carcinomas), which are slow growers, with reported doubling times of more than 1,000 days [16].
The manually measured diameter variation was less than 1 mm (i.e., below the manual measurement error) for five of the 11 malignant nodules identified here. These lesions would have been left to follow-up rather than diagnosed as malignancies if they had been evaluated only on the basis of their interscan diameter change within 8 weeks after baseline CT. This underlines the need to use another measurement method, such as volumetry, for at least short interscan intervals.
The median interval between baseline and follow-up CT was less than 2 months for the 11 malignant nodules in this study. This suggests that the recall time for the first follow-up CT may be shortened when volumetric software is used.
We found no correlation between the relative volume variation of benign nodules and any of the potential factors of variability we tested, such as differences in tube current exposure or contrast administration between CT protocols, the interscan interval, the nodule type, or the nodule size on baseline CT. The relative volume variation was expected to correlate with initial nodule size in our evaluation, particularly because Yankelevitz et al. [13] have found that the relative error in volumetric measurements of nodule phantoms depends on nodule size. However, when evaluated on real nodules, this is not observed. In the study by Revel et al. [17], in which three radiologists performed three consecutive measurements of the nodules' volumes on the basis of a single CT acquisition, the software repeatability was not correlated with the nodules' sizes.
This was also observed by Wormanns et al. [19], on the basis of two consecutive CT acquisitions performed at a 10-minute interval in patients with pulmonary metastases. That study comprised 151 nodules with a mean diameter of 7.4 mm. The volume measurement error between the two CT scans did not correlate with the mean nodule volume in the study of Wormanns et al.
Because the relative volume variation of benign nodules is not influenced in our study by nodule characteristics or differences between the successive CT protocols, the main sources of variation appear to be differences in patient positioning or in the degree of inspiration. Because the software segmentation process is partly based on a density threshold, it may be affected by factors modifying the density of the nodule environment, such as the amount of air around the nodule. It would be useful if subsequent versions of volumetric software indicated the variation in whole-lung volume between the two CT acquisitions used for doubling-time calculation.
This study has several limitations. One is the relatively small number of nodules evaluated (n = 63). There are two main reasons for that: One is that diagnostic validation of benign lesions requires at least 2 years of stability on CT follow-up. We could not include nodules with a shorter CT follow-up. The second reason is that we evaluated no more than two nodules per patient to avoid a potential patient-effect bias.
Another limitation is the relatively small proportion of malignant lesions in our series, in which only 11 (17%) of 63 nodules were malignant. However, this proportion reflects that seen in daily practice. Among the large number of nodules found on CT, most are benign. Furthermore, most malignant nodules are identified during initial imaging studies and (fortunately) are not referred for follow-up. The nodules in our series were selected if they were solid, non-calcified, less than 2 cm and if two CT scans obtained with a 1.25-mm collimation and a standard reconstruction algorithm were available, together with the final diagnosis. We believe this is the right approach to obtain a nonbiased series. A series with a majority of malignant lesions would have been strongly biased and could not have been used to estimate the specificity of the test.
The final diagnosis of the benign or malignant nature of the nodules was based on histology only for the 11 malignant lesions and for one benign nodule. For the remaining benign nodules, diagnosis was mainly based on their long-term2 years or morestability (42 nodules). This has the potential risk of misdiagnosing malignancy in some slowly-growing lung cancers, and thus of overestimating the sensitivity of the doubling time in our study. However, slow growth mainly concerns non-solid lesions that were not included in our series, which evaluated only solid nodules.
The imaging examinations were performed on a 4-MDCT scanner, which is no longer state-of-the-art. However, this technology allows one to acquire 1.25-mm-thick slices, which are considered adequate for accurate segmentation with the software we used. It would be of interest to determine whether nodule segmentation is better with 0.625-mm-thick slices. Partial volume effects on nodule borders should be less marked, especially for small nodules; thus, the segmentation should be improved.
Finally, the retrospective nature of the study means that we are unable to propose an optimal recall time for evaluating the growth of lung nodules. One theoretic advantage of using dedicated volumetric software would be to detect eventual growth earlier. The median interscan interval for the malignant lesions in our series was only 1.9 months, which is shorter than the traditional 3-month interval used for first CT follow-up, on the basis of recommendations in lung cancer screening studies. Patient management was not modified in this series because the software was used retrospectively. However, our results have led us to reevaluate high-risk patients with nodules close to 1 cm in diameter earlierat 8 weeks instead of 3 months. Nodules stable at 3 months are reevaluated only at 1 and 2 years; the 6-month manual diameter follow-up is no longer performed. Prospective studies are needed to determine the optimal first and subsequent recall times for software volumetric measurements.
In conclusion, this study shows that software-calculated doubling time, estimated on the basis of a second CT study performed a median of 3.7 months after the first, has a negative predictive value of 98% for malignancy in small, solid, initially indeterminate pulmonary nodules. Prospective studies are needed to determine the optimal recall time when using dedicated volumetric software.
|
|
|---|
This article has been cited by other articles:
![]() |
C. Rampinelli, E. De Fiori, S. Raimondi, G. Veronesi, and M. Bellomi In Vivo Repeatability of Automated Volume Calculations of Small Pulmonary Nodules With CT Am. J. Roentgenol., June 1, 2009; 192(6): 1657 - 1661. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Gavrielides, L. M. Kinnard, K. J. Myers, and N. Petrick Noncalcified Lung Nodules: Volumetric Assessment with Thoracic CT Radiology, April 1, 2009; 251(1): 26 - 37. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Xu, H. J. van der Zaag-Loonen, M. Oudkerk, Y. Wang, R. Vliegenthart, E. T. Scholten, J. Verschakelen, M. Prokop, H. J. de Koning, and R. J. van Klaveren Smooth or Attached Solid Indeterminate Nodules Detected at Baseline CT Screening in the NELSON Study: Cancer Risk during 1 Year of Follow-up Radiology, January 1, 2009; 250(1): 264 - 272. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Wang, R. J. van Klaveren, H. J. van der Zaag-Loonen, G. H. de Bock, H. A. Gietema, D. M. Xu, A. L. M. Leusveld, H. J. de Koning, E. T. Scholten, J. Verschakelen, et al. Effect of Nodule Characteristics on Variability of Semiautomated Volume Measurements in Pulmonary Nodules Detected in a Lung Cancer Screening Program Radiology, August 1, 2008; 248(2): 625 - 631. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Beigelman-Aubry, P. Raffy, W. Yang, R. A. Castellino, and P. A. Grenier Computer-Aided Detection of Solid Lung Nodules on Follow-Up MDCT Screening: Evaluation of Detection, Tracking, and Reading Time Am. J. Roentgenol., October 1, 2007; 189(4): 948 - 955. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |