For improvement of the outcomes of the diagnosis and treatment of SPNs, two quantitative prediction models have been developed to facilitate the risk assessment of malignancy in patients with the most troublesome subset of indeterminate pulmonary nodules—namely, those that are radiologically indeterminate. The first is the Mayo Clinic model, which is based on a logistic regression analysis and considers six independent predictors of malignancy: older age, history of smoking, history of an extrathoracic cancer more than 5 years before nodule detection, larger nodule diameter, upper lobe location, and spiculated margins [
2]. The second is the Veterans Affairs Cooperative Study model, or VA model, which identified four independent predictors—older age, history of smoking, larger nodule diameter, and shorter time since quitting smoking—using logistic regression analysis [
3]. As shown by Schultz et al. [
4], both the Mayo Clinic and VA models predict the pretest probability of malignant SPNs with accuracy that is sufficient to make them clinically useful, especially given that neither model is intended to be used as a stand-alone diagnostic test.
Although similar, both models are slightly different. In particular, the VA model included a positive smoking history, older age, and a large nodule diameter together as an independent predictor of a malignant SPN. However, in the VA model, the adjusted odds ratio for a positive smoking history was higher than in the Mayo Clinic model. Conversely, the Mayo Clinic model excluded patients with a history of lung cancer or a history of an extrathoracic cancer within 5 years because these would be strong predictors of malignant SPNs, whereas in the VA model, these factors were not independent predictors of malignancy. Finally, in the VA model, two other variables—specifically, spiculation and a remote history of extrathoracic cancer—were not included.
Materials and Methods
Study Population
From a single institutional database, between January 2009 and July 2010, 59 consecutive oncologic patients (mean age ± SD, 67 ± 10 years), with indeterminate solid lung nodules on previous CT images who underwent PET/CT within a few months were selected. Of patients selected, 14% had history of head and neck cancer, 27% of gastrointestinal cancer, 19% of breast cancer, 22% of lung cancer, 7% of melanoma, and 11% of other cancers. The mean time between cancer diagnosis and PET/CT was 38 ± 38 months (range, 1–119 months), and the mean period between last therapy (surgery or chemotherapy or other) and PET/CT was 32 ± 31 months (range, 1–119 months). At the time of diagnosis, 17 patients had cancers at stage I, seven at stage II, five at stage III, and four at stage IV, and 26 had cancers with stage unknown. At the time of PET/CT, all patients were considered to be disease free or to have no progression of disease, according to reviewed charts. All patients had received regular clinical and imaging follow-up evaluations at our center. Twenty-four (41%) patients were smokers, and no patients had a history of pneumonia or other inflammatory lung pathology at the time of CT and PET/CT. At least two serial CT studies were performed before PET/CT for each patient.
All patients were referred to PET/CT for characterization of lung nodules within 3 months of chest CT. Features contributing to the indeterminate nature of the lung lesions were unclear morphologic features, such as margins, contour, attenuation value, and growth pattern. Patients with benign calcification or without a definitive clinical diagnosis were excluded from the study. This study was approved by the local institutional review board for retrospective analysis.
Pretest Probability of Malignancy
In patients with an SPN, we calculated the likelihood of malignancy according to the Mayo Clinic and VA models in a manner similar to that used by Schultz et al. [
4]. The following equation was computed for the Mayo Clinic model. The pretest probability of a malignant SPN was
ex/(1 +
ex) for
x = –6.8272 + (0.0391 × age) + (0.7917 × smoke) + (1.3388 × cancer) + (0.1274 × diameter) + 1.0407 × spiculation) + (0.7838 × upper), where
e is the base of the natural logarithm, “age” indicates the patient's age in years, “smoke” indicates smoking history in pack-years, “cancer” indicates history of an extrathoracic cancer 5 or more years before nodule identification (0, absent; 1, present in history), “diameter” indicates the measurement in millimeters of the largest nodule, “spiculation” indicates mention of nodule spiculation on any imaging test report (0, absent; 1, present on report), and “upper” indicates location of the nodule within the upper lobe of either lung (0, absent; 1, present in upper lobe) [
3].
For the VA model, the respective equation was as follows. The pretest probability of malignancy was
ex/(1 +
ex) for
x = –8.404 + (2.061 × smoke) + [(0.779 × age)/10] + (0.112 × diameter) – [(0.567 × quit)/10], where
e is the base of the natural logarithm, “smoke” indicates smoking history in pack-years, “age” indicates the patient's age in years at the time of nodule identification, “diameter” indicates the measurement in millimeters of the largest nodule, and “quit” indicates the number of years since quitting smoking [
4]. Because of the limitations of the Veterans Affairs Cooperative Study database, in this model, we did not include study participants who had cancer diagnosed more than 5 years before nodule detection.
Therefore, on the basis of the variables conceived in each model, we calculated the likelihood of malignancy in 31 and 26 patients in accordance with the Mayo Clinic and VA models, respectively. Patients with multiple lung nodules at CT were a priori considered to be at high likelihood of malignancy [
8,
9]. According to the guidelines for management of small pulmonary nodules, the pretest probability of malignancy was classified as low when less than 5%, intermediate when 5–60% and high when greater than 60% [
3,
9]. Thus, a separate analysis for patients with low (< 5%), intermediate (5–60%), and high (> 60%) pretest probabilities of malignancy was performed. Thereafter, patients were reclassified according to PET/CT results. Furthermore, for elaboration of the broad category of intermediate likelihood of malignancy, a further separate analysis considering the following categories (very low, < 5%; low, < 5–40%; intermediate, > 40–60%; and high, > 60%), as suggested by Gould et al. [
5], was computed.
CT Examination and Interpretation
All patients underwent CT of the chest, abdomen, and pelvis to identify metastases. All chest CT examinations were acquired from the lung apices through the lung bases using 8–MDCT (Eclos, Hitachi Medical) or 40–MDCT (Somatom Definition AS, Siemens Healthcare) and using the following parameters: section thickness, 2.5 or 3 mm; reconstruction, 2.5 or 2 mm; gantry rotation time, 0.8 or 0.5 second; pitch, 0.7 or 0.8; tube potential, 120 kV; and current–exposure time product setting adjusted for body weight. All patients received IV contrast medium (Omnipaque 350, GE Healthcare) at a dose of 2 mL/kg injected at a flow rate of 3 mL/s.
CT images were reviewed and interpreted by two radiologists with 3 and 5 years of experience in chest CT interpretation. Both specialists were presented with the patients’ clinical history but were unaware of histologic findings. Pulmonary nodules were assessed for size, number, location (upper, middle, or lower lobe), and laterality (right or left lung). Nodule size was defined as the largest diameter measured with electronic calipers on the CT images. Moreover, the nodules were classified according to their size, their margins (smooth or irregular), attenuation (solid or partly solid vs ground-glass opacity), calcification (absent or present) and growth. For the purpose of the present study, only solid lung nodules were included.
PET/CT Examination and Interpretation
Whole-body FDG PET/CT was performed using a dedicated PET/CT scanner (Biograph 16 HT, Siemens Healthcare). The PET component is a high-resolution scanner with a spatial resolution of 4.7 mm and has no septa, thus allowing 3D-only acquisitions. The CT portion of the scanner is the Somatom Sensation 16-slice CT. Together with the PET system, the CT scanner is used both for attenuation correction of PET data and for localization of FDG uptake. All patients were instructed to fast for at least 6 h before the integrated PET/CT examination. After injection of 300 MBq (8.1 mCi) of FDG per kilogram body weight, patients rested for 60 min in a comfortable chair. Emission images from the proximal femur to the base of the skull were acquired for 3 minutes per bed position. Acquired images were reconstructed using an attenuation-weighted ordered subset expectation maximization iterative reconstruction, with two iterations and eight subsets. Fourier rebinning was used to reduce the 3D dataset to a 2D equivalent dataset, and a 4-mm full width at half-maximum gaussian filter was applied to the image after reconstruction along the axial and transaxial directions. The data were reconstructed over a 128 × 128 matrix with 2-mm pixel size and slice thickness.
Two nuclear medicine physicians with 3 and 5 years of experience of PET interpretation independently evaluated the images. At visual analysis, lung nodules were defined as malignant if they showed hypermetabolic activity of FDG higher than the metabolic activity of the other regions. Conversely, no significant FDG uptake was used for defining a negative scan. In case of discordant imaging findings between the two readers, a third nuclear medicine physician was required for the consensus. Maximum standardized uptake value (SUV
max) of the lung lesion was determined by drawing isovolumetric regions of interest on the attenuation-corrected FDG PET/CT images for each suspected lung nodule. SUV
max was calculated as the ratio of regional radioactivity concentration divided by the administered radioactivity normalized to body weight [
10]. In case of multiple nodules, the region of interest was drawn around the largest lesion. A separate analysis was performed using the SUV
max threshold of 4.0 in accordance with Grgic et al. [
11].
Net Reclassification Improvement
New methods have recently been proposed to evaluate and compare predictive risk models. These are based primarily on stratification into clinical categories on the basis of risk in an attempt to assess the ability of new models to more accurately reclassify individuals into higher or lower risk strata. Since its first description in 2006, much interest has been generated in reclassification, and although the approach is relatively new, there have been further methodologic developments. Researchers in the fields of breast cancer, diabetes, genetics, and clinical cardiology have published articles on these techniques [
12]. The net reclassification improvement assesses risk reclassification and is the difference in proportions moving up and down risk strata among case patients versus control participants—that is, those who did or did not develop the disease during follow-up [
13]. Net reclassification improvement is calculated as follows: [
Pr(up /
cases) –
Pr(
down /
cases)] – [
Pr(up /
controls) –
Pr(
down /
controls)], where
Pr stands for probability; the net reclassification improvement is similar to simple percentage reclassification, but it distinguishes correct direction of changes (up for case patients and down for control participants). Ideally, the predicted probabilities would move higher (up a category) for case patients and lower (down a category) for control participants.
Net reclassification improvement was calculated as a sum of two separate components: individuals with events and individuals without events. An “event” refers to the presence of malignancy (i.e., a true-positive). For events, we assigned 1 for upward reclassification, –1 for downward reclassification, and 0 when people did not change their risk category. The opposite was done for nonevents. The summation of the individual scores and division by numbers of people in each group were used to obtain the value of net reclassification improvement. Assuming independence between event and non-events among individuals and following McNemar logic for significance testing in correlated proportions, we used a simple asymptotic test for the null hypothesis where the net reclassification improvement was 0 (
z test) [
13].
Final Outcome
Final diagnosis was established by histopathology or by CT follow-up. The diagnosis of malignant lung lesions was made in the following situations: when histopathologic analysis of surgical specimen or histology obtained by needle biopsy was positive for secondary localization or for a primary cancer; and when a significant increase (diameter increase by ≥ 25%) in size or number (or both) of pulmonary nodules was found at imaging studies during follow-up. The nodules that did not change during follow-up or that spontaneously resolved without employing therapy were considered to be benign lesions. The observation follow-up period was at least 2 years. The characterization of lung nodules was verified by histology specimens in 28 patients and imaging studies in the remaining 31 patients. The difference between correct and incorrect reclassification according to patient outcome was defined as the net reclassification improvement.
Statistical Analysis
Continuous variables were expressed as a median (range) and categoric data as a percentage. Comparisons between groups were performed by Mann-Whitney U test, chi-square test with Yates correction, and McNemar test, as appropriate. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and diagnostic accuracy for the detection of the lung lesions were calculated using standard methods. A two-sided p value was used, and p < 0.05 was considered statistically significant (Advanced Models version 15.0, SPSS).
Discussion
Risk stratification with clinical prognostic models may be useful for advising patients and making treatment decisions. The distribution of predicted values for a given model may indicate the number of patients who should be classified into each risk category but not whether this is done correctly [
14]. It is difficult to determine which of the two models, the Mayo Clinic or the VA model, is better at classifying pulmonary nodules in individuals or to determine whether individual risk estimates differ between the two models. One may examine comparative distributions through risk reclassification as a novel marker for the incremental value of a test. This represents the primary endpoint of the present study. We show that PET/CT is able to further stratify the presence of pulmonary malignancy in patients with low- to indeterminate-risk lung nodules and history of cancer. In particular, a substantial number of pulmonary nodules considered as having low or intermediate pretest likelihood of malignancy had abnormal findings on PET/CT that were associated with malignancy.
Bayesian analysis involves the use of likelihood ratios for imaging findings and clinical features associated with SPNs to establish probability of malignancy [
15,
16]. A general Bayesian model principle is that each characteristic used in the development of the model is conditionally independent of all others. Several previous studies have calculated the likelihood ratios for a range of values for clinical variables in predicting malignancy. Clinical variables that have been correlated with the probability of malignancy include baseline incidence of malignancy, age, smoking history, nodule size, nodule edge characteristics, and presence or absence of occult calcification on CT densitometry. Positive findings on PET/CT appear to represent a significantly better indicator of malignancy compared with findings on CT.
Clinical features—including symptoms, physical examination findings, and laboratory results—are usually nonspecific and may not be able to stratify risk between benign and malignant conditions. Bronchoscopy has limited usefulness in a patient with an SPN. Transthoracic needle aspiration biopsy with fluoroscopic CT or ultrasound guidance is diagnostic in 80–95% of malignant nodules. An indeterminate biopsy report, however, may not exclude malignancy [
17]. The objective of PET before exploratory thoracotomy in patients with benign solitary nodules is to determine appropriateness for surgery, as well as to prevent complications that occur in a significant percentage of patients.
We found a prevalence of lung malignancy in 52% of the patients enrolled in the present study. SPNs have been documented to have a 40% malignancy rate [
18,
19]. This latter percentage varies with age, smoking history, location, and patient selection [
20,
21]. We considered a subset of oncologic patients who represent a high-risk category. In our study, PET/CT showed different diagnostic accuracies based on the type of lung nodules. For example, a similar sensitivity was registered for SPNs and multiple lung lesions (72% vs 78%, respectively), whereas a lower specificity was reported for multiple nodules than for a single nodule (85% vs 92%, respectively). The rate of false-positives can be attributed to abnormal
18F-FDG uptake in benign lesions, including infections such as histoplasmosis and tuberculosis, hamartoma, lung cysts, and vascular abnormalities. Respiratory system toxicity is a common side effect and complication of anticancer therapy. Indeterminate lung nodules in patients with cancer history should be characterized in a minimally invasive manner. The accuracy of PET/CT in our study was high, which is in accordance with previous data [
22].
Pulmonary metastases are common. Necropsy studies have reported evidence of pulmonary metastases in 50% or more of patients with sarcoma, melanoma, and cancers of the breast, prostate, thyroid, uterus, and kidney [
23,
24]. Our study found a substantial rate of metastatic disease (39%) and primary lung cancer (14%). Nearly half of the patients in the present study who had lung metastases also had breast or gastrointestinal tumors. By contrast, in accordance with Quint et al. [
25], primary lung cancer was found to be more frequent in patients with head and neck cancer, although these patients are thought to be at a greater risk for metastatic disease.
We opted to study lung nodules of 5 mm or more in diameter, which includes lesions that challenge the spatial resolution limits of PET/CT and of both the Mayo Clinic and VA models. The VA model considers only those lung nodules with diameter of 7–30 mm, whereas the Mayo Clinic model has been applied to patients with newly discovered 4- to 30-mm radiologically indeterminate SPNs. In the present study, only five patients had an SPN with a diameter of 5 mm or more. Therefore, based on the variables conceived in each model, we calculated the likelihood of malignancy in 31 patients and 26 subjects in accordance with the Mayo Clinic and VA models, respectively. Bryant and Cerfolio [
26] showed that FDG PET/CT may be useful for the management of nodules less than 10 mm in diameter. Pomerri et al. [
27] concluded that a diameter greater than 5 mm and irregular margins were strong predictors of nodule growth by multiple stepwise regression analysis, whereas calcification was associated with a very low likelihood of progression.
As discussed by Schultz et al. [
4], both the Mayo Clinic and VA models appear to be sufficiently accurate to assist in making clinical decisions about the choice and interpretation of subsequent diagnostic tests. The accuracy of both models was similar to that reported in the original articles describing their development [
2,
3]. The Mayo Clinic model was slightly more accurate than the VA model, but this difference was not statistically significant. Furthermore, the Mayo Clinic model is probably a better choice in practice settings with a low prevalence of malignant lung nodules; conversely, the VA model may be a better choice in settings with a high prevalence of malignant SPNs. From the present analysis, on the basis of the Mayo Clinic model, it appears that all subjects with intermediate likelihood of malignancy (> 40–60%) had positive findings on PET/CT and a true malignancy at follow-up. By contrast, on the basis of the VA model, the majority of patients with very low (100%) or low (41.2%) probability of malignancy had positive findings both on PET/CT and at follow-up. This is in line with the concept that the VA model is preferred in patients with high probability of disease. We used both models because they are consistently complementary. Nevertheless, the VA model is missing some variables, such as spiculation and remote history of thoracic cancer, that are present in the Mayo Clinic model, and the Mayo Clinic model is missing the variable in the VA model that indicates the number of years since quitting smoking. Moreover, as previously stated, the Mayo Clinic model was conceived for patients with a broader spectrum of nodule diameter than was the VA model.
Physicians may overestimate the probability of a malignant lesion in patients with low risk of malignant disease. According to our results, FDG PET can be considered to be helpful in low-risk patients (i.e., those with a high NPV of normal findings on FDG PET, as reported in
Table 4 for the Mayo Clinic model). In patients with intermediate risk, biopsy is still recommended. If biopsy is impractical or yields an equivocal result, FDG PET can be helpful as an additional examination. A positive finding on FDG PET requires further histolopathologic confirmation [
28]. In patients with low to moderate pretest probability of malignancy and an intermediate SPN that measures at least 8–10 mm in diameter, the American College of Chest Physicians recommends FDG PET to characterize the SPN [
29]. As reported by Herder et al. [
7], predictive models tended to underestimate the probability of malignancy, particularly with lower-risk patients. In the present study, we encountered positive findings on PET/CT in 28% and 80% of patients otherwise classified as low risk for the Mayo Clinic and VA models, respectively. In a study by Gould et al. [
3], PET was used to refine the management of patients after the pretest probability of cancer was determined. If the pretest probability was low (20%) and findings on PET were negative, then the posttest probability of malignancy was less than 2% and follow-up was suggested. However, when findings on PET are negative and the pretest probability is relatively high (65%), the calculated posttest probability increases to more than 10%, and needle-biopsy or video-assisted thoracic surgery biopsy should still be considered. Therefore, PET can be more useful for refining the management of patients at low risk, rather than those at high risk of malignancy.
Our findings show that the addition of PET/CT results in reclassification of approximately 59% of the sample, with a net reclassification improvement ranging from 0.23 to 1.6. The net reclassification improvement in patients with proven lung malignancy ranged from 0.23 to 1.1, whereas the net reclassification improvement for patients without secondary or primary lung cancer ranged from 0.23 to 0.5. These results suggest that, when applied to a high-risk population with suspicion for primary or secondary lung cancer, an FDG PET/CT–adjusted strategy may effectively identify more individuals who had cancer but it can identify many other individuals at higher risk who do not have malignancy (13 of 15 patients considered to be at high risk by CT were correctly stratified as being at low risk by PET/CT). PET is most useful in the pretest probability range of 5–60% [
30,
31], as illustrated in
Table 4. Moreover, PET may be helpful in the detection of mediastinal lymph node metastases, even when the nodes are not enlarged on CT [
32], and for detection of occult extrathoracic metastasis and of synchronous extrathoracic primary malignancies.
Grgic et al. [
11] investigated the prognostic value of SUV in SPN for survival. They found that the highest diagnostic accuracy was achieved with an SUV of more than 4.0 (sensitivity, specificity, and accuracy all of 85%) whereas visual interpretation achieved corresponding values of 94, 70, and 84%, respectively. In the current study, we observed that more than the half of patients at intermediate to high risk of malignancy had an SUV
max greater than 4.0, but approximately 40% of patients at intermediate risk had a low SUV
max. Therefore, SUV
max may not be predictive of malignancy in this setting. In comparison with visual image analysis, a semiquantitative approach has the conceptual advantage of objectiveness. Apostolova et al. [
33] reported that correction of motion blur is mandatory for accurate SUV quantification of SPNs. Motion blur causes relevant (≥ 30%) underestimation of actual SUV in most SPNs of up to 30 mm in FDG PET/CT. However, the results of SUV measurements are also prone to heterogeneity owing to prevailing differences in data acquisition and reconstruction methods [
34] and are in relation with previous treatments.
The most important limitation of the present study is that histologic diagnosis was made in only 28 of the 59 patients. As previously stated, small nodule size, respiratory misregistration, and inaccurate SUVmax assessment owing to volume averaging may represent additional limitations.
In conclusion, in oncologic patients with low and intermediate likelihood of lung nodule malignancy, FDG PET/CT can be helpful for excluding the presence of primary or metastatic malignant lung lesions. FDG PET/CT is most efficacious in patients with low to intermediate probability of malignancy. A well-structured clinical trial would be important to confirm the utility of FDG PET/CT for the evaluation of radiologically indeterminate lung nodules, in both patients with and those without history of malignancy.