Predictive Value of Chest CT in Patients with Cystic Fibrosis: A Single-Center 10-Year Experience
Abstract
OBJECTIVE. The objective of our study was to assess the accuracy of one of the most used scoring systems, the Bhalla scoring system, in the detection of lung impairment in patients with cystic fibrosis and in the prediction of cystic fibrosis progression.
MATERIALS AND METHODS. From the database of our center, 300 CT examinations performed between 1991 and 2001 were reviewed. Pulmonary function tests performed the same day as radiologic assessment were available. Of this group, 145 examinations were retrospectively included, referring to 87 patients with confirmed cystic fibrosis and a mean age (± SD) of 15.6 ± 8.4 years (range, 9 months-38 years). Thirty patients underwent one CT examination, 56 underwent two examinations, and one patient underwent three examinations. The mean interval between two examinations was 36.5 months. The 145 examinations were independently reviewed by three radiologists who were blinded to the clinical and pulmonary function test results. The CT examinations were assessed using the scoring system proposed by Bhalla and colleagues.
RESULTS. CT assessed using the Bhalla scoring system is mildly correlated with functional pulmonary test results and has high interobserver reproducibility. The CT score significantly changed between scans obtained in a mean interval of 36.5 months, whereas functional pulmonary test results did not, suggesting that CT is more sensitive than function tests for detecting small changes. However, the variation in CT scores did not predict progression of functional pulmonary test results or progression of CT findings between scans.
CONCLUSION. CT assessment based on the Bhalla scoring system is more sensitive than pulmonary function tests in detecting initial morphologic changes. However, we found no evidence of the predictive value of CT.
Introduction
The rationale for performing follow-up of the lungs in patients with cystic fibrosis relies on the high incidence of pulmonary insufficiency, which is the primary cause of death in 90-95% of patients [1, 2]. Patient status is assessed with clinical examination through the Shwachman-Kulczycki or other scoring systems [3-5]. Although these clinical scales are useful, they are not completely and directly correlated with pathologic changes of the lung [6, 7]. This is also true for laboratory tests, which are useful for the diagnosis of lung disease and the detection of complications. Pulmonary function tests are additional tools for lung evaluation that do not supply pathologic information.
Radiologic techniques provide morphologic information. Since the early application of chest radiography in patients with cystic fibrosis, a scoring system was developed to improve the accuracy and reproducibility of the examination at follow-up [8-10].
CT was also proposed for the follow-up of lung impairment in patients with cystic fibrosis [11]. As with chest radiography, a correlation between CT using a scoring system and functional status was found [7, 11, 12].
The CT examination rate that has been proposed in the follow-up of patients with cystic fibrosis is every 4-5 years. However, the data that are available refer to small populations [6, 11], acute settings [3], or short-term or one-shot evaluation and correlation.
The aims of this longitudinal retrospective study were to determine the correlation between CT using the Bhalla scoring system and pulmonary function test results and to assess the role of CT in predicting disease progression.
Materials and Methods
From the database of the Centre of Respiratory Pathophysiology at the University of Parma, 300 CT examinations of patients with cystic fibrosis performed between 1991 and 2001 were reviewed. Our institution approved this study, and informed consent was not required because of the retrospective nature of this investigation.
All scans were obtained in the follow-up set ting; patients with acute cystic fibrosis were excluded. Inclusion criteria were, first, CT examination with lung and mediastinal windows readable; and, second, pulmonary function tests performed the same day as radiologic assessment. The pulmonary function test results had to provide at least forced vital capacity (VC) and forced expiratory volume in the first second (FEV1). CT examinations with im ages that were not of adequate quality and incomplete examinations were excluded, and cases with missing pulmonary function test results were also excluded.
Of this group of 300 examinations, 145 CT examinations were included in this retrospective study. These 145 examinations referred to 87 patients with a mean age of 15.6 years (range, 9 months-38 years). All patients had a confirmed dia gnosis of cystic fibrosis obtained with a sweat electrolyte test. Thirty patients underwent one CT examination, 56 underwent two examinations, and one patient underwent three examinations.
The mean interval between two examinations was 36.5 ± 8 months (range, 16.1-55.6 months). The CT examinations were performed on different scanners because of the constantly updating technology of the radiology department. We included in our population patients who underwent a sequential high-resolution scanning protocol of the thorax with standard acquisition parameters (slice thickness, 1 or 1.5 mm; scan interval, 10 or 20 mm) and re construction parameters (high-frequency kernel [B70]; lung window settings [width, -550 H; level, 1,500 H]). All examinations were performed at full inspiration, a respiratory status that is less affected by variability in the dimensions of the airways.
The 145 examinations were independently reviewed by three board-certified radiologists with at least 3 years' experience in thoracic imaging. The radiologists used the Bhalla scoring system [11] to evaluate lung impairment (Table 1) and were blinded to clinical and pulmonary function test results. The examinations were assessed using the scoring system shown in Table 1. Bronchiectasis and emphysema were identified according to criteria described elsewhere [13, 14]. Each of the radiologists provided a CT score using the Bhalla scoring system, and the overall mean CT score was obtained.
Score | ||||
---|---|---|---|---|
Category | 0 | 1 | 2 | 3 |
Severity of bronchiectasis | Absent | Mild (luminal diameter slightly greater than diameter of adjacent blood vessel) | Moderate (lumen 2-3 times the diameter of the vessel) | Severe (lumen > 3 times diameter of vessel) |
Peribronchial thickening | Absent | Mild (wall thickness equal to diameter of adjacent vessel) | Moderate (wall thickness greater than and up to twice the diameter of adjacent vessel) | Severe (wall thickness > 2 times the diameter of adjacent vessel) |
Extent of bronchiectasis (no. of BP segments) | Absent | 1-5 | 6-9 | > 9 |
Extent of mucus plugging (no. of BP segments) | Absent | 1-5 | 6-9 | > 9 |
Sacculations or abscesses (no. of BP segments) | Absent | 1-5 | 6-9 | > 9 |
Generations of bronchial divisions involved (bronchiectasis or plugging) | Absent | Up to 4th generation | Up to 5th generation | Up to the 6th generation and distal |
No. of bullae | Absent | Unilateral (not > 4) | Bilateral (not > 4) | > 4 |
Emphysema (no. of BP segments) | Absent | 1-5 | > 5 | |
Collapse or consolidation | Absent | Subsegmental | Segmental or lobar |
Note—CT score was calculated using this form. Add the letter “P” for plugging and “T” for peribronchial thickening when present. Subtract the CT score from 25 to determine the patient's score. BP = bronchopulmonary.
Statistical analysis was performed using a software package (SPSS version 11.0, SPSS) on a per-examination basis and on a per-patient basis. The per-examination analysis was performed by assessing the 145 CT examinations as independent statistical units on the basis of the long interscan interval. The per-patient analysis was performed to estimate progression of disease. Association indexes were obtained between patient characteristics, functional pulmonary test results, and CT scores. Differences between groups' means were assessed using the Student's t test. The two-way intraclass correlation coefficient was used to assess interobserver agreement. Linear regression was used for per-patient analysis. Statistical significance was set at a p value of < 0.05.
Results
Per-Examination Analysis
No significant correlations were observed between patient age and VC (p > 0.05), whereas significant correlations were observed between age and FEV1, age and mean CT score for the three reviewer radiologists, VC and mean CT score, and FEV1 and mean CT score (Table 2).
Correlation | Ra | pb |
---|---|---|
Age and VC | −0.11 | 0.201 |
Age and FEV1 | −0.19 | 0.027c |
Age and mean CT scored | 0.37 | 0.001c |
VC and mean CT scored | 0.39 | 0.001c |
FEV1 and mean CT scored | 0.52 | 0.001c |
Note—VC = vital capacity, FEV1 = forced expiratory volume in first second.
a
Pearson's correlation coefficient.
b
Measure of statistical significance of R.
c
Significant value.
d
Mean of three radiologist reviewers' CT scores.
Interobserver agreement and the pooled agreement were very good (p < 0.05) for CT scores between the radiologists (Table 3). No significant differences were observed between CT scores between radiologists (p > 0.05).
CT Scores of | ICC | 95% CI | pa | pb |
---|---|---|---|---|
Observers 1 and 2 | 0.9937 | 0.9913-0.9955 | 0.001c | 0.727 |
Observers 1 and 3 | 0.9938 | 0.9914-0.9955 | 0.001c | 0.294 |
Observers 2 and 3 | 0.9978 | 0.9969-0.9984 | 0.001c | 0.241 |
Observers 1, 2, and 3 | 0.9967 | 0.9957-0.9976 | 0.001c |
Note—ICC = intraclass correlation coefficient (two ways).
a
Measure of statistical significance of ICC.
b
Student's t test.
c
Significant value.
Per-Patient Analysis
There was a significant correlation between VC, FEV1, and mean CT score obtained at the first and second examinations (Table 4). There was also a significant difference between the mean CT score at scan 1 and the mean CT score at scan 2 (p < 0.05), whereas no significant differences in functional test results were observed between the two scans (p > 0.05).
Correlation | Ra | pb | Mean of Paired Differences | SD of Paired Differences | pc |
---|---|---|---|---|---|
VC at scan 1-VC at scan 2 | 0.51 | 0.001d | 3.36 | 17.37 | 0.154 |
FEV1 at scan 1-FEV1 at scan 2 | 0.71 | 0.001d | 2.45 | 16.72 | 0.281 |
Mean CT score at scan 1-mean CT score at scan 2 | 0.91 | 0.001d | −0.982 | 1.84 | 0.001d |
Note—VC = vital capacity, FEV1 = forced expiratory volume in first second.
a
Pearson's correlation coefficient.
b
Measure of statistical significance of R.
c
Student's t test.
d
Significant value.
Significant correlations of functional tests (p < 0.05) and no significant differences of scores (p > 0.05) were observed between scores obtained at the first and second examinations. Significant correlations and differences in mean CT score were observed between scores obtained at the first and second examinations (p < 0.05).
No significant correlation between variation in functional test results (VC and FEV1), the mean CT score, and the time elapsed between the first and second CT scans was seen (p > 0.05). There was no significant correlation between the variation in functional test results (VC and FEV1) and mean CT score (p > 0.05) (Table 5 and Figs. 1, 2, 3, 4 and 5).
Correlation | Regression Coefficient | p |
---|---|---|
VC at scan 2-VC at scan 1 vs time between scans | 0.045 | 0.371 |
FEV1 at scan 2-FEV1 at scan 1 vs time between scans | 0.094 | 0.247 |
Mean CT score at scan 2-mean CT score at scan 1 vs time between scans | −0.019 | 0.445 |
Mean CT score at scan 2-mean CT score at scan 1 vs VC at scan 2-VC at scan 1 | −0.158 | 0.123 |
Mean CT score at scan 2-mean CT score at scan 1 vs FEV1 at scan 2-FEV1 at scan 1 | −0.199 | 0.073 |
Note—VC = vital capacity, FEV1 = forced expiratory volume in first second.
Discussion
The rationale for performing chest CT in patients with cystic fibrosis is to obtain additional information with prognostic significance that should be used to modify clinical approach and treatment. For this purpose, standardization of the radiologic evaluation through a scoring system is crucial to obtain correlation with clinical outcome and for reproducibility.
In this study, no significant correlations were found between patient age and pulmonary function (VC), whereas VC was shown in a previous report to decline with patient age [15]. These results are probably due to the fact that the rate of decline varies among individuals and even in the same patient. In addition, pulmonary function test results are not linearly correlated with clinical status. VC and FEV1 and derived ratios are reproducible but are not particularly sensitive to the early stages of disease, which, on the contrary, are better detected using small-airways function tests [16, 17]. Recently, investigators have shown that the lack of correlation between CT data and pulmonary function test results may be due to the higher sensitivity of CT in showing changes in the small airways before impairment can be detected on functional tests [18, 19].
The per-examination analysis showed significant but rather fair correlations between the mean CT score and VC (r = 0.39) and between the mean CT score and FEV1 (r = 0.52). Helbich et al. [7] reported in 117 patients a high correlation between CT score and VC (r = 0.62) and CT score and FEV1 (r = 0.70). Santamaria et al. [6] also reported a moderate correlation between CT score and VC (r = 0.60) and CT score and FEV1 (r = 0.50) in a limited population of 30 patients. Conversely, Bhalla et al. [11] did not find significant correlations between CT score and VC (r = 0.48) or CT score and FEV1 (r = 0.42), whereas they described a significant correlation between CT score and the ratio of FEV1 to forced VC (r = 0.69). However, that study [11] was performed retrospectively on a fairly small population of patients (n = 14).
These different results and the fair correlation between CT scores and pulmonary function parameters can be explained by four factors.
The first factor is that CT does not allow one to distinguish between acute and chronic injury or to predict the reversibility of findings [20]. Therefore, acute and chronic radiologic findings are evaluated as similar [21].
The second factor is related to the fact that the Bhalla scoring system uses an ordinal scale. The same score may describe two different degrees of injury, and the intervals between scores are not necessarily equal or proportional. In addition, the characteristics that are considered in the scoring system are not weighted in accordance with their clinical importance.
The third factor is that there is large intraindividual variability and interindividual variability in the commonly used outcome measurements [17].
The fourth factor is the inclusion criteria, which may have allowed an excessive number of patients with disease at early stages to be included in the study cohorts, and the low number of patients in these studies.
It is not clear how to explain the optimal performance of the Bhalla scoring system for chest radiography providing better results than CT that has been reported in earlier investigations [9, 10, 22]. Conversely, other investigations have shown high-resolution CT to be superior to chest radiography [23, 24]. This could probably be due to the capability of CT to depict bronchiectasis, which is associated with pulmonary function tests [6, 7], whereas bronchiectasis was missed on chest radiography in 44.5% of bronchopulmonary segments and mucous plugs were missed in 91.5% of bronchopulmonary segments in one study [11]. High correlation between the chest radiography score and high-resolution CT score was found, and the authors concluded that in many cases CT results could be predicted by combining chest radiography findings with pulmonary function test results. This observation suggests that CT could be indicated in patients with mild respiratory symptoms for identifying early changes, whereas in those with stable disease, chest radiography could be the first imaging technique [6].
The interobserver agreement in our study shows that the Bhalla scoring system is highly reproducible, supporting the results of previous studies [7, 11].
The per-patient analysis showed a significant correlation and significant difference in mean CT scores between the first and second examinations. These results may suggest that the CT score is more sensitive to mild changes than functional pulmonary tests. The lack of significant differences between the functional test scores obtained at the first and second examinations explains why the CT score cannot be used to predict changes in pulmonary function test results between two examinations. The lack of significant correlation between the variations in functional parameters (VC and FEV1) and mean CT score, the time elapsed between the first and second CT scans, and the lack of significant correlation between the variations in functional parameters (VC and FEV1) and mean CT score show that these parameters cannot be used to predict the progression of disease. This finding could be indirectly confirmed by the insufficient correlation between the age-based severity curve of the Brasfield chest radiograph score and the decline of pulmonary function in a population of 230 patients [25]. In addition, this finding may be explained by the interval of 36.5 months between examinations in our study, which may not be sufficient to find significant correlation. However, the clinical role of variations in CT score, which is able to predict changes only when scans are obtained several years apart, is not clear.
Study Limitations
A limitation of this study is the use of different CT scanner technologies. This limitation is unavoidable and related to technologic updates that occurred over a period of 10 years. However, no significant differences were observed between CT scores obtained with different scanning protocols. It is unusual that a patient undergoes follow-up imaging on the same scanner as that used for the initial examination, suggesting that the present study better fits real clinical settings.
MDCT scanners allow high-resolution helical CT of the thorax in an extraordinarily short time, and recent studies have used these protocols. Volumetric acquisition permits good evaluation of small airways (up to seventh generation), and semiquantitative software can be applied to these data sets to establish bronchial wall thickening and area [26]. Other authors have reported the importance of expiratory scanning to depict air trapping; however, at the time our study began, there was no evidence on the importance of this parameter [27]. In addition, obtaining expiratory scans requires additional radiation exposure to the patient.
Our study is also limited by the lack of data obtained using a helical acquisition protocol or expiratory scanning. Another limitation is related to the retrospective nature of this study.
A collateral question, but one that cannot be ignored, is whether the additional information provided by CT counterbalances the risks related to the amount of ionizing radiation administered in a young population for a follow-up examination in asymptomatic patients.
Conclusion
In conclusion, the results of this study confirm that CT assessed with the Bhalla scoring system [11] is mildly correlated with functional pulmonary test results and has high interobserver reproducibility. The CT score significantly changed between scans obtained in a mean interval of 36.5 months, whereas functional pulmonary test results did not, suggesting that CT is more sensitive than function tests for detecting changes in the small airways. However, the variations in CT score did not predict progression of functional pulmonary test results or progression of CT findings between scans. Studies with larger populations and longer follow-up may eventually disclose the potential predictive value of CT assessed using the Bhalla scoring system over FEV1 in patients with cystic fibrosis.
Footnotes
Address correspondence to F. Cademartiri ([email protected]).
FOR YOUR INFORMATION
The AJR has made getting the articles you really want really easy with an online tool, Really Simple Syndication, available at www.ajronline.org. It's simple. Click the yellow RSS button located in the menu on the left of the page. You'll be on your way to syndicating your AJR content in no time.
References
1.
Wood RE, Boat TF, Doershuk CF. Cystic fibrosis. Am Rev Respir Dis 1976; 113:833-878
2.
FitzSimmons SC. The changing epidemiology of cystic fibrosis. J Pediatr 1993; 122:1-9
3.
Orenstein DM, Pattishall EN, Nixon PA, Ross EA, Kaplan RM. Quality of well-being before and after antibiotic treatment of pulmonary exacerbation in patients with cystic fibrosis. Chest 1990; 98:1081-1084
4.
Orenstein DM, Nixon PA, Ross EA, Kaplan RM. The quality of well-being in cystic fibrosis. Chest 1989; 95:344-347
5.
Shwachman H, Kulczycki LL. Long-term study of one hundred five patients with cystic fibrosis: studies made over a five- to fourteen-year period. AMA J Dis Child 1958; 96:6-15
6.
Santamaria F, Grillo G, Guidi G, et al. Cystic fibrosis: when should high-resolution computed tomography of the chest be obtained? Pediatrics 1998; 101:908-913
7.
Helbich TH, Heinz-Peer G, Fleischmann D, et al. Evolution of CT findings in patients with cystic fibrosis. AJR 1999; 173:81-88
8.
Reilly BJ, Featherby EA, Weng TR, Crozier DN, Duic A, Levison H. The correlation of radiological changes with pulmonary function in cystic fibrosis. Radiology 1971; 98:281-285
9.
Chrispin AR, Norman AP. The systematic evaluation of the chest radiograph in cystic fibrosis. Pediatr Radiol 1974; 2:101-106
10.
Brasfield D, Hicks G, Soong S, Tiller RE. The chest roentgenogram in cystic fibrosis: a new scoring system. Pediatrics 1979; 63:24-29
11.
Bhalla M, Turcios N, Aponte V, et al. Cystic fibrosis: scoring system with thin-section CT. Radiology 1991; 179:783-788
12.
Mastellari P, Biggi S, Lombardi A, et al. Correlation between HRCT and pulmonary functional tests in cystic fibrosis. Radiol Med (Torino) 2005; 110:325-333
13.
Naidich DP, McCauley DI, Khouri NF, Stitik FP, Siegelman SS. Computed tomography of bronchiectasis. J Comput Assist Tomogr 1982; 6:437-444
14.
Bergin C, Müller N, Nichols DM, et al. The diagnosis of emphysema: a computed tomographic-pathologic correlation. Am Rev Respir Dis 1986; 133:541-546
15.
Corey M, Levison H, Crozier D. Five- to seven-year course of pulmonary function in cystic fibrosis. Am Rev Respir Dis 1976; 114:1085-1092
16.
Taussig LM, Landau LI, Marks MI. Cystic fibrosis. In: Taussig LM, ed. Respiratory system. New York, NY: Thieme-Stratton,1984 : 115-175
17.
Ramsey BW, Boat TF. Outcome measures for clinical trials in cystic fibrosis. Summary of a Cystic Fibrosis Foundation consensus conference. J Pediatr 1994; 124:177-192
18.
de Jong PA, Lindblad A, Rubin L, et al. Progression of lung disease on computed tomography and pulmonary function tests in children and adults with cystic fibrosis. Thorax 2006; 61:80-85
19.
Judge EP, Dodd JD, Masterson JB, Gallagher CG. Pulmonary abnormalities on high-resolution CT demonstrate more rapid decline than FEV1 in adults with cystic fibrosis. Chest 2006; 130:1424-1432
20.
Shah RM, Sexauer W, Ostrum BJ, Fiel SB, Friedman AC. High-resolution CT in the acute exacerbation of cystic fibrosis: evaluation of acute findings, reversibility of those findings, and clinical correlation. AJR 1997; 169:375-380
21.
Robinson TE, Leung AN, Northway WH, et al. Spirometer-triggered high-resolution computed tomography and pulmonary function measurements during an acute exacerbation in patients with cystic fibrosis. J Pediatr 2001; 138:553-559
22.
Matthew DJ, Warner JO, Chrispin AR, Norman AP. The relationship between chest radiographic scores and respiratory function tests in children with cystic fibrosis. Pediatr Radiol 1977; 5:198-200
23.
Jacobsen LE, Houston CS, Habbick BF, Genereux GP, Howie JL. Cystic fibrosis: a comparison of computed tomography and plain chest radiographs. Can Assoc Radiol J 1986; 37:17-21
24.
Hansell DM, Strickland B. High-resolution computed tomography in pulmonary cystic fibrosis. Br J Radiol 1989; 62:1-5
25.
Cleveland RH, Neish AS, Zurakowski D, Nichols DP, Wohl ME, Colin AA. Cystic fibrosis: a system for assessing and predicting progression. AJR 1998; 170:1067-1072
26.
Montaudon M, Berger P, Cangini-Sacher A, et al. Bronchial measurement with three-dimensional quantitative thin-section CT in patients with cystic fibrosis. Radiology 2007; 242:573-581
27.
Arakawa H, Webb WR. Air trapping on expiratory high-resolution CT scans in the absence of inspiratory scan abnormalities: correlation with pulmonary function tests and differential diagnosis. AJR 1998; 170:1349-1353
Information & Authors
Information
Published In
Copyright
© American Roentgen Ray Society.
History
Submitted: August 8, 2007
Accepted: December 11, 2007
First published: November 23, 2012
Keywords
Authors
Metrics & Citations
Metrics
Citations
Export Citations
To download the citation to this article, select your reference manager software.