Abstract
BACKGROUND: Although the statistically derived lower limit of normal (LLN) for the ratio of FEV1 to FVC is considered superior to a fixed cutoff value (such as 0.70) for diagnosing airway obstruction, the fixed-cutoff method continues to be used and advocated.
OBJECTIVE: To evaluate the misclassification of spirometrically determined airway obstruction arising from the use of the fixed-percent method, in comparison to the LLN method for FEV1/FVC.
METHODS: We reviewed 27,307 spirometry records from adult men, and diagnosed airway obstruction based on the LLN (predicted value minus 1.645 times the standard error of estimate from a north Indian reference equation for FEV1/FVC) and based on a fixed cutoff of 0.70. We computed agreement and discordance between the two methods, and determined the sensitivity, specificity, and predictive values of the fixed-percent method in identifying true obstruction.
RESULTS: The results were discordant in 1,622 subjects (6%). Overall agreement between the two methods was good (kappa estimate 0.869), but worsened considerably with advancing age. 1,290 subjects (5%) who were deemed normal with the LLN method were diagnosed as having airway obstruction with the fixed-percentage method. Overall the sensitivity, specificity, and positive predictive value of the fixed-percentage method were 0.963, 0.929, and 0.871, respectively. Specificity and positive predictive value decreased sharply with advancing age.
CONCLUSIONS: The negative age-dependence of FEV1/FVC results in over-diagnosis of airway obstruction in middle-aged and elderly men, and under-diagnosis in young men, with the fixed-percentage method. Airway obstruction should be assessed with the LLN of FEV1/FVC, with the LLN derived from appropriate reference equations.
- airway obstruction
- India
- obstructive lung diseases
- predictive value of tests
- reference standards
- spirometry
Introduction
Demonstration of a reduced ratio of FEV1 to VC (vital capacity) or FVC (forced vital capacity) on spirometry remains the universally accepted criterion for diagnosis of airway obstruction in routine clinical practice. Despite this, there is no broad consensus as to how this reduction should be defined. It has been an age-old practice to use a fixed ratio as the cutoff for this purpose. Most commonly, an FEV1/VC of less than 0.70 or 0.75 is used to diagnose obstruction on spirometry. Even though there is no statistical or epidemiological basis for choosing 0.70 (or for that matter any other similar ratio) as a cutoff for this purpose, this practice remains engrained in usage worldwide.
The problems and errors of using fixed percentages of reference values were pointed out more than 4 decades ago.1 It was also proposed that the statistically derived lower limit of normal (LLN) should be preferred over a fixed percentage when interpreting spirometry data.2 We previously found that fixed-percentage cutoffs introduce unacceptable misclassification rates in interpretation of spirometry results.3 In an effort to standardize interpretation of lung function tests, the American Thoracic Society proposed its guidelines in 1991, and recommended that airway obstruction should be defined by an FEV1/VC (or FEV1/FVC) below a certain LLN.4 This LLN could be either the value below the fifth percentile or the lower 95% confidence limit of the values from a reference population. The American Thoracic Society/European Respiratory Society guidelines published in 2005, and the more recent recommendations on spirometry in the primary-care setting, have largely reiterated that stand.5,6 However, several old and recent international initiatives, mostly focusing on COPD, still recommend the use of a fixed percentage for this purpose.7–9 The Global Initiative for Chronic Obstructive Lung Disease (GOLD) guidelines, first published in 2001 and thereafter updated annually, define COPD as a post-bronchodilator FEV1/FVC below 0.70.10
Many healthy individuals have an FEV1/FVC below 0.70, and the proportion of such individuals increases with advancing age.11–14 It is pertinent to note that a large number of subjects suspected to have COPD are screened in the sixth decade or later. If the GOLD (or such similar) guidelines were universally followed, many healthy individuals could be falsely diagnosed as having obstruction, based on a cutoff value of 0.70. A review of several published reference equations for FEV1/VC clearly shows that, with very few exceptions, the LLN predicted ratio declines to well below 0.70 with advancing age.15,16 Although the developers of GOLD guidelines have acknowledged the criticism of their fixed criterion to define airway obstruction, they opined that more population-based data are required to determine outcomes of those found to be obstructed by either or both methods. We planned this study to evaluate the magnitude of difference in spirometrically determined airways obstruction with the fixed-percentage and LLN methods, and to estimate the quantum of misclassification with the fixed-percentage method, in men undergoing routine pulmonary function testing at our institute.
Methods
The study protocol was approved by our institutional ethics committee. Our pulmonary function laboratory offers spirometry as a routine service in both the out-patient and in-patient sections. Spirometry records are maintained in a computerized database specifically developed for this purpose.17 For this study we retrospectively analyzed the pulmonary-function-testing records from all men older than 15 years of age who underwent spirometry in January 1999 through December 2008. We included all reports from eligible subjects who had multiple spirometry records. We did not examine the reasons for performing spirometry, or other clinical details (including smoking status and diagnosis).
All subjects had performed spirometry with a dry rolling seal spirometer (Spiro RS232, PK Morgan, Kent, United Kingdom), per standard prevalent spirometry guidelines, assisted by technicians experienced in pulmonary function testing.18,19 Spirometer circuit leaks and equipment calibration were frequently checked to ensure performance. For each subject, the highest measurements of FVC and FEV1 from among at least 3 technically acceptable and reproducible maneuvers are expressed at body temperature and pressure saturated with water vapor. An additional slow VC maneuver was not routinely performed. We employed 2 widely used spirometry-based definitions of airway obstruction: a fixed-percentage cutoff of FEV1/FVC < 0.70, and the LLN, computed as the difference between the predicted value and 1.645 times the standard error of estimate of the reference equation for FEV1/VC in use at our institute.17,20 These equations were generated from spirometry studies performed on 962 healthy non-smoking north Indian adults, ages 15–74 years, with a water-seal spirometer. This LLN cutoff represents the lower 5% confidence limit, and is equivalent to the fifth percentile of values in the reference population. Spirometry records that showed an FEV1/FVC less than the LLN were classified as having an obstructive pattern.
We stratified the subjects into 5-year age groups and computed the prevalence of obstruction and the proportion of discordant results for each group. We calculated the agreement on obstruction diagnosis with the kappa estimate. We used the LLN definition as the reference standard to identify true obstruction.4,5 The proportion of results wrongly classified as having an obstructive defect using the fixed-percent method was accordingly estimated. We also calculated the sensitivity, specificity, and positive and negative predictive values of the fixed-percent method in identifying true obstruction.
Results
During the study period, 28,221 male subjects underwent spirometry. We excluded 30 records because of incomplete data, and 884 records because the individuals were ≤ 15 years old, so 27,307 records formed the data set for the analysis. For the analyzed cohort, the mean ± SD height was 166 ± 7 cm, and the mean ± SD age was 48 ± 16 years. 18,339 subjects (67%) were older than 40 years, and the eldest was 95 years old (Table 1).
In all, 16,999 subjects (62%) had no evidence of airways obstruction by either method, 8,686 subjects (32%) had an obstructive pattern by both definitions, and 1,622 subjects (6%) had discordant results. Of those 1,622, 1,290 (5%) had an obstructive pattern only with the 0.70 cutoff, and all these subjects were older than 35 years (Fig. 1). Another 332 subjects (1%) had obstruction only with the LLN definition, and all of these subjects were ≤ 40 years old (see Fig. 1). Overall agreement between the 2 methods was good: the kappa estimate was 0.869 ± 0.003 (standard error) for the entire cohort. On detailed analysis, this agreement was good only in the age group 26–65 years, and the misclassification progressively worsened with advancing age (see Table 1 and Fig. 1).
For the entire cohort, the sensitivity, specificity, positive predictive value, and negative predictive value of the fixed-percent method were 0.963, 0.929, 0.871, and 0.981, respectively. The sensitivity of the fixed-percentage method in picking up airway obstruction was excellent from the fifth decade onwards (see Table 1), but the specificity and positive predictive value were poor in that age group, and progressively worsened with advancing age (see Table 1).
Discussion
A person's FEV1/FVC is an individual value that depends, among other factors, on sex and age. Because of this, it is almost impossible to accurately predict an individual's normal FEV1/FVC. For epidemiological and clinical purposes we generally rely on reference values derived from observations on apparently healthy individuals from the general population. These statistically derived values take into account age, sex, and body habitus, and provide a lower estimate of the range of normal. Use of the LLN derived from regression equations provides a kind of floating estimate of obstruction. Although this method is not a perfect solution, it is certainly much better in statistical terms for identifying a truly decreased FEV1/FVC value. LLN-based estimates are clearly superior to any arbitrary fixed cutoff value (eg, 0.70) for discriminating healthy from diseased individuals. More recently, an innovative lambda-mu-sigma method was proposed, which defines the LLN for FEV1/FVC as the fifth percentile of the distribution of Z scores. Data from 3,502 American subjects ages 40–80 years suggested that FEV1/FVC below the LLN thus computed identifies individuals with a higher risk of death and higher prevalence of respiratory symptoms, which supports the use of that threshold for diagnosing COPD.21 However, that novel statistical method still needs validation in other studies.
Some healthy persons have FEV1/FVC below 0.70, and the proportion of such individuals increases with advancing age.15 In a population-based study in north Italy, spirometry was performed on 1,727 adults ages 25–73 years, of whom 40% had an FEV1/FVC less than 0.75, and 18% had an FEV1/FVC less than 0.70.11 “Abnormal” FEV1/FVC was more frequent in men, smokers, and people older than 45 years. In a study with nearly 4,000 elderly subjects in Norway, 21% of the men and 17% of the women ages 60–69 years, and 38% of the men and 26% of the women > 70 years old, had FEV1/VC less than 0.70.12 Another report on 71 asymptomatic Norwegian never-smokers > 70 years old found FEV1/FVC below 0.70 in 35% of the total sample, and in 50% of the subjects > 80 years old.13 In the primary-care setting, the use of a fixed-percentage cutoff clearly leads to over-diagnosis of COPD in elderly subjects, and to under-diagnosis in young subjects, and should hence be avoided.6
The FEV1/FVC reference equations we use at our center are based on age and height as dependent variables, and FEV1/FVC decreases with increasing height and advancing age.17,20 In a north Indian man > 40 years old, the measured FEV1/FVC can be below 0.70 but still above the LLN for his age and height. Per standard recommendations for spirometry interpretation, he would not have an obstructive defect, but would be diagnosed as having obstruction using the GOLD criterion.5,10 A large number of subjects suspected to have COPD are screened in the sixth decade or later. If the GOLD guidelines were routinely followed, many of them could be falsely diagnosed as having obstruction, based on a cutoff value of 0.70. Clearly, therefore, the 0.70 cutoff tends to overestimate obstruction in the age group in whom it is most crucial to diagnose or rule out COPD. The 0.70 cutoff underestimates obstruction in younger individuals, in whom the LLN of FEV1/FVC is considerably higher than 0.70 (Fig. 2), and in whom an FEV1/FVC above 0.70 could be associated with true airway obstruction.
Several investigators have documented a high misclassification rate with the fixed-cutoff criterion in both healthy individuals and patients (Table 2).14,22–31 Substantial misclassification was also reported with the fixed-percentage criterion in an analysis of 815 young adults (< 45 years old) with asthma in the European Community Respiratory Health Survey database.32 The 0.70 criteria had 77% sensitivity and 100% specificity among men, and 57% sensitivity and 100% specificity among women. Our results closely approximate the findings of a recent analysis of spirometric data from 14,056 Dutch subjects (both men and women) in a primary-care setting.33 The overall sensitivity, specificity, positive predictive value, and negative predictive value of the fixed 0.70 cutoff, relative to the LLN cutoff definition, were 0.979, 0.912, 0.720, and 0.995, respectively, in that study. Similar to our observations, there was a steady increase in false positive results with advancing age; false positives exceeded 15% in subjects > 70 years old. As spirometry is designed to aid diagnosis, rather than exclusion, of obstruction, the low positive predictive value of the 0.70 fixed cutoff in our findings clearly exposes the limitation of the 0.70 fixed cutoff in routine clinical practice.
The 0.70 FEV1/FVC cutoff therefore cannot be applied as a general “rule of thumb” in the general population, because it yields false negatives in young adults and false positives in older individuals. Mathematical complexities and lack of appropriate reference equations have often been proposed as reasons that physicians and researchers opt for the simpler fixed-percentage criterion. But these certainly are not such great problems that they justify the huge health costs associated with misdiagnosis of airway obstruction in a rather large proportion of our adult population. In fact, most modern computerized and programmable spirometers routinely provide LLN values for all the variables reported.
Limitations
Our analysis, though it included a large number of subjects, is not entirely without limitations. For one, our study's retrospective nature did not allow us to further evaluate patient-related factors such as clinical diagnosis or smoking status. Although we strive to meet standard performance criteria in all spirometry, our retrospective analysis did not allow us to examine individual test quality, so we could not provide exact details on the proportion of tests that met the spirometry performance criteria. Moreover, during the 10-year data-collection period there were minor changes in the recommendations on spirometry performance. For instance, during the first few years we used the then-prevalent between-maneuver FEV1 and FVC reproducibility threshold of 0.2 L.18 Later the reproducibility goal was changed to 0.15 L, per the revised American Thoracic Society/European Respiratory Society recommendations.19 These and other changes in practice might have introduced important changes in the quality of our spirometry records. Again, because the study was retrospective, we cannot ensure conformity to the current guidelines.
Post-bronchodilator spirometry values were available for only a few subjects, and were therefore not examined. This might have affected our results vis-à-vis the GOLD guidelines, which recommend using post-bronchodilator spirometry results to diagnose airway obstruction. However, we did not aim at suggesting a diagnosis of COPD among the subjects studied, but rather looked at identification of an obstructive defect on spirometry. Additionally, we did not have data on smoking habits and clinical features from some subjects, which would have been essential to consider a diagnosis of COPD. We chose to study only men, because our corresponding reference equations for women yield poor results in the elderly, possibly as a result of poor representation of this age group in the study population from which our reference equations were derived.34 Our observations are also derived from a database of men who were referred for spirometry, which indicates a higher pre-test probability of abnormal lung function. Hence, our estimates on the prevalence of obstruction cannot be extrapolated to a population or a primary-care setting. Nonetheless, the large number of subjects we studied, with adequate representation of age groups, improves the robustness of our findings.
Conclusions
Our results suggest a substantial negative age-dependence of FEV1/FVC, which leads to over-diagnosis of airway obstruction in middle-aged and elderly men, and under-diagnosis in young men, as compared to the standard LLN criterion. There is a need to rethink the method of identifying airway obstruction to avoid sacrificing proper diagnosis for simplicity, especially in the primary-care setting. Airway obstruction should be defined with an FEV1/VC below the LLN, derived from an appropriate reference equation, and not by any fixed, arbitrary cutoff.
Footnotes
- Correspondence: Ashutosh N Aggarwal MD, Department of Pulmonary Medicine, Postgraduate Institute of Medical Education and Research, Chandigarh 160012 India. E-mail: ashutosh{at}indiachest.org.
The authors have disclosed no conflicts of interest.
See the Related Editorial on Page 1861
- Copyright © 2011 by Daedalus Enterprises Inc.