Abstract
Lung function parameters vary considerably with age and body size, so that, unlike many laboratory tests, the normal range of expected values must be individualized. For spirometry, only low values are considered to be abnormal, so the lower limit of normal (LLN) is taken to be equal to the 5th percentile of a healthy, non-smoking population. Simple and commonly used “rules of thumb,” such as an FEV1/FVC < 0.70 to indicate air-flow obstruction, or assuming values < 80% of predicted to be abnormal, are inaccurate and will cause misclassification, specifically under-diagnosis of abnormalities in younger, taller individuals and over-diagnosis in those older or shorter. A much more accurate LLN for the FEV1/FVC ratio, which recognizes the change with age of this measurement, can be easily determined by subtracting 10 (10% or 0.10) from the age specific FEV1/FVC predicted for any individual. The analysis and mathematical descriptions of reference data have become increasingly sophisticated in recent years, but the interpretation of values near the LLN continues to carry uncertainty, due to an overlap in values between low normal values and those reflecting early disease. Among patients referred to a pulmonary function laboratory, the pre-test probability of disease may be relatively high, so that even individuals with values above the LLN may be more likely than not to have respiratory disease. A future goal for the pulmonary community would be the development of risk stratified outcome data that would allow an estimation of the probability of disease with progressive decrements in lung function. While interpreting spirometry results near the LLN will continue to be problematic, a more important task for the pulmonary community is to focus on finding the pool of individuals with clear-cut, but undiagnosed, COPD. And for this, good quality spirometry remains the best tool and must be widely available.
- lung function testing
- spirometry
- lung volume measurement
- reference equations
- lung diseases
- obstructive
- outcome assessment
Introduction
A primary step in the interpretation of lung function data is to determine if the values obtained fall within the expected normal range of values. It has been generally accepted in both research and clinical medicine to allow a 5% error rate (ie, P < .05 for the probability of a chance association and ± 2 standard deviations (SD) to encompass 95% of a value normally distributed in a population). In pulmonary function testing, where we are generally concerned only with low values and where not all parameters are normally distributed, this has been stated as: the lower limit of normal (LLN) is equal to the 5th percentile of a healthy population. This appeared in the 1991 ATS guidelines, after equivalent statements appeared in the early 1980s, then was reconfirmed in the 2005 American Thoracic Society/European Respiratory Society (ATS/ERS) document.1–4 For a value with a normal distribution, the 5th percentile LLN is equivalent to the mean − 1.645 × SD. For regressions the equivalent variance is expressed as standard error of the estimate (SEE) or residual standard deviation (RSD). For tests in which either high or low values may be considered abnormal, Mean ± 2 × SEE would define a normal 95% CI from the 2.5 to 97.5 percentiles.
Unlike the great majority of clinical laboratory tests, where the normal range is the same for all individuals, or at least all adults, lung function parameters vary considerably with age and body size, so that the expected value must be individualized. This is done by calculating a reference or predicted value that represents the midpoint of the normal range for healthy individuals of the same age, height, sex, and ethnicity. Preferably, but less commonly in our reports, a normal range or lower limit value would also be stated. This added complexity has led to the common use of approximations for the LLN, which lead to errors of classification, especially among individuals younger, older, shorter, or taller than the midrange.
The issues around the use of a fixed ratio of 0.70, often expressed as 70%, to establish air-flow obstruction in COPD have been dealt with elsewhere. That the FEV1/FVC ratio declines with age is not a new finding. In the classic Bates, Macklem, Christie textbook of 1971, Respiratory Function in Disease, it is noted that the value in normal subjects 20–30 years old is 87% and falls to 81% by age 50–60, referenced to a 1958 French paper (appropriately, as the FEV1/FVC ratio was described there in 1949 and became known as the Tiffeneau index).5 A decrement in the ratio with age was also clearly demonstrated in the reference data reported by Morris in 1973 and by Crapo in 1981.3,6 The change of 2% per decade in the French data is almost exactly that found in the more recent large National Health and Nutrition Examination Survey (NHANES) III data set.7 Despite the pleas of many, including much of the group assembled here, the Global Initiative for Obstructive Lung Disease (GOLD) organization8 has elected to continue to use a ratio of 0.70 in their definition of COPD, but there is no excuse for any pulmonary function laboratory, or even office based practitioner, to use this as a general criterion for air-flow obstruction.
The Lower Limits for Spirometry Data
The NHANES III reference data are recommended by the ATS/ERS 2005 guidelines for establishing spirometry predicted values for North Americans.2,7 These include separate equations to define the LLN for each parameter, and, as noted above, this should be included in the report, but, for the FEV1/FVC ratio, even this small step is unnecessary. Analysis of this data set showed that the variance (SEE) for the ratio did not vary significantly by age or height, so the interval from the predicted midpoint value to the 5th percentile LLN is constant and given by 1.645 × SEE. Conveniently, for the FEV1/FVC this value is very close to 10% (or 0.10). Exact values for men and women in the 3 ethnic groups reported are shown in Table 1. Since all labs, and even most handheld spirometers, display an age specific predicted value for the FEV1/FVC ratio, all the interpreter need do is subtract 10 to find a statistically accurate LLN. This same value would also be a reasonable approximation to the true LLN for any of the widely used reference sets, far more so than any fixed value of the ratio. One need not over-emphasize the exact value of the LLN, as there will always be some uncertainty around this point, and the interpreter would be wise to consider a range on either side (perhaps ±1; ie, between 9 and 11 units below the predicted ratio) to represent a “borderline value” rather than normal or indicative of air-flow obstruction.
More prevalent in clinical pulmonary function labs than the fixed ratio is the use of a fixed percent of predicted to determine the LLN for most other parameters, but this can lead to similar misclassification. Over the years the prediction equations have become more sophisticated and complex, but still rely primarily on age, height, sex, and ethnicity.
Typical adult equations of the 1970s and 1980s were linear regressions of the form:
The normal data were generally found to have a homoscadastic distribution: that is, the variance around values of small magnitude is the same as that around large values. In this case, the 5th percentile LLN will occur at a lower percent of a small predicted value, and at a higher percent of predicted for a large value. Thus a fixed percent predicted LLN such as the commonly used 80% will under-diagnose abnormalities in younger, taller individuals and over-diagnose in those older or shorter.
The NHANES equations of 1999 use low order polynomials in the form:
In order to cover the age range from 8 to 80, it was necessary to provide separate equations above and below age 18 (F) or 20 (M). Unlike some earlier reference data, the large NHANES III data sample did show that the population variance of FVC and FEV1 increased with the larger values associated with greater height, although it did not vary with age, and thus the equations for LLN take height into account. Even so, the calculated LLN for FVC and FEV1 ranges from 83% of the predicted value in young adults to 75% at age 70, so the widely used 80% of predicted is correct, or nearly so, only in the midrange of age (Table 2). For the forced expiratory flow during the middle half of the FVC maneuver (FEF25–75), the true LLN ranges from 65% of predicted in young individuals down to 35% by age 70, and misunderstanding this has contributed to much of its undeserved reputation for “sensitivity.”
The Lamda, Mu, Sigma Method
Recent efforts have used more complex statistical techniques to express lung function in smoothed curves through growth and decline.9 Most recently, Stanojevic et al have introduced to spirometry the use of the lambda, mu, sigma (LMS) method, previously used to develop pediatric growth charts with their bands of percentiles.10 At each point along the continuum, mu represents the median, sigma the coefficient of variance, and lambda is an index of skewness. The result is not an equation per se, but a set of tables, read by computer that creates a smooth continuous predicted value (given by the median, mu) from early childhood to old age. The sigma and lambda terms allow for the 5th percentile LLN to be independently determined throughout the age-height spectrum and further demonstrates the unsuitability of a fixed percent value for LLN. By this analysis, the normal range, as a percent of predicted, of both the ratio and its components is wider in young children and widens progressively above age 40 (Fig. 1).
This LMS method is being utilized in a current ERS Task Force effort to merge many data sets into new prediction models. One example of its flexibility is shown in the recent publication describing an unusual variation in the FEV1/FVC ratio during the adolescent growth spurt, where the ratio, which falls rapidly during childhood, then transiently increases before resuming the steady decline of adulthood.11 This apparently reflects differences in the rate of development of airway properties and thoracic dimensions. This task force, the Global Lungs Initiative group, has now gathered raw spirometric data from over 150,000 healthy subjects in 70 data sets from 34 countries, in an effort to form a single source for multiethnic, all age reference data. Early results from this large data pool show that there has been no secular trend in spirometry values over a 30 year span, and provide evidence that to reliably reflect a population, the reference sample needs to have at least 150 individuals of each sex.12
Indicators of Severity
It has been traditional to express deviations from the expected reference value as a percent of predicted, and these percentages are widely used to categorize grades of severity. Individual values can also be reported as a percentile of the normal population. While this provides some information within the low normal range that might be helpful in risk assessment, it breaks down for the broader distribution of abnormal values, as these would all be compressed from the 5th percentile to near zero. Since the 5th percentile LLN is equivalent to predicted −1.645 × SEE, the LLN can also be described as a z-score of −1.645, with larger negative numbers showing increased severity. Reporting the z-score has the advantage that the LLN is the same number (−1.64) for all spirometric parameters. This could be quite helpful in weaning practitioners away from the flawed 80% shortcut, but its utility for severity grading is less clear. The unit of the z-score tells us how likely this value would be in a normal population, but it is not intrinsically related to the impact of the disease. At this point, knowing the FEV1 is 40% of normal conveys more meaning to me, and certainly to the patient, than a z-score of −4.9.
Spirometry Reference Data
For spirometry a great many reference equations exist, some based on rather small samples and others on large populations, some with samples of convenience and others by statistically valid population sampling, and most, recently at least, with efforts to restrict to healthy non-smokers. Many reflect specific geographic or ethnic groups. The NHANES III reference data have been recommended because based upon a large group, appropriately sampled, with good quality spirometry, and provides separate equations for whites, African-Americans, and Mexican-Americans.7 The Stanojevic et al publication applies the LMS method to the NHANES data, and merges it with additional pediatric data to extend the age range down to 4.10,13 It is thus helpful in tracking the function of growing children without having to jump among equations from preschool, to school age, through adolescence to adulthood. However, these data are currently available only for whites, and, for adults, differ very little from NHANES. New multi-ethnic reference data spanning age 3–90, collated by an ERS Task Force, are expected to be published in 2012.
Accepting Uncertainty
The NHANES spirometry data match quite well with earlier publications by Crapo and others, so, although newer and larger, it does not appear that interpretive results would be much different using the earlier studies.3,14,15 We face much greater uncertainties than whether the LLN of a particular individual's FEV1/FVC ratio should be 0.72 or 0.73. There are the issues of whether a particular reference set matches our patient, our equipment, and the skills of our technicians, but more dominant is the question of the clinical meaning of a value near the LLN in this individual. We know that 95% of healthy non-smokers would have higher values, but that does not tell us the likelihood that this person is normal or has disease. For that we would need to know something about the pretest probability of disease in this individual and among the population seen in our laboratory. If we are testing an asymptomatic non-smoker at the recent World Spirometry Day, then even a value a point or 2 below the LLN may be more likely than not to be “normal.” However, in a hospital based referral lab, those with FEV1/FVC at, or even a few points above, the LLN may be quite likely to have early air-flow obstruction, although we dare not say it. This led one physician many years ago to state, “The general tendency in pulmonary function testing at the clinical level is toward cautiously noncommittal over-interpretation in language replete with modifiers.”16 Rather than the dichotomy of normal/abnormal, we need the risk profiles and outcome data that would allow us to make a rational statement about the probability of disease at any point on the spectrum of FEV1/FVC values from mid-normal down to the LLN and below. Lacking that, we will have to continue to be “cautiously noncommittal,” and although we can certainly do better than the tarnished GOLD standard of 70%, it seems unnecessary to obsess over the exact LLN.
This uncertainty is also not a new idea. It was the underlying message of the 1968 “cautiously noncommittal” editorial referred to above,16 and Clausen discussed this very nicely in a 1982 publication of the California Thoracic Society, stating that “the large and inherent overlap between normalcy and disease states will persist as a limitation in pulmonary function test interpretation.”4 The ATS Spirometry 1994 update made a distinction between the related tasks of “(1) the classification of the derived values with respect to a reference population . . . and (2) the integration of the spirometric values into the diagnosis, therapy and prognosis for an individual patient.” And, noting the difficulty of the latter, concluded, “Accordingly, no specific guidelines for interpretative procedures are recommended that would be applicable to all laboratories.”17
Spirometry Based Outcome Studies
We are beginning now to see some outcome data addressing the diagnostic criteria for COPD. Mortality follow-up to 22 years after the first NHANE Survey showed that even mild COPD, defined as FEV1/FVC < 0.70 and FEV1 > 80% (aka GOLD 1) carried a future risk of excess death, indicated by a hazard ratio (HR) of 1.3.18 This supports the recommendation to define obstruction by a low ratio, even when the FEV1 is “normal,” but does not help us interpret borderline values. Although some in this cohort would have had a ratio < 0.70 but above the true LLN, less than 25% of the population was over 60 at the time of spirometry, so the majority, below the LLN, would have been sufficient to drive the outcome of increased mortality.
Subsequently, in response to the 70% controversy, Mannino et al used the Cardiovascular Health Study population to report mortality outcomes comparing GOLD criteria to the LLN determined from NHANES III reference data.19 Baseline spirometry and follow-up for mortality and hospitalizations for up to 11 years were available for 4,965 participants: white and black (Table 3). Participants with spirometry considered normal (ratio > 0.70) but with respiratory symptoms did demonstrate an increased risk of death with HR 1.2. In this population, all > 65 at the time of spirometry, a full 75% of the GOLD 1 strata (FEV1/FVC < 0.70, FEV1 > 80%) had an FEV1/FVC above the 5th percentile LLN. The adjusted HR for death in this subgroup was not significantly different from those with normal spirometry, while those with FEV1/FVC below the LLN, but with FEV1 > 80%, had a significantly elevated risk of death: HR 1.4. Even in the GOLD 2 strata (FEV1/FVC < 0.70, FEV1 < 80%, > 50%), one third had a ratio > LLN. This group did show an elevated risk of death (HR 2.2). The conclusion was that older individuals with FEV1/FVC < 0.70 but “normal” by LLN were more likely to die, but this was shown only for the ∼25% that also had a reduced FEV1. All of these groups did show an increased risk of “COPD related” hospitalizations in the ensuing years. These data do support the idea that individuals with FEV1/FVC above the LLN, but with respiratory symptoms or borderline values, are at risk for morbidity. However, if we wish to incorporate that into our interpretations, it should be done in a more systematic, age independent, fashion, such as raising the LLN to the 7.5 or 10th percentile in those with risk factors for airway disease.
The latter strategy was recently considered by Fragoso et al, who used white participants in NHANES III aged 40–80 (not just the healthy non-smokers used for prediction equations), recalculated their predicted FEV1/FVC ratio from the Stanojevic LMS data, then looked at mortality at 6–12 years follow-up (Table 4).20 Those with a ratio less than the 5th percentile LLN were compared to those in successively higher 5 percentile strata, and to a reference group with ratio > 25th percentile. By this analysis only those with ratio < LLN (including all degrees of severity) had an elevated risk of death over this relatively short-term (adjusted HR of 1.68). The borderline group of interest, between the 5–10 percentiles, had a modestly elevated point estimate, but the confidence interval spanned 1.
Thus, while there are certainly people progressing toward COPD who have spirometry within the normal range, these studies do not suggest major short-term consequences. However, these were studies of general population samples, so not likely to reflect the much higher likelihood of early disease among those referred for pulmonary function testing. And, for COPD, the time from borderline spirometry to major morbidity or death is likely to be over 20 years—a serious challenge for outcome studies. The HRs calculated are adjusted for smoking exposure to reflect only the added impact of low spirometry, but, for prognosis, the combined risk might be more relevant. For example, in the Fragoso et al study, the unadjusted HR for death for participants with FEV1/FVC between the 5–10th percentiles was significantly elevated at 1.59, even with the relatively short 6–12 year follow-up.
So dealing with spirometry results near the LLN will continue to be problematic, but a more important task for the pulmonary community is to focus on finding the pool of individuals with clear-cut, but undiagnosed, COPD. And for this, good quality spirometry remains the best tool.
Lung Volume Reference Data
For lung volumes, the reference data question is quite different: not, which of many reference sets to use, but rather, are there any out there that should be used? The 2005 ATS/ERS documents included a table listing the lung volume reference data published 1993–2002, and among many small, ethnically specific studies there were none dealing with North American populations and only a few for specific European countries. For guidance, reference was made to a 1995 paper in which Stocks and Quanjer lamented the unsatisfactory state of lung volume reference data and republished the 1993 European Community for Steel and Coal reference equations.21,22 These, in turn were first published in 1983 and resulted from a merging of many previously published studies, mostly done in the 1960s and 1970s. Unlike the current international effort, which is using original data, the European Community for Steel and Coal equations were mathematically developed from the prior published equations: a method that adds an additional layer of uncertainty, particularly to the normal range. Also in common use are the data of Crapo et al,23 done in the same population from which spirometry and DLCO reference data were published. This has the advantage that the FVC predicted for spirometry matches the VC predicted with the lung volumes on the same report. The TLC was obtained from the single-breath helium dilution of the diffusing capacity test, and the number studied (123 M, 122 F) was, perhaps, marginal, but was well distributed over the age range. Most PFT software programs still offer the Goldman-Becklake data of 1959. These are based on a small sample of convenience (44 M, 50 F) from Johannesburg, South Africa, with no mention of smoking status.24 There are some newer data available, reporting plethysmographic lung volumes of healthy non-smokers, including 482 white adults from the Barcelona area25 and 212 New Zealand adults of European background.26 By collating these and others we can hope for better lung volume reference data if the Global Lungs Initiative group maintains its energy beyond spirometry.
Diffusing Capacity Reference Data
The reference data problem for diffusing capacity has been widely and long recognized. While spirometry predictions vary by up to a few percent, the variation among reference values for DLCO can be up to 40%. This reflects the fact that the test itself, even when nominally done by the same technique, varies considerably from lab to lab. Recent equipment has become better standardized, at least out of the box, but with time and in different laboratories, the variation returns.27 The latter problem can be addressed through careful biologic controls or by the simulator device available.28 Still, labs are well advised to select from the available reference sources the set that best matches a diverse group of normal subjects tested in that lab. This is becoming an increasing problem as patients may be excluded from surgery or chemotherapy due to a DLCO percentage cutoff, such as < 40%, but it is important to know, 40% of what? In reviewing the preoperative literature, I have usually not found a reference source for the predicted DLCO stated. If our labs are not using reference data well matched to our equipment and technique, we will be doing a disservice to our patients and colleagues.29,30
Reporting the Numbers
Having given due diligence to all of the above considerations, our final task in the PFT laboratory is to produce a report that conveys the data and our interpretation clearly and efficiently. The ATS/ERS guidelines make some recommendations for reporting, and the ATS PFT committee has further discussed more specific formatting. There is general consensus that only the more meaningful tests be reported (eg, for spirometry the FVC, FEV1 and FEV1/FVC ratio suffice). Unless careful attention is paid to the wide normal range, the FEF25–75 may be more often misleading than helpful, and flows at specific expired volumes do not add utility. Vedal and Crapo showed that, with 14 parameters reported, at least one would fall below the 5th percentile LLN in 24% of a healthy group.31 Given the need for an individually specific predicted value for each parameter, it has been common to list this first, but most other clinical laboratory reports show the actual value first, and this variability occasionally leads to the wrong value being transcribed into notes. There is strong consensus that uniformity among PFT labs would be desirable, and a poll of the speakers at this conference showed a 70% preference to list the actual value first. As discussed above, an indicator of the LLN should be included, both for the interpreter and the subsequent readers of the report. Options include the numerical value of the LLN, a percentile or a z-score. Showing only the percent predicted encourages continued use of the 20% “rule,” but this index is widely, and I think helpfully, used in severity assessment, so I do not think it can be abandoned. Perhaps seeing FEV1 or TLC values of 75% but above the LLN, or 82% but below the LLN, will help in the educational process. It is more than a bit disappointing to reread the thoughtful “cautiously noncommittal” editorial written when I was in medical school and still be discussing, 40 years later, the issues of a fixed percentage FEV1/FVC and ± 20% normal range, and the overlap of health and disease so well outlined then.16
Discussion
Coates:
Could we come back to one of the issues that I raised in pediatrics yesterday? That was the exclusion criteria in the NHANES III meant more than a quarter of the children were excluded. A few for smoking, most because they said they coughed occasionally or had physician-diagnosed asthma or wheezing or tightness in the chest. On the other hand, three quarters of the sample of adults were excluded. I'm not saying it is right or wrong, but we've talked a lot about not having a gold standard for asthma or COPD. Did we deal only with super-normals? What is the normal population? It seems very different than how we would come up with predictive values and LLN for a biochemical value such as serum sodium.
Culver:
Yes, but it's not that different than what we've done with cholesterol; we don't just take a population average of everybody's cholesterol. There have been outcome data, and our lab says “less than 200,” but I think there are now guidelines that say even less than 180 is what we should be aiming for. That's not a population average but an idealized number. The bone density people primarily relate to young healthy normals, not to the general population of their aged bone loss compatriots.
If we're trying to detect a disease, we don't want to include that same disease in the population, so I think we want a healthy population. Whether we've gone too far in excluding everything, I'm not sure. If you think the reference data aren't sufficiently inclusive, you might want to use the ±1.96 standard deviation for your lower limit to be a little more cautious, at least in people who you're screening, if you're doing that kind of thing, with minimal symptoms. I still think it's rational to try and select a healthy non-smoking population for our reference data.
Coates:
I guess the corollary to the question, then, is, when you start putting various series together, the criteria for deciding if you're healthy or not should be pretty similar, if not ideally the same.
Culver:
Yes, and that's one of the things that this global initiative is dealing with, because that's defined differently for different studies, and better known in some than others.
MacIntyre:
I want to focus on diffusing capacity, if I may. I'm not a statistician so I don't completely understand LLN calculations. I was told of a case of an older patient whose predicted DLCO was 15, and if you do the LLN calculation, it comes out to be 7, and that means that 40% predicted is still normal. The referring physician asked the very valid question, “What do I do with that kind of information?” My first question is, does the calculation of LLN use the same number for every predicted value? You showed data that say that works for vital capacity, but does it work for DLCO too?
Culver:
You should know better than I for diffusion capacity, because you've looked at those data. If the distribution is fairly homoscedastic, if it doesn't narrow down as the numbers get smaller, then, yes, it's the same absolute decrement below the predicted value. And, yes, that does reduce the LLN. We see the same thing with the FEF25-75, to be abnormal when you're 60 years old, it probably has to be about 40% predicted.
MacIntyre:
But people don't make major decisions based on the FEF25-75. They do make big decisions based on diffusing capacity.
Culver:
I think that probably tells us that it's a pretty shaky thing to make big decisions on in older folks.
Miller:
I think part of the reason is that most of the reference populations don't have a lot of people at the upper extreme of age. Most of the adult reference populations begin at age 18-20, so there are a lot of those, but there aren't a lot of presumably normal 70 and 80-year-olds. Then you get into the question of those who are normal: are they really super-normal? That's why the Thompson equation1 for DLCO, which is deliberately over-representing older people, may be useful, because you get shaky statistically when you get to extremes of age and heights. There aren't a lot of 80-inch people in the reference populations either.
Culver:
And even shakier if, as often happens, the computer has simply extrapolated an equation that was designed with data up to age 65 or 70 and may have extrapolated to age 75 or 80: those may not necessarily be linear. If the regression line isn't exactly lined up right, it shoots off into the sky or into the basement as it goes further off the page. I think that in looking at some of these data, that may be the case with the Goldman lung volume data,1 the female data has a fairly steep age regression, so that when I compared it to one of the others, it's 400 mL higher in younger individuals and 500 mL lower in older individuals.
MacIntyre:
I'm going to show my statistical stupidity here. To calculate the LLN, don't you take the predicted value and have a single number that is derived from the standard deviation of the prediction equation?
Enright:
Indeed, that is true for many published studies.
MacIntyre:
OK, it's a fixed value. And you use that same number whether it's a big diffusion capacity or a little diffusion capacity. I'm just trying to figure out, is that the right way to do this?
Culver:
Yes. You can determine that number for any regression, but whether that was a valid number would depend on whether it met criteria for what they call homoscedasticity (ie, the variance is the same around small and large values).
MacIntyre:
Now, that's a term I'm not familiar with, but I kind of like it because it explains my conundrum here. We're assuming the diffusion capacity is homoscedastic, but is that in fact true?
Culver:
That might be a data set that, if one applied the LMS method,1 which can adjust the variance all the way along the age-height spectrum, might come up with differences. This method would not say that the FEV1/FVC was perfectly homoscedastic. If you fine-tune it, you can find variation as you go along the age spectrum. So you might see that in the diffusion capacity, but it also might get even wider in the elderly. It often tends to because you have fewer people, and maybe with survivor populations in their old age there can be quite a bit more variability. Be careful what you wish for, because it might get worse.
MacIntyre:
I hate to keep harping on this, but it's bothering me a lot. It just seems that this single value to calculate LLN using a predicted value of 35 versus using a predicted value of 15 yields results that seem quite different clinically. Subtracting 8 from 35 and your LLN is now 27. Subtracting 8 from 15 is now 7. Help me here.
Miller:
These are the questions we asked our statisticians when we published our data. The answer there is, if you're concerned about these extremes of age and height, you have to get more people who are in that range. You're stuck otherwise. The basic conclusion, that when you're younger or taller, you're below 80% predicted to be abnormal, and when you're shorter or older, you're well below 80% to be abnormal—that still holds. Whether you could fine-tune that offset, that confidence interval, you need people in that age or height group.
Enright:
There is light at the end of the tunnel. More than half of us around the table are directors of PFT labs that test in one week several times the number of healthy people that comprise the reference equations1 that we're currently using for DLCO. It's very embarrassing. The Global Lungs Initiative group has recently begun gathering DLCO data from large studies of healthy people. Not just the equations as published2 in 1983 by the European Community for Steel and Coal, but the raw data. Al Miller has contributed his raw DLCO data to it already, and we already have about 1,000 individuals, but we hope to have many times more by the end of next year. We'll use the updated methods that do not assume on homoscedasticity.
We will include data from older people. The PLATINO [Latin American Project for the Investigation of Obstructive Lung Disease] study3 in Mexico City and in South America have just bought DLCO machines and are going to apply them to general population samples in 5 of their cities. The investigators have promised to offer those data to the Global Lungs Initiative group, and we'll have the data from Australia that you mentioned.
So there is some light at the end of the tunnel, and I urge all of you to, if you know a study of normal population samples with DLCO (especially in children and elderly, because they're sorely lacking), please offer them to the group. I think it's really a shame that NHLBI [National Heart, Lung, and Blood Institute] is not going to fund a study of many lung volumes in population-based samples. The CDC [Centers for Disease Control and Prevention] with NHANES IV is not doing it; they're almost completely done gathering their spirometry data. It's a major problem that we should have addressed decades ago.
Kaminsky:
Bruce, I just wanted to make one comment. The theme here is how to interpret results. One thing you didn't mention but is very important is following results over time. As we all know, we can have someone come in with a FEV1 of 110% predicted but a low ratio, and they have symptoms of cough or wheeze with exercise, they may still have asthma. Likewise for the diffusion capacity: if it's dropping over time more than you would expect based on their age—I determine that just by looking at percent predicted—if that percent predicted is dropping where it shouldn't as the person ages and this correlates with something clinically, either desaturation with exercise or progression of disease radiographically, those are the things that help us in this gray zone as well.
Footnotes
- Correspondence: Bruce H Culver MD, Division of Pulmonary and Critical Care Medicine, University of Washington School of Medicine, UWMC 356522, Seattle WA 98915. E-mail: bculver{at}uw.edu.
Dr Culver presented a version of this paper at the 48th Respiratory Care Journal Conference, “Pulmonary Function Testing,” held March 25–27, 2011, in Tampa, Florida.
The author has disclosed no conflicts of interest.
- Copyright © 2010 by Daedalus Enterprises Inc.