Abstract
BACKGROUND: Although quality control standards are recommended to ensure accurate test results, the coefficient of variation for the FVC and FEV1 biologic quality control (BioQC) is not specified. The primary aim of this study was to evaluate variations in spirometry BioQCs in a large and diverse cohort of individuals to determine an acceptable standard for the coefficient of variation.
METHODS: The FVC and FEV1 biologic control data were secondary analyses from an inhaled medication trial that was conducted over 3 y ending in 2018 that included 114 laboratories. Results were sent to a central repository for expert review. The FVC and FEV1 coefficients of variation were based upon a minimum of 10 spirometry values annually separated by at least 5 d. A second method of computing the coefficient of variation used 10 values within 28 d. Descriptive statistics were computed. Wilcoxon signed-rank tests were conducted to compare whether the median coefficient of variation values between the 2 methods differed, tested at α = 0.05 using SPSS.
RESULTS: Of 249 biologic control participants, 170 met the first year’s inclusion criteria. The coefficient of variation for the 5-d separated method was < 5% for 94.1% of FVC and 93.5% of FEV1 values in the first year. By year 3, 90% of FVC and FEV1 coefficient of variation values were < 4%. The medians for the 5-d separated and the 28-d measure showed no difference for either FVC coefficient of variation or FEV1 coefficient of variation, Z = −1.764, P = .78, and Z = −0.980, P = .33, respectively.
CONCLUSIONS: Interlab biologic control variation values of < 4% for FVC and FEV1 are achievable; however, individual labs should strive to attain lower values. Acceptable coefficients of variation can be achieved within 28 d.
- Forced expiratory volume
- quality control
- spirometry standards
- reproducibility of results
- coefficient of variation
Introduction
Spirometry is the most common pulmonary function test used in diagnosis, monitoring, disability and impairment evaluations, as well as research and epidemiological surveys. The basic measures of spirometry are FVC and FEV1. An accurate interpretation of test results and clinical changes over time requires strong quality control.1 Prior to testing patients, technologists calibrate spirometry equipment to assure the equipment displays accurate values.2 Biologic quality control (BioQC) assessments further evaluate the entire testing system to include the equipment, the software, and procedures by technologists within the laboratory.3 The BioQC test is performed by a healthy, nonsmoking individual with stable lung function who can conduct an acceptable and repeatable spirometry. The individual’s BioQC mean value ± 2 SD establishes an expected range of values used to monitor quality control.4
The American Thoracic Society (ATS)/European Respiratory Society (ERS) guidelines recommend quality control standards; however, BioQC standards for FVC and FEV1 are not clearly specified.2 The ATS/ERS guidelines refer to the ATS Pulmonary Function Laboratory Management and Procedure Manual;4 however, the manual does not include statements on BioQC testing frequency nor an acceptable coefficient of variation. The published data on coefficient of variation were measured at various frequencies (hourly,3,5,6 daily,7-10 weekly5,8), in mostly small sample sizes (n < 1001,3,5-14 and n > 10015), varied smoking status (100% nonsmokers3,6,9,14,15 and some smoking history5,7,8,10,11), gender (< 50% male6,7,9,15 and < 50% female1,3,8,10,11,13,14), conducted by the same3,6,9 or different technologists,5,8,15 and with the same3,8,9,11,13,15 or different spirometers.1,3,13
Stronger evidence demonstrating the acceptable spirometry variation in biologic control subjects must be evaluated in order to establish consistent targets for spirometry BioQC coefficient of variation for FVC and FEV1. Therefore, the primary aim of this study was to assess FVC and FEV1 BioQC coefficient of variation in a large, global multi-center clinical trial with a diverse cohort for 3 years. Additionally, the coefficient of variation changes in each of the 3 years were assessed to determine how often BioQC coefficient of variation should be evaluated.
QUICK LOOK
Current Knowledge
A comprehensive spirometry quality control program includes conducting biologic quality controls (BioQCs). The first step in a BioQC program is to identify a healthy volunteer and establish the individual’s coefficient of variation.
What This Paper Contributes to Our Knowledge
This paper specified a process for establishing a spirometry BioQC coefficient of variation value. This study also identified an achievable standard for interlaboratory spirometry BioQC coefficient of variation across multiple manufacturers. Further, the study indicated that coefficient of variation values were stable over 2 years.
Methods
Study Design
A retrospective analysis was performed on FVC and FEV1 BioQC data collected from an inhaled medication study that was conducted over 3 y ending in 2018. This study was granted exemption status by the institutional review board from Rush University Medical Center.
Procedures
Onsite training on BioQC for FVC and FEV1 testing was provided to each of the 114 pulmonary function labs (PFLs) across North America, Europe, and Israel as a part of the inhaled medication trial. These training sessions included oversight of equipment calibration procedures and process of conducting spirometry BioQC testing. The equipment that was used for the testing was routinely calibrated according to the 2005 ATS/ERS quality control specifications for spirometer accuracy and precision.16 Five different brands of pulmonary function equipment were used in this study. The spirometers used were the Encore (CareFusion, San Diego, California), EasyOne Pro (ndd Medical Technologies, Andover, Massachusetts), Jaeger (Jaeger/Cardinal Health, Hochberg, Germany), MGC Diagnostics (St. Paul, Minnesota), and KoKo - formerly nSpire Health (Longmont, Colorado).
BioQC participants enrolled in the study were technical staff working regularly in PFLs. Participants in this study were free of any known lung diseases and were nonsmokers based on self-report. The following demographic data were collected from each subject’s submitted laboratory report: age, gender, race, and height. BioQC participants underwent study-specific training or were coached by a trained technologist to perform serial FVC and FEV1 measurements lasting from 1 y to > 3 y as part of the quality program.
Each PFL participating in this study sent spirometry test results from their BioQC subjects and mechanical quality control findings to a central repository for expert review. The expert reviewers examined calibration reports, BioQC numerical results, volume-time graphs, and flow-volume loops to assure adherence to 2005 ATS/ERS acceptability and repeatability standards.16 The 6 reviewers were registered respiratory therapists and registered pulmonary function technologists with over 100 years combined experience working with quality control. Feedback on errors were sent back to local sites. All satisfactory BioQC test results were transferred to a spreadsheet.
The initial coefficient of variation assessment included sequential BioQC tests separated by at least 5 d. Tests measured more frequently were removed to prevent values that may create an unrealistically low coefficient of variation value. The resulting coefficient of variation comprised the 5-d separated measure. A second assessment of BioQC coefficient of variation used the first 10 values provided they occurred within 28 d to determine whether the measurement time frame affected the coefficient of variation. This coefficient of variation represented the 28-d measure. Both spirometry coefficient of variation assessments were based on the first 10 values annually. Data were transferred from the spreadsheet to SPSS 22.0 (IBM, Armonk, New York) for the statistical analysis.
Data Analysis
Descriptive statistics were reported for demographic data and outcome variables. The Wilcoxon signed-rank test was used to evaluate if median values of coefficient of variation for FVC and FEV1 differed between the 5-d separated measure compared to the 28-d measure. The tests were conducted at alpha = 0.05 using SPSS 22.0. The Friedman test evaluated whether there were any coefficient of variation changes between each year, and pairwise Wilcoxon signed-rank tests evaluated the location of differences using P = .02 (.05/3) for a family-wise alpha = 0.05.
Results
There was a total of 249 BioQC participants enrolled in the study, of which 170 met the inclusion criteria for spirometry BioQC for the first year. There were fewer participants in the subsequent years; 87 in year 2 and 31 in year 3, respectively. The sample population included predominantly white females with the mean age of 47 y (see Table 1).
In the first year, > 93% of values for FVC and FEV1 had a coefficient of variation < 5%. The percentage of coefficient of variation values < 5% rose in the second and third years. There were > 90% coefficient of variation for FVC and FEV1 values that were < 4% by year 3. The values for coefficients of variation for BioQC for FVC and FEV1 over 3 y are listed in Table 2. Figures 1 and 2 show FVC and FEV1 BioQC coefficient of variation values for each year’s first 10 data points. The determination of whether the 5-d separated measure differed from the 28-d measure showed no difference for either FVC coefficient of variation or FEV1 coefficient of variation, Z = −1.764, P = .78, and Z = −0.980, P = .33, respectively. Only 70 (41%) of the 170 sites completed their coefficient of variation data collection within the 28-d time frame.
Analyses were conducted to see if median values of coefficient of variation differed between years. The Friedman test, n = 28, was significant at P < .001. The comparisons between FVC coefficient of variation and FEV1 coefficient of variation between years 1 and 2 and years 1 and 3 showed statistically significant differences at a P = .02 (see Table 3). There was a < 1% change in median coefficient of variation differences for FVC and FEV1 values over the 2-y period.
Discussion
High-quality spirometry requires careful attention to quality control practices to ensure the equipment’s accuracy and precision for producing reliable test results.2 Thus, routine performance of BioQC testing in addition to calibration procedures is vital for each PFL’s quality control program to detect systematic errors that may occur with the technologist, the software, or the specific spirometer.3,17,18 Regarding technologists, a study that compared 13 laboratories in the Denver area attributed interlaboratory variations due to differences in practices and procedures.14 Similar findings occurred in the SAPALDIA study when different operators used the same equipment.3 The current study had a large number of diverse individuals (n = 170) conducting BioQC values across a range of ages and race/ethnicity and equal distribution of sex.
The current study used 5 different brands of equipment as well as equipment at 101 different sites. Equipment variation was demonstrated by Kunzli and colleagues3 when the same technologist measured subjects using the same versus different spirometers. The variation for FVC coefficient of variation rose from 2.2% to 2.7%, and FEV1 coefficient of variation rose from 2.2% to 3.3% when measurements were made with different spirometers. In another study, instrumentation in a longitudinal study accounted for 13–58% of total FEV1 variability observed in subjects.3,14 Thus, BioQC targets need to be based on studies that include diverse equipment and sites. Further, the values must come from acceptable tests conducted on spirometers with confirmed volume linearity.
The Association for Respiratory Technology & Physiology (ARTP) recently recommended that BioQC values be computed by obtaining 10 measures over 10 consecutive working days.18 The data from the current study demonstrated that only 41% of the BioQCs successfully completed acceptable tests to establish their baseline coefficient of variation within 28 d. This indicates that although a 10 consecutive working-day target may be appropriate it is not pragmatic for most PFLs. Computing the coefficient of variation on 10 data points within 28 d did not yield significantly different results than if the data were collected over more than 50 d.
Table 4 shows a historical summary of spirometry BioQC FVC and FEV1 coefficients of variation including the variability in number of participants, number of sessions, the measurement frequency, the trials/session that were conducted, and whether published standards were followed. All these variables affect the resulting coefficient of variation. For example, note that the smallest coefficient of variation results from tests conducted on the same day at hourly intervals. The number of participants in each study also affects the results by reducing the coefficient of variation. The SD decreases with increasing sample size, which directly lowers the coefficient of variation.15 This concept is highlighted clearly in a study that evaluated a single person who tested in 21 PFLs over a period of 13 y. The coefficients of variation were smallest when the number of tests was highest. Further, the interlab variability was higher than the intralab variability.13 This finding is expected due to the variances in equipment, available training, technologists conducting the test, characteristics of BioQC individuals (smoking history), regional differences, and standards utilized.
The BioQC participants in the current study received training at the study outset on-site specific to their equipment. The proper function of their equipment was confirmed prior to data collection. Experts assured all test results met the 2005 ATS/ERS criteria.16 The data accounted for 5 different spirometry systems across 114 labs and demonstrated that an acceptable target for the coefficient of variation was < 4%. Although interlaboratory standards may be higher due to the variables identified above, each lab should have values < 4% and strive to achieve an even lower coefficient of variation to improve data integrity. For example, the British Columbia Accreditation program requires a coefficient of variation < 3% for individual labs (https://www.cpsbc.ca/files/pdf/DAP-SP-Quality-Control-Plan.pdf, Accessed July 27, 2021).
One additional finding that emerged from the current study was the individual’s experience conducting a BioQC. Figures 1 and 2 demonstrate that the majority of BioQC values > 4% appeared in year 1. Greater reproducibility and precision are possible when PFL staff understands potential error sources and calibration standards. The smaller coefficient of variation range by year 3 demonstrates how a user’s experience lowers the range. The fact that 80% of FVC BioQC coefficient of variation values were < 3% by year 3 could have been due to combined learning about equipment and procedures. This reinforces a prior statement that improvements in BioQC testing can improve laboratory precision.13
Additionally, this study investigated the coefficient of variation changes in each of the 3 y to determine how often BioQC coefficient of variation should be evaluated. There were statistically significant changes in both FVC and FEV1 between years 1 and 2 and between years 1 and 3. However, this < 1% change in medians of coefficient of variation is not practically significant. This finding suggests that the frequency of calculating the coefficient of variation depends on the experience with testing. More variability was observed in BioQC testing in the first year of the study. Technologists conducting BioQC measures mastered their skills, and less variability was noted with each year. Therefore initially, laboratories may recalculate their BioQC values annually until greater subject consistency is achieved. Once the stability of results is achieved, it might be appropriate to recalculate BioQC limits every few years due to natural lung changes. The latest 2020 ARTP spirometry statement recommends recalculations of the ranges for BioQC spirometry values and coefficients of variation at least once every 2 years.18
There were several limitations to this study. The main limitation is that the number of participants declined each year, making the sample size smaller for the between-year comparisons. The decreased sample size in years 2 and 3 was a result of a decreasing number of active sites and BioQCs in the clinical trial. Sites were closed as they completed the study, resulting in natural attrition for a 3-y study. The ending date for each site in the study influenced the total number of participants in year 3. Fewer participants in year 3, however, did not affect the power to find a difference between years 1 and 2 or years 1 and 3 of the study. It may have influenced finding a difference in coefficient of variation between years 2 and 3 due to the smaller effect size present between those years. Also, the majority of participants were white females. It is not known if variability would change if more men or a more diverse population participated. In addition, the smoking status and health history were based on self-report, and the measurements did not account for diurnal variation.
Conclusions
A strong spirometry BioQC program needs to adhere to current ATS/ERS spirometry standards to achieve acceptable results. The spirometry BioQC coefficient of variation values in this study came from a large and diverse cohort of individuals. These data support that FVC and FEV1 coefficient of variation standards < 4% are achievable and should be the maximal acceptable value for a spirometry BioQC program. Initially, values should be recalculated each year until values stabilize. After coefficient of variation values stabilize, it might be appropriate to recalculate coefficient of variation limits every few years.
Footnotes
- Correspondence: Ellen A Becker PhD RRT RRT-NPS RPFT AE-C FAARC, Cardiopulmonary Sciences Department, 600 S. Paulina Street Suite 750, Chicago, IL 60612. E-mail: Ellen_Becker{at}rush.edu
The authors have disclosed no conflicts of interest.
- Copyright © 2022 by Daedalus Enterprises