Abstract
BACKGROUND: Predicting mortality has become a necessary step for selecting patients for clinical trials and defining outcomes. We examined whether stratification by tertiles of respiratory and ventilatory variables at the onset of acute respiratory distress syndrome (ARDS) identifies patients with different risks of death in the intensive care unit.
METHODS: We performed a secondary analysis of data from 220 patients included in 2 multicenter prospective independent trials of ARDS patients mechanically ventilated with a lung-protective strategy. Using demographic, pulmonary, and ventilation data collected at ARDS onset, we derived and validated a simple prediction model based on a population-based stratification of variable values into low, middle, and high tertiles. The derivation cohort included 170 patients (all from one trial) and the validation cohort included 50 patients (all from a second trial).
RESULTS: Tertile distribution for age, plateau airway pressure (Pplat), and PaO2/FIO2 at ARDS onset identified subgroups with different mortalities, particularly for the highest-risk tertiles: age (> 62 years), Pplat (> 29 cm H2O), and PaO2/FIO2 (< 112 mm Hg). Risk was defined by the number of coexisting high-risk tertiles: patients with no high-risk tertiles had a mortality of 12%, whereas patients with 3 high-risk tertiles had 90% mortality (P < .001).
CONCLUSIONS: A prediction model based on tertiles of patient age, Pplat, and PaO2/FIO2 at the time the patient meets ARDS criteria identifies patients with the lowest and highest risk of intensive care unit death.
- acute respiratory distress syndrome
- ARDS
- age
- mortality
- PaO2/FIO2
- plateau airway pressure
- prediction
- tertiles
Introduction
Various attempts have been made to predict which patients with acute respiratory distress syndrome (ARDS) are likely to have an unfavorable outcome.1–5 Associations between indicators of illness severity and mortality have been reported, but their predictive power remains controversial.2–11 Observational studies of ARDS patients have had variable findings, but in general, the prognosis seems to be related to age, underlying disease, severity of lung damage, and extrapulmonary organ dysfunction.3,4,6,11–14 A joint American-European Consensus Conference formalized the criteria for diagnosing acute lung injury (ALI) and ARDS,15 but no 2 single large2–5,11 or small7–10 epidemiological studies that used the American-European Consensus Conference ARDS definition found the same mortality predictors. Many of those studies used median or mean values of selected variables collected during the ARDS disease process and multivariable regression analysis to analyze the associations between those variables and ARDS outcomes, including death. Although reductionism to a mean or median value is useful, it can lead to loss of important information about the whole population, as was shown in a recent meta-analysis on the benefits of antidepressant medications. That meta-analysis included 5,133 patients in 35 randomized controlled trials in which the use of mean or median values gave a poor description of outcome.16
We analyzed data from a prospective multicenter study of patients who met the American-European Consensus Conference ARDS definition and were mechanically ventilated with a lung-protective ventilation strategy. Previously we assessed whether a systematic method, namely a fixed standard ventilation setting, identifies patients with persistent ARDS, and those results were published elsewhere.17 In the present study we explored whether selected respiratory, ventilation, and physiological variables at the time the patients met the ARDS definition were associated with intensive care unit (ICU) mortality, independent of the underlying disease or specific therapy. Our assessment of established ARDS after a PEEP/FIO2 trial was described elsewhere,17 but none of the outcome data reported in the present study have been published.
We hypothesized that tertile stratification of values of clinical variables, collected at the time the patients meet the American-European Consensus Conference ARDS criteria, would identify patients at greater risk of ICU death.
Methods
This study was approved by the Ethics Committee for Clinical Research at Hospital Universitario Nuestra Señora de Candelaria, Tenerife, Spain, the Hospital Universitario Río Hortega, Valladolid, Spain, and the local institutional review boards of all the participating hospitals.
Study Population
We performed a secondary analysis of data from 2 independent multicenter trials of ARDS patients ventilated with a lung-protective ventilation strategy. For the model derivation population, data were drawn from the Hospitales Españoles Para el Estudio de la Lesión Pulmonar (HELP) study database.17 We analyzed only patients who developed ARDS. The database comprised 170 ARDS patients admitted into the HELP network of ICUs. All consecutive patients age > 18 years and who met the American-European Consensus Conference ARDS criteria15 were screened for enrollment, regardless of their current status or medical history. We excluded only patients with brain death, terminal-stage cancer, or a do-not-resuscitate order. The validation cohort consisted of 50 patients with established ARDS in the low-tidal-volume (low-VT, 5–8 mL/kg predicted body weight) arm of the Acute Respiratory Insufficiency España Study (ARIES study), by our group.18 Our validation cohort also had the data obtained at the time the patients initially met the American-European Consensus Conference ARDS criteria.
Study Design
We examined whether it is possible to assess and quantify the risk of death in ARDS patients at the onset of ARDS (at the time the patients meet ARDS criteria), independent of their disease process or response to treatment, provided they were ventilated with a lung-protective strategy. Patients were screened daily, and data were recorded with standardized forms. We recorded demographic data (eg, age, sex); cause of ARDS (eg, sepsis, pneumonia, multiple trauma, aspiration); timing of ARDS (ICU days before meeting the ARDS definition); respiratory physiology (PaO2/FIO2, PaCO2, pH, respiratory-system compliance); and mechanical ventilation data (FIO2, VT, respiratory rate, PEEP, plateau airway pressure [Pplat]) at the time of ARDS onset, before considering changing the ventilator settings. We also recorded Acute Physiology and Chronic Health Evaluation (APACHE II) score during the first 24 hours in the ICU19 and Lung Injury Severity Score20 at the time the patient met the ARDS definition. We also recorded the occurrence of shock and number of organ failures included in the Sequential Organ Failure Assessment scale21 at baseline, ICU and hospital stay, and ICU mortality. All patients were mechanically ventilated. For ventilator management we recommended a VT of 5–9 mL/kg predicted body weight, a respiratory rate that maintained adequate PaCO2, and PEEP and FIO2 that maintained PaO2 > 60 mm Hg or SpO2 > 90%. None of the patients in this cohort received activated protein C or corticosteroids as an adjunctive treatment.
Statistical Analysis
For the power calculations for the derivation cohort we estimated an absolute 20% mortality-rate reduction, alpha = .05 and power goal = 0.80. Since there are no previous studies on tertile stratification in ARDS patients, we studied various group-size scenarios with cohorts of 132 to 168 patients. The population size of our derivation cohort satisfied all scenarios.
Data are reported as percentages or mean ± SD unless otherwise specified. We compared continuous variables with the Student t test. We used the Mann-Whitney U rank test for variables with non-normal distribution. We compared categorical variables with the chi-square and Fisher exact tests. We stratified the derivation cohort values into low, middle, and high risk tertiles, with approximately equal numbers of patients. Since no data on tertiles have been previously reported in the ARDS literature, we did not choose any pre-selected cutoff points for tertiles; instead, the tertile ranges were based on the distribution of each variable in the derivation cohort. Specifically, we listed in sequential order the actual values of each variable for all 170 patients and divided them into 3 equal groups. We performed a univariate analysis of each variable, categorized by its tertiles, as a predictor of outcome. We used the chi-square statistic (displayed as P for trends) to determine the overall significance of each independent association between the variable and the outcome. We also calculated the relative risk of death and 95% confidence intervals associated with each tertile with the Jeffrey interval for a binomial proportion.
Once we had determined the ICU mortality associated with each tertile range, we identified the variables that could be used as regressors, based on the P values. We tested for linear trends across tertiles using the tertile with the lowest mortality as the reference group and considering the tertile as an ordinal variable. We deemed the tertiles associated with the highest mortality as risk factors for ARDS death. We aimed to develop a simple score based on the number of high-risk tertiles to identify patients with the best and worst outcomes. Then we analyzed the cross-tabulated groups with the chi-square test.
To assess the internal validity of our model, we adjusted for multiple comparisons with the Monte-Carlo simulation test, with multiple comparisons extracted from the database.22,23 We assessed external validation by applying our model to an independent patient population, and tested with the maximum-likelihood chi-square test.24,25 We evaluated the model discrimination in the derivation and validation cohorts with the area under the receiver operating characteristic curve26 and compared the overall performance of our model to that of APACHE II score. All analyses were performed with statistics software (SPSS 15.0, SPSS, Chicago, Illinois). We considered a 2-sided P value < .05 statistically significant.
Results
On average, the 170 patients in the derivation cohort met the American-European Consensus Conference ARDS criteria at 2.4 ± 3.0 days after ICU admission. The overall ICU mortality was 34%. None of the primary ARDS risk factors was significantly associated with ICU outcome. In general, patients who died were older, had higher APACHE II score, and stayed more days in the ICU before ARDS onset than did survivors. The mean respiratory and ventilation variables showed no statistically significant differences between the survivors and non-survivors (Table 1).
We divided the data into tertiles (Table 2). Applied PEEP at study entry was the only variable that could not be distributed with a comparable number of cases in each tertile. The applied PEEP at ARDS onset was mostly (and unevenly) distributed at the PEEP levels 5 cm H2O (n = 24), 8 cm H2O (n = 37), 10 cm H2O (n = 44), and 12 cm H2O (n = 20). Despite the lack of comparable tertile sizes for PEEP, a re-analysis of their distribution showed that the best associations between increasing PEEP level at the time of ARDS onset and increasing ICU mortality were PEEP < 8 cm H2O (n = 46), PEEP 8–9 cm H2O (n = 41), and PEEP ≥ 10 cm H2O (n = 83), although this trend did not reach statistical significance (P = .11) (Table 3). The tertile distribution for the other variables at ARDS onset identified patients with a wide range of ICU mortality risk, although only the tertile distribution for Pplat and PaO2/FIO2 reached statistical significance (see Table 3).
Subjects in the tertiles age > 62 years, Pplat > 29 cm H2O, and PaO2/FIO2 < 112 mm Hg had significantly higher ICU mortality than the patients in the other 2 tertiles for each variable. The relative risk of death, for comparison of the patients in the highest-risk tertile and those in the lowest tertile at study entry was 2.5 (95% CI 1.4–4.5) for age, 2.4 (95% CI 1.8–3.3) for Pplat, and 1.59 (1.0–2.6) for PaO2/FIO2 (see Table 3). Each of those 3 variables correlated independently with ARDS outcome. When considering only age, Pplat, and PaO2/FIO2, we found large differences in the mortality risk across the 4 possibilities for increasing presence of highest-risk tertiles (ie, none, 1, 2, or 3 highest-risk tertiles) (Fig. 1). The 50 patients with no highest-risk tertiles had a 12% mortality rate (95% CI 6–21%). The 79 patients with only one highest-risk tertile had a 33% mortality rate (95% CI 25–42%, relative risk 2.7, 95% CI 1.8–4.5). The 31 patients with 2 highest-risk tertiles had a 52% mortality rate (95% CI 37–66%, relative risk 4.3, 95% CI 2.8–7.0). The 10 patients with all 3 highest-risk tertiles had a 90% mortality rate (95% CI 67–98%, relative risk 7.5, 95% CI 4.9–11.5) (P < .001). The log odds ratios describing the effect of being in the highest-risk tertile for age, Pplat, and PaO2/FIO2 versus being in the other 2 tertiles were 0.58, 0.86, and 0.58, respectively.
Using Monte-Carlo simulation, the chi-square statistic computed for each data set validated the statistical significance of the 3 risk factors (age plus Pplat plus PaO2/FIO2) with similar statistical significance (P < .001). External validation of our model to the low-VT group from the ARIES trial (n = 50)18 showed good performance for our model. Table 4 describes the validation cohort. Overall ICU mortality was 32% (Table 5). When the highest-risk values from the derivation cohort were examined in the validation population, we found that the 11 patients with no highest-risk tertiles had a mortality of 0%. The 26 patients with only one highest-risk tertile had a mortality of 27%. The 9 patients with 2 highest-risk tertiles had a 56% mortality. And the 4 patients with all 3 highest-risk tertiles had 100% mortality (P < .001).
When we compared the receiver operating characteristic curve for the model in the derivation and validation cohorts to the APACHE II scores, our tertile model outperformed APACHE II (Fig. 2). The area under the curve for the tertiles model in the derivation data set was 0.725, compared to 0.695 for APACHE II. For the validation cohort the area under the curve for the model was 0.810, whereas the area under the curve for APACHE II was 0.620 (P < .001).
Discussion
Our main finding is that our prediction model based on tertiles of age, Pplat, and PaO2/FIO2 at the time patients ventilated with a lung-protective ventilation strategy met American-European Consensus Conference ARDS criteria, identifies patients with the lowest and highest risk of ICU death. The present study is the first to explore ARDS mortality prediction based on tertiles of respiratory and ventilation variables. Risk stratification using tertiles is a common practice in the endocrine and cardiovascular literature.27–30 By evaluating physiological variables and biomarkers involved in the development and progression of several disease states, those studies revealed that tertile stratification can predict a profile associated with the greatest or the lowest risk for a selected outcome. Tertiles allow expression of clinical values in ordinal range categories, akin to how clinicians routinely categorize patients into risk groups. In our study, tertile categorization of certain variables detected useful information about the overall population that may not be as evident when evaluating the mean values of those variables. Tertile categorization could have several benefits. First, patients in the highest-risk tertile may warrant closer clinical attention to improve outcomes. On the other hand, tertiles might identify patients in whom benefit from treatment may be limited or disproportional to the resources used. Some interventions may have greater benefits in patients with moderate disease severity (middle tertile). Second, the identification of ARDS patients in the lowest-risk tertile could permit substantial cost savings in hospital days, through appropriate and efficient discharge planning.
The major 3 reasons substantiating the use of tertiles are:
Simplicity: Tertiles are easy to apply at the bedside.
Stability: Tertiles have been compared to other percentiles of the distribution studied, and the high-risk tertile is near the center of a stable plateau-shaped significance maximum.
Reasonable Group Size: The number of patients with the maximum score (clinical values at ARDS onset in all 3 high-risk tertiles) was about 6% in our study. This is above the expected 1/27 proportion expected for the upper tertiles of 3 independent variables, since all of our variables are predictors of severity: they are not independent.
In most epidemiological studies that have used the American-European Consensus Conference ALI and ARDS definitions,2–5,7–11 age, the underlying medical condition, degree of lung damage, extrapulmonary organ dysfunction, and ongoing sepsis are the most commonly reported predictors of mortality. In general, outcome is worse with increasing age.31 Patients with more severe lung disease tend to have lower PaO2/FIO2. One of the most interesting of our findings is the importance of the initial PaO2/FIO2 (at the time of ARDS diagnosis) as a survival predictor in ARDS patients, independent of the underlying disease. In our 2 independent ARDS populations, the mean PaO2/FIO2 at study entry was similar in ICU survivors and non-survivors. However, by stratifying PaO2/FIO2 in tertiles, the tertile of PaO2/FIO2 < 112 mm Hg identified patients with an absolute mortality that was almost double that of the other 2 tertiles combined (47% vs 25%, P < .001). We think that this subgroup of patients should be the target of extraordinary measures in clinical trials aimed to decrease ARDS mortality. This speculation is supported by the results of 2 recent large systematic reviews/meta-analyses of clinical trials in patients with ALI/ARDS that evaluated the survival impact of prone positioning32 and high PEEP.33 Prone positioning and high PEEP were associated with significantly improved survival only in the subgroup of patients with the lowest PaO2/FIO2 range.
One third of our derivation population had an initial Pplat > 29 cm H2O. Note that the initial mean VT (data not shown in the tables) in each of the Pplat tertiles did not differ (7.5 ± 1.8 mL/kg, 7.7 ± 1.4 mL/kg, and 7.8 ± 1.7 mL/kg for the low, middle, and high tertiles, respectively, P = .62). It is well established that there is a direct relationship between Pplat and mortality.34–37 In many epidemiological studies that used the American-European Consensus Conference ALI and ARDS definitions2–5,7–11 the impact on outcome of VT or Pplat was not evaluated. In most of those studies the patients were treated with VT above the current recommendation. In a recent secondary analysis of patients with ALI, screened during 1999–2000, Cooke et al11 found that the mortality predictors were similar to those in the general population of critically ill patients. Variables that predicted mortality were age, non-pulmonary organ dysfunction, history of leukemia and hepatic or congestive heart failure, arterial pH, ICU stay prior to ALI onset, minute ventilation, and PaCO2. Our study did not fully reproduce those findings; only mean age and ICU stay prior to ARDS onset were significantly different in survivors versus non-survivors.
It is important to emphasize that our data, in both the derivation cohort (patients from the HELP study)17 and the validation cohort (patients from the ARIES treatment arm),18 were from the time the patients met the American-European Consensus Conference ARDS definition,15 and the patients were subsequently ventilated with a lung-protective ventilation strategy, so we cannot expect this model to hold for patients ventilated in a non-lung-protective manner. It is quite clear that large VT and high Pplat cause ventilator-induced lung injury on top of the preexisting ARDS, and we do not expect our model to predict outcomes in that setting.
It is also important that Pplat be measured during a passive inspiration followed by an end-inspiratory pause of sufficient duration to allow the pressure to plateau. We do expect our model to apply to all patients with ARDS, regardless of etiology or comorbidities. The prediction model was developed from 170 patients in the HELP study,17 which had no exclusion criteria: all patients, regardless of ARDS etiology or prognosis were included.
Limitations
Because of several study limitations our findings must be considered preliminary and exploratory. First, although our model showed external validity with an independent group of ARDS patients, our validation cohort sample was relatively small. Second, additional variables that influence ICU outcome may be identified in further derivation cohorts. Third, continuous multivariate methods may be more practical in cases where variables have intricate dependencies and associations with outcome that might not be correctly represented by tertiles; however, we have no reason to believe that this is applicable to the variables we studied.
Our study has several strengths. First, our study design included all consecutive patients who met the ARDS criteria, except patients with brain death, terminal-stage cancer, or a do-not-resuscitate order, so we believe that our patients closely represented routine ARDS patients. Second, patients were in a multidisciplinary network of teaching hospitals, not just one institution. Third, we validated our model with an independent cohort of patients with established ARDS. Fourth, our prediction model combines variables of potentially modifiable severity (PaO2/FIO2 and Pplat) and a non-modifiable risk factor (age) that are readily measured at the bedside. Finally, our tertiles model outperformed APACHE II in predicting mortality. APACHE II is a composite score developed to assess the risk of hospital death across a spectrum of illnesses rather than for a particular disease process such as ARDS. However, our approach must be further validated with other ARDS populations and by other groups. Clearly, our a posteriori selection of patients from our own previous studies may have biased our results in some unknown way. Regardless of whether the values in the high-risk tertiles are associated with higher mortality, the differences in care (ventilation and oxygenation) require further investigation. Since our study was observational and we did not exclude any patients during the study period, we are aware that patient characteristics influence physicians' treatment decisions and could be important in guiding future studies. The measure of potential confounders that might influence both survival and care decisions is beyond the scope of the present study. We do not know whether the treatment-dependent variables (PaO2/FIO2 and Pplat) will influence the performance of our model under different practice patterns. We acknowledge that our results apply only to a prescribed set of circumstances. If circumstances change (eg, a novel method of oxygenation/ventilation or a new therapy for severe sepsis), the distribution may differ.
Conclusions
The use of tertile stratification for age, Pplat, and PaO2/FIO2 at the time patients meet ARDS criteria can identify patients who are at greatest risk of death in the ICU. This model should be further assessed in future studies.
Footnotes
- Correspondence: Jesús Villar MD PhD, Multidisciplinary Organ Dysfunction Evaluation Research Network, Hospital Universitario Dr Negrin Barranco de la Ballena, s/n 4th Floor, South Wing, 35010 Las Palmas de Gran Canaria, Canary Islands, Spain. E-mail: jesus.villar54{at}gmail.com.
↵* Members of the Hospitales Españoles Para el Estudio de la Lesión Pulmonar (HELP) Network are listed in the supplementary materials at http://www.rcjournal.com.
-
This research was partly supported in part by grant 07/0113 from Fondo de Investigación Sanitaria, Madrid, Spain, and by the Asociación Científica Pulmón y Ventilación Mecánica, Las Palmas, Spain.
-
To avoid potential conflict of interest, Editor in Chief Dean R Hess was blinded to the peer review process, deferring to Deputy Editor Richard D Branson MSc.
-
See the Related Editorial on Page 533
- Copyright © 2011 by Daedalus Enterprises Inc.