Abstract
BACKGROUND: Respiratory muscle strength is an important part of lung function. Assessment of the respiratory muscles' ability to generate force is important for recognizing respiratory muscle weakness in both sick and healthy people.
OBJECTIVE: To assess the test/retest reliability of the MicroRPM portable manometer's measurements of maximum inspiratory pressure (MIP) and maximum expiratory pressure (MEP) in the sitting and standing positions; the number of expiratory maneuvers needed with the MicroRPM for reliability in MIP and MEP measurement; and the MicroRPM's test/retest reliability in other respiratory function indices, such as the maximum rate of pressure development (MRPD), the time constant of relaxation (tau), and the maximum relaxation rate (MRR).
METHODS: We recruited 15 healthy volunteers (mean age 21.6 ± 1.1 years). We assessed respiratory muscle strength on 3 separate occasions, each a week apart. We calculated reliability with the intraclass correlation coefficient (ICC), the standard error of measurement (SEM) and the smallest detectable difference (SDD).
RESULTS: MicroRPM reliably measured MIP and MEP in both the sitting position (ICC 0.86–0.90, SEM 9–10, SDD 18–22) and standing position (ICC 0.78–0.83, SEM 12–14, SDD 23–26). After a 5-breath practice, 2 expiratory/inspiratory maneuvers on each testing occasion gave adequate MIP and MEP reliability (ICC > 0.90). MRR reliability was moderate to excellent (ICC 0.58–0.87), MRPD reliability was moderate (ICC 0.59–0.64), and tau reliability was insufficient (ICC 0.27–0.67).
CONCLUSIONS: The MicroRPM reliably measures MIP and MEP, but its MRPD, MRR, and tau measurements should be considered with caution.
- respiratory muscle
- lung function
- maximum expiratory pressure
- maximum inspiratory pressure
- manometer
- reliability
Introduction
Respiratory muscle strength is an important part of lung function.1 Assessment of the respiratory muscles' ability to generate force is important for recognizing respiratory muscle weakness in both sick and healthy people.2 The importance of its assessment is represented by the fact that the findings from an appropriate evaluation of respiratory muscle strength may provide insight into unexplained dyspnea and insufficient cough.3 It is also useful as a complementary outcome measure for many disorders,4 such as neuromuscular disorders,5 COPD,6 and heart failure.7 These reasons have strengthened the need to extensively research the development of reliable manometers for measuring respiratory muscle strength.
The literature describes many respiratory muscle-strength-measurement tools,3,8 which can be divided into invasive and noninvasive.8 Although invasive techniques, such as esophageal and gastric balloons for recording esophageal, gastric, and transdiaphragmatic pressure, are considered more reliable, they require difficult, long, and unpleasant procedures.3 Therefore, noninvasive procedures, such as measurement of mouth or nasal pressure, which are easily performed, are usually preferred and are widely applied and accepted,8 despite the fact that their results are effort-dependent, resulting in potentially less accurate values than non-volitional methods such as electrical stimulation, magnetic stimulation, or phonomyography.3,8
The maximum inspiratory pressure (MIP) and maximum expiratory pressure (MEP) reflect the respiratory muscles' ability to generate force during a short quasi-static contraction.9 MIP and MEP measurement is conducted with a manometer that measures mouth-pressure, and can be affected by lack of patient motivation or coordination.3 In addition to their noninvasive characteristics and the fact that normative MIP and MEP values have been established in many studies,10–13 the portability of a MIP/MEP manometer enables assessment both in the clinic and at the bedside.14,15
The MicroRPM (Micro Medical/CareFusion, Kent, United Kingdom) is a modern manometer that was recently used in studies that recorded MIP and MEP.16,17 Although maximum-mouth-pressure measurements with some portable manometers have been found reliable and valid both in healthy volunteers and pulmonary and neuromuscular patients,2,9,14,18–21 there is no published evidence about the reliability of the MicroRPM. Most previous manometer reliability studies used inappropriate or insufficient statistical indices of reliability (eg, Pearson correlation coefficient), making their assumptions problematic.2,9,14,20
Moreover, the measurement reliability of other respiratory-function indices that can be measured with the MicroRPM is unknown. These indices are the maximum rate of pressure development (MRPD, which is the initial linear incline of the maximum respiratory pressure curve, which reflects muscle function), the time constant of relaxation (tau), and the maximum relaxation rate (MRR). A slowing in the rate of decline, confirmed by an increase of tau or a slowing of MRR, reflects respiratory muscle fatigue.20
The main objectives of this study were to investigate the MicroRPM's test/retest reliability for measuring MIP and MEP in healthy people, in both the sitting and standing positions, and to determine the needed number of repetitions with each subject, in one session, to obtain reliable MIP and MEP values. We recruited healthy people to find the measurement error independent of any pathologies. A secondary objective was to examine the MicroRPM's test/retest reliability in measuring MRR, MRPD, and tau. Test/retest reliability is defined as the degree to which an instrument's measurements are stable when the measurement administered to the same individuals over a specified time period and the measurement conditions remain the same.22
Methods
This study was approved by the ethics committee of, and performed in the Physiotherapy Department of the Technological Educational Institution of Lamia, Lamia, Athens, Greece, and performed according to the Declaration of Helsinki. All participants gave written informed consent.
Subjects
We recruited a convenience sample of 15 healthy volunteers (9 males, 6 females) who were physiotherapy students of the Technological Educational Institution of Lamia. Participants were required to be between 18 and 50 years old and healthy to be included. The exclusion criteria were obesity, history of respiratory or neuromuscular disease, pulmonary infection, menstrual period, smoking, and drug consumption during the 2 weeks before the measurements, as these are known to influence respiratory function. To screen for study eligibility we used a demographics questionnaire, and a digital body fat caliper (FatTrack II, Accufitness, Greenwood Village, Colorado) for determining obesity.
Protocol
Demographic characteristics were recorded at baseline, followed by recording of MIP and MEP, in 3 testing sessions, a week apart from each other. The measurements took place at the same time of day for each participant. Volunteers were asked to perform 18 maximum inspiratory efforts and 18 maximum expiratory efforts in both the sitting and standing position,23,24 with a 30-seconds interval between efforts, and 3-min interval between the 4 test conditions (sitting MIP, standing MIP, sitting MEP, and standing MEP). The tests were in random order. At all the testing sessions, instructions about the procedure were given in a standardized manner. All the measurements were performed by an appropriately trained physical therapist.
Measurements
MIP and MEP were measured with the MicroRPM, which is a small, portable, lightweight, noninvasive, mouth-pressure manometer with a rubber flanged mouthpiece and a small monitor that digitally displays the test results in cm H2O. The display uses a piezo resistive pressure-sensing technology. The MicroRPM's Software (Puma PC, Micro Medical, Kent, England) calculates the MIP, MEP, MRPD, MRR, and tau. The MIP and MEP values are calculated from the one-second average maximum pressure.
Before the measurements, each participant performed 5 maximum inspiratory and 5 maximum expiratory warm-up efforts to familiarize with the procedure. During the MIP and MEP measurement the participant was asked to hold the gauge with both hands and to close his or her lips firmly around the flanged mouthpiece. We applied a nose clip to avoid nasal air leak. For the MEP maneuver, the participant was asked to inhale as much as possible and then to exhale maximally for more than one second against the resistance of the gauge. For the MIP maneuver the participant was asked to exhale as much as possible (to residual volume) and then to inhale maximally for more than one second against the resistance of the gauge.4 All measurements were recorded for analysis.
Data Analysis
Data analysis was performed with statistics software (SPSS 15.0, SPSS, Chicago, Illinois). The normality of the data were evaluated with the Kolmogorov-Smirnov test. The test/retest reliability of all respiratory measurements was calculated with the intraclass correlation coefficient (ICC), the standard error of measurement (SEM, which is the square root of the within-subject mean squared error from the repeated-measures analysis of variance), and the smallest detectable difference (SDD, which is 1.96 × × SEM, expressed as a percentage of the parameter's grand mean).22 We calculated the inter-session test/retest reliability for all 3 sessions (all-3-inter-session ICC) and for the last 2 sessions (last-2-inter-session ICC). We calculated the intra-session ICC for both the mean MIP and MEP and the best MIP and MEP of the 18 repetitions. ICC values more than 0.6 were considered acceptable reliability, and ICC values more than 0.8 were considered high reliability.25 Sample size calculation revealed that the appropriate sample size for ICC values ≥ 0.9 was 11 participants (w = 0.2, α = .05, 3 measurements).26 We recruited 15 participants to increase the power.
We used the Bland-Altman method, which includes a scatter plot of the differences between test and retest against their mean, to determine the magnitude of disagreement between the second and third sessions' values.27 This method also includes the 95% limits of agreement of differences and the mean difference. To evaluate changes over time in an individual (eg, the effect of clinical rehabilitation), the magnitude of the change has to exceed the inherent variability of the outcome. Thus, if the limits of agreement are very broad and clinically important, it suggests that test/retest could result in a clinically meaningful intra-patient difference.
We used the Spearman-Brown formula to estimate the number of repetitions (k) necessary to achieve an ICC > 0.9.28 We calculated the reliability coefficient of the mean (Rk) by averaging k repetitions with a one-measure reliability coefficient (R):
We calculated the required number of repetitions (k) as:
where R* is the target coefficient of reliability.
Results
Table 1 describes the subjects, who were young, thin, and with no mouth pathologies that could preclude maximum effort or mouth pressure.
The Kolmogorov-Smirnov test showed that all the data were normally distributed, so we used parametric statistics. Table 2 shows the inter-session ICC, SEM, and SDD values. The all-3-inter-session ICC range was 0.82–0.90 for MEP, and 0.78–0.87 for MIP. MIP and MEP had higher ICC values in the sitting position (mean MEP all-3-inter-session ICC 0.88, best MEP all-3-inter-session ICC 0.90, mean MIP all-3-inter-session ICC 0.86, best MIP all-3-inter-session ICC 0.87) than in standing position (mean MEP all-3-inter-session ICC 0.82, best MEP all-3-inter-session ICC 0.83, mean MIP all-3-inter-session ICC 0.78, best MIP all-3-inter-session ICC 0.81), but these reliability indices were very good for both positions.
The mean scores yielded similar ICC values (mean MEP all-3-inter-session ICC range 0.82–0.88, mean MIP all-3-inter-session ICC range 0.78–0.86) to the best scores (best MEP all-3-inter-session ICC range 0.83–0.90, best MIP all-3-inter-session ICC range 0.81–0.87) in both positions.
All of these reliability indices were improved when the results from the first testing occasion were discarded. The SEM and SDD values were relatively small for all the variables. The Bland-Altman plots (Figs. 1 and 2) show that the limits of agreement were reasonably narrow, indicating small bias and systematic error. MRR had moderate to excellent reliability (all-3-inter-session ICC range 0.58–0.87), MRPD had moderate reliability (all-3-inter-session ICC range 0.59–0.64), and tau had insufficient reliability (all-3-inter-session ICC range 0.27–0.67). MRR, MRPD, and tau all had high SEM and SDD values, which makes their clinical application questionable.
Table 3 shows the intra-session ICC values, which were all very high (intra-session ICC range 0.87–0.91). Only 2 expiratory maneuvers and one standing inspiratory maneuver were needed for an ICC > 0.9. Figure 3 shows the incremental curves of MIP and MEP reliability, calculated with the Spearman-Brown formula.
Discussion
This study provides new evidence regarding the reliability of the MicroRPM, suggesting that it can be used with high confidence in research and especially in clinical practice for the assessment of MIP and MEP. The ICC values we obtained for most of the MIP and MEP measurements were higher than 0.8, indicating high reliability.25 Moreover, the low SEM and SDD values also confirm the usefulness of these measurements in clinical practice, as only a small amount of error needs to be overcome for detecting a real change.22 Discarding the first-session values increased the reliability further, which can be attributed to a partial elimination of learning effects. Interestingly, only 2 repetitions per session were needed to obtain reliable MIP and MEP measurements. Although the MIP and MEP measurements were equally reliable, the sitting position gave more reliable estimates than the standing position. However, the MRR, MRPD, and tau cannot be considered reliable.
Our MIP and MEP results agree with other reports.2,9,14,20,21 Larson and co-workers9 assessed the reliability of an aneroid manometer for measuring MIP in patients with COPD. Although that analysis was inappropriately performed with the Pearson correlation coefficient (r), MIP reliability was very high (r 0.97) when the MIP data were taken from the third and forth test sessions. Similar were the conclusions by Hamnegard and co-workers,14 who compared MIP and MEP measurements with a portable manometer versus a laboratory standard, in healthy people and patients with respiratory disease. Hamnegard et al concluded that the portable manometer was reliable and accurate for MIP and MEP, as the Bland-Altman plot revealed small bias and narrow limits of agreement. However, their Bland-Altman plots compared the agreement between 2 sessions rather than the agreement of the portable-manometer measurements to those from the laboratory tool, which put more focus on reliability than on accuracy.
McConnell and Copestake2 examined the reliability of a hand-held manometer for measuring MIP and MEP in healthy elderly subjects, and constructed Bland-Altman plots similar to ours in the present study. The manometer they tested was very reliable: the agreement between the 2 sessions was −0.3 ± 6.4 cm H2O for MIP, and 1.4 ± 5.1 cm H2O for MEP. Those limits of agreement are better than we found in the present study, which might be explained by the different study populations. The elderly tend to have less respiratory muscle strength than healthy young adults. Our Bland-Altman plots show a slightly greater dispersion of MIP and MEP around the mean difference of the 2 sessions when MIP and MEP are high. This suggests that lower respiratory strength (such as in the elderly) might lead to less bias and narrower limits of agreement. On the other hand, although Romer and McConnell20 examined the reliability of a hand-held manometer for measuring MIP and MEP in a population quite similar to ours, they came to a similar conclusion, as their Bland-Altman plots showed negligible bias and narrow limits of agreement. However, in that study the interval between the 2 sessions ranged from 2 days to 3 weeks, which affects the standardization of the procedure and might explain their narrower limits of agreement, because the potential measurement of most of the participants during the first week may give less potential for historical effects. Finally, Maillard and co-workers21 reported high reliability (ICC 0.88–0.92) of MIP in young healthy people, measured with a portable manometer. Those ICC values agree with our findings and indicate clinically acceptable reliability of these manometers.
Our MEP values are similar to those in other reports, considering sex, equipment (flanged mouthpiece), and protocol, whereas our MIP values are slightly higher.4,13 A possible reason is that we measured MIP starting at residual volume. Evans and Whitelaw,13 in their 2009 review, concluded also that MIP is approximately 16% larger at residual volume, in comparison with TLC. To establish MicroRPM's accuracy, however, requires a separate study of MicroRPM versus validated tools and techniques.
Another interesting finding of the present study is the similarity of MIP and MEP reliability in the standing and sitting positions, which were both highly reliable, especially when the first-session data were omitted. Thus, the standing or sitting position can be used with confidence, depending on the protocol under investigation.
Interestingly, unlike MIP and MEP, the MRPD, MRR, and tau were less reliable, which does not support their use in clinical research. However, these results disagree with the findings by Romer and McConnell.20 Their Bland-Altman plots suggested that, in contrast to MRPD, the MRR and tau had acceptable bias and limits of agreement. Although the discrepancy between these studies is difficult to explain, we think that reducing the time interval between the test sessions may give more consistent values. Therefore, the reliability of MRPD, MRR, and tau remain unclear and merit future study.
One of our main objectives was to determine the number of inspiratory and expiratory maneuver repetitions per session needed for reliable MIP and MEP, since there is no consensus in the literature.23,24 Our volunteers all performed 18 repetitions per session (the maximum suggested number of repetitions), which can be extremely time-consuming, especially since each effort has to be followed by an adequate recovery period to avoid fatigue. Moreover, factors such as fatigue (peripheral and central), learning effect, and inattention over time can all affect reliability estimates. This is obvious in Table 3, where the real ICC values were rarely higher than 0.90, despite 18 repetitions (after 5 warm-ups), in contrast to the estimated ICC values obtained with the Spearman-Brown formula (see Fig. 3), where those factors could not be considered. On the other hand, our statistical analysis indicates that only 2 repetitions, average, are needed to obtain excellent reliability (ICC > 0.90) for both MIP and MEP. This conclusion is confirmed by both the predicted ICC values obtained with the Spearman-Brown formula and the real ICC values calculated from the first 2 repetitions (see Table 3). These results also agree with previous research24,29 and this finding emphasizes that, even without prior submaximal respiratory activity (as has been suggested), a well organized protocol including a familiarization procedure (ie, with 5 maximum efforts) can provide reliable MIP and MEP measurements with very few repetitions.
Our findings are very important for clinicians and researchers. Clinicians are provided with a reliable, noninvasive, and convenient MIP and MEP measurement tool that is acceptable to patients and can be used at the bedside, sitting or standing. The mean and maximum MIP and MEP values from 18 repetitions were equally reliable, and choosing the number of repetitions can be dependent on the rationale of the researcher and the purpose of the study.
Our findings also suggest additional studies of the MicroRPM such as its inter-rater reliability and the validity of its measurements compared to well established invasive and noninvasive techniques. The MicroRPM should also be validated in various patient populations, such as patients with COPD, to examine its suitability to investigate and diagnose disease.
Conclusions
The MicroRPM reliably measured MIP and MEP in the sitting and standing positions. Following a 5-breath maximum-effort warm-up, 2 repetitions are needed to ensure MIP and MEP reliability. The MicroRPM's MRPD, MRR, and tau measurements should be interpreted with caution, as their reliability has not been confirmed. The establishment of MicroRPM's MIP and MEP reliability provides clinicians and researchers with a valid noninvasive measurement tool for respiratory muscle strength, which has high clinical utility.
Acknowledgments
We thank Labrini Komianou PT for technical assistance, Vagellis Vourazanis PT for help during data selection, Ourania Kolokotroni MD for reviewing the manuscript, and Papathomas Michail MD, Department of Epidemiology and Public Health, Imperial College London, for assistance with statistical analysis.
Footnotes
- Correspondence: Eleni Kapreli PhD, Physiotherapy Department, Technological Educational Institution of Lamia, 3rd Km Old National Road, Lamia, Athens, 35100 Greece. E-mail: ekapreli{at}teilam.gr.
-
The authors have disclosed no conflicts of interest.
- Copyright © 2011 by Daedalus Enterprises Inc.