Effects of Intensity of Rehabilitation After Stroke
A Research Synthesis
- Gert Kwakkel, MSc;
- Robert C. Wagenaar, PhD;
- Tim W. Koelman, PT;
- Gustaaf J. Lankhorst, MD, PhD;
- Johan C. Koetsier, MD, PhD
- Gert Kwakkel, MSc;
- Robert C. Wagenaar, PhD;
- Tim W. Koelman, PT;
- Gustaaf J. Lankhorst, MD, PhD;
- Johan C. Koetsier, MD, PhD
Abstract
-
Background and Purpose A research synthesis was performed to (1) critically review controlled studies evaluating effects of different intensities of stroke rehabilitation in terms of disabilities and impairments and (2) quantify patterns by calculating summary effect sizes. The influences of organizational setting of rehabilitation management, blind recording, and amount of rehabilitation on the summary effect sizes were calculated.
Methods A Medline literature search was performed for a critical review of the literature. The internal and external validity of the studies was evaluated. In addition, a meta-analysis was performed by applying the fixed (Hedges’s g) effects model.
Results The effects of different intensities of rehabilitation were studied in nine controlled studies involving 1051 patients. Analysis of the methodological quality revealed scores varying from 14% to 47% of the maximum feasible score. Meta-analysis demonstrated a statistically significant summary effect size for activities of daily living (0.28±0.12). Lower summary effect sizes (0.19±0.17) were found for studies in which experimental and control groups were treated in the same setting compared with studies in which the two groups of patients were treated in different settings (0.40±0.19). Variables defined on a neuromuscular level (0.37±0.24) showed larger summary effect sizes than variables defined on a functional level (0.10±0.21). Weighting individual effect sizes for the difference in amount of rehabilitation between experimental and control groups resulted in larger summary effect sizes for activities of daily living and functional outcome parameters for studies that were not confounded by organizational setting.
Conclusions A small but statistically significant intensity-effect relationship in the rehabilitation of stroke patients was found. Insufficient contrast in the amount of rehabilitation between experimental and control conditions, organizational setting of rehabilitation management, lack of blinding procedures, and heterogeneity of patient characteristics were major confounding factors.
Key Words:
In a 1989 review of efficacy of rehabilitation after stroke, Dobkin1 asserted that “convincing evidence concerning the therapeutic usefulness of stroke rehabilitation does not yet exist.” Other reviewers claim that any of the available approaches in physical and occupational therapy will improve the patient’s functional status,2 3although spontaneous neurological recovery appears to account for most of the improvement in functional ability.4 5 Wagenaar and Meyer6 7 concluded that the effects of rehabilitation methods are highly specific, indicating that improvements on the impairment level do not generalize to disabilities (ADL). In addition, they found that expert care has a positive effect on functional recovery compared with traditional care. However, in their opinion it remains unclear which part of expert care (eg, team care, active family participation, special staff education, early start of treatment, and/or intensity of treatment) has caused this effect. Recently, Gladman and colleagues8 stated that “till so far the knowledge about what it is in the ‘black box’ of a stroke unit that is effective is lacking.” The results of a few controlled studies suggest that an early start of intensive treatment is an important aspect of expert care.4 6 7
Langhorne et al9 10 11 showed in a meta-analysis combining the findings of 10 randomized controlled trials that stroke rehabilitation wards statistically significantly reduce mortality (approximately 28%) in comparison to general medical wards. In a similar approach, Ottenbacher and Jarnell12 demonstrated that programs of focused stroke rehabilitation may improve functional performance for some patients who have sustained a stroke. They have combined the findings of 36 studies on the effects of stroke rehabilitation wards as well as different methods of rehabilitation in one summary effect size.
In a more recent meta-analysis, Langhorne et al13 combined the results of seven randomized trials on the effects of different intensities of physical therapy and reported a small but significant reduction in mortality as well as significant improvements in ADL and impairments as a result of higher intensities of treatment. Sensitivity analysis revealed that the organizational setting in which the physiotherapy was delivered was an important confounding factor; studies in which the experimental and control conditions were applied in different settings (confounded studies) resulted in a smaller overall treatment effect than studies carried out in one setting (unconfounded studies).
The purpose of this article was to add a critical review of studies evaluating the efficacy of different intensities of stroke rehabilitation to trace variables that may influence rehabilitation outcome. The impact of these factors was then analyzed by including them as an independent variable in the calculation of the summary effect size. Since the present study includes two more studies than the analysis of Langhorne et al,13 first the effect size of different intensities of rehabilitation in terms of disabilities and impairments will be evaluated.
- Background and Purpose A research synthesis was performed to (1) critically review controlled studies evaluating effects of different intensities of stroke rehabilitation in terms of disabilities and impairments and (2) quantify patterns by calculating summary effect sizes. The influences of organizational setting of rehabilitation management, blind recording, and amount of rehabilitation on the summary effect sizes were calculated.Methods A Medline literature search was performed for a critical review of the literature. The internal and external validity of the studies was evaluated. In addition, a meta-analysis was performed by applying the fixed (Hedges’s g) effects model.Results The effects of different intensities of rehabilitation were studied in nine controlled studies involving 1051 patients. Analysis of the methodological quality revealed scores varying from 14% to 47% of the maximum feasible score. Meta-analysis demonstrated a statistically significant summary effect size for activities of daily living (0.28±0.12). Lower summary effect sizes (0.19±0.17) were found for studies in which experimental and control groups were treated in the same setting compared with studies in which the two groups of patients were treated in different settings (0.40±0.19). Variables defined on a neuromuscular level (0.37±0.24) showed larger summary effect sizes than variables defined on a functional level (0.10±0.21). Weighting individual effect sizes for the difference in amount of rehabilitation between experimental and control groups resulted in larger summary effect sizes for activities of daily living and functional outcome parameters for studies that were not confounded by organizational setting.Conclusions A small but statistically significant intensity-effect relationship in the rehabilitation of stroke patients was found. Insufficient contrast in the amount of rehabilitation between experimental and control conditions, organizational setting of rehabilitation management, lack of blinding procedures, and heterogeneity of patient characteristics were major confounding factors.In a 1989 review of efficacy of rehabilitation after stroke, Dobkin1 asserted that “convincing evidence concerning the therapeutic usefulness of stroke rehabilitation does not yet exist.” Other reviewers claim that any of the available approaches in physical and occupational therapy will improve the patient’s functional status,2 3although spontaneous neurological recovery appears to account for most of the improvement in functional ability.4 5 Wagenaar and Meyer6 7 concluded that the effects of rehabilitation methods are highly specific, indicating that improvements on the impairment level do not generalize to disabilities (ADL). In addition, they found that expert care has a positive effect on functional recovery compared with traditional care. However, in their opinion it remains unclear which part of expert care (eg, team care, active family participation, special staff education, early start of treatment, and/or intensity of treatment) has caused this effect. Recently, Gladman and colleagues8 stated that “till so far the knowledge about what it is in the ‘black box’ of a stroke unit that is effective is lacking.” The results of a few controlled studies suggest that an early start of intensive treatment is an important aspect of expert care.4 6 7Langhorne et al9 10 11 showed in a meta-analysis combining the findings of 10 randomized controlled trials that stroke rehabilitation wards statistically significantly reduce mortality (approximately 28%) in comparison to general medical wards. In a similar approach, Ottenbacher and Jarnell12 demonstrated that programs of focused stroke rehabilitation may improve functional performance for some patients who have sustained a stroke. They have combined the findings of 36 studies on the effects of stroke rehabilitation wards as well as different methods of rehabilitation in one summary effect size.In a more recent meta-analysis, Langhorne et al13 combined the results of seven randomized trials on the effects of different intensities of physical therapy and reported a small but significant reduction in mortality as well as significant improvements in ADL and impairments as a result of higher intensities of treatment. Sensitivity analysis revealed that the organizational setting in which the physiotherapy was delivered was an important confounding factor; studies in which the experimental and control conditions were applied in different settings (confounded studies) resulted in a smaller overall treatment effect than studies carried out in one setting (unconfounded studies).The purpose of this article was to add a critical review of studies evaluating the efficacy of different intensities of stroke rehabilitation to trace variables that may influence rehabilitation outcome. The impact of these factors was then analyzed by including them as an independent variable in the calculation of the summary effect size. Since the present study includes two more studies than the analysis of Langhorne et al,13 first the effect size of different intensities of rehabilitation in terms of disabilities and impairments will be evaluated.
Materials and Methods
Study Identification
-
A Medline literature search was performed for the period 1966 to 1995 (CD-ROM, Medical Subject Headings, volume 36, 1996). We used the key words (MeSH) stroke, cerebrovascular disorders, dose-response relationship, effectiveness, cost-effectiveness, rehabilitation, therapy, physical therapy, physiotherapy, occupational therapy, and exercise therapy. References presented in relevant publications were examined, and abstracts published in proceedings of conferences were searched. Investigators of studies were contacted if more information about the trial was needed. Studies had to meet the following criteria: (1) patients suffering from a stroke are studied; (2) the effects of different intensities of PT and/or OT are evaluated; (3) the study concerns true or quasi-experimentation14 ; (4) the rehabilitation outcome is measured in terms of ADL; and (5) the study is published in a journal or book.
- A Medline literature search was performed for the period 1966 to 1995 (CD-ROM, Medical Subject Headings, volume 36, 1996). We used the key words (MeSH) stroke, cerebrovascular disorders, dose-response relationship, effectiveness, cost-effectiveness, rehabilitation, therapy, physical therapy, physiotherapy, occupational therapy, and exercise therapy. References presented in relevant publications were examined, and abstracts published in proceedings of conferences were searched. Investigators of studies were contacted if more information about the trial was needed. Studies had to meet the following criteria: (1) patients suffering from a stroke are studied; (2) the effects of different intensities of PT and/or OT are evaluated; (3) the study concerns true or quasi-experimentation14 ; (4) the rehabilitation outcome is measured in terms of ADL; and (5) the study is published in a journal or book.
Critical Review
-
A methodological quality score was developed with items recommended by the Potsdam standards15 and other investigators in this field16 17 18 19 20 to identify independent variables (eg, elements of study design and contrast in amount of therapy) that might modify the overall effect size.
The following items were evaluated: (1) randomization or matching procedures; (2) blinding procedures; (3) description of dropouts and intention-to-treat analysis; (4) reliability and validity of assessment instruments; (5) control for cointervention(s); (6) comparability of baseline patient characteristics; and (7) control for amount of therapy (see “Appendix”). To each item a binary weight (0/1) was attached. In total 16 items were scored. Two reviewers (G.K., E. van Wegen) assessed the methodological quality of each study independently. Names of author(s), institution(s), and journal were masked to establish independent extraction of relevant data. Interrater reliability of individual items was assessed with Cohen’s κ with adjustment for tied ranks. In a meeting the reviewers tried to accomplish agreement on differences in scoring. When disagreement persisted, a third reviewer (R.C.W.) made the final decision.
- A methodological quality score was developed with items recommended by the Potsdam standards15 and other investigators in this field16 17 18 19 20 to identify independent variables (eg, elements of study design and contrast in amount of therapy) that might modify the overall effect size.The following items were evaluated: (1) randomization or matching procedures; (2) blinding procedures; (3) description of dropouts and intention-to-treat analysis; (4) reliability and validity of assessment instruments; (5) control for cointervention(s); (6) comparability of baseline patient characteristics; and (7) control for amount of therapy (see “Appendix”). To each item a binary weight (0/1) was attached. In total 16 items were scored. Two reviewers (G.K., E. van Wegen) assessed the methodological quality of each study independently. Names of author(s), institution(s), and journal were masked to establish independent extraction of relevant data. Interrater reliability of individual items was assessed with Cohen’s κ with adjustment for tied ranks. In a meeting the reviewers tried to accomplish agreement on differences in scoring. When disagreement persisted, a third reviewer (R.C.W.) made the final decision.
Summary Effect Sizes
-
The effect size gi (Hedges’s g) of each individual study was calculated by the difference between means of experimental and control group divided by the average population standard deviation (SDi).21 To estimate SDi for gi’s, baseline estimate standard deviations of control and experimental groups were pooled.21 22Alternatively, Hedges’s gi estimates were obtained from probability and t values.23Since the gi’s tend to overestimate the population effect size in studies with a small number of patients, a correction was made to obtain an unbiased estimator gu.2123 The impact of sample size was addressed by estimating a weighting factor wi for each study, assigning larger weights to effect sizes from studies with larger study samples and thus smaller variances. Subsequently, gu’s of individual studies were averaged, resulting in a weighted summary effect size (¯Tu).21 22 23 24 Finally, the wi’s were combined to estimate the variance of the summary effect size T̄u.24
On the disability level, effect size T̄u was computed for global outcome of ADL (eg, Barthel Index score) as well as separate domains of functional outcome (eg, dexterity, walking performance, and walking velocity). On the impairment level, neuromuscular outcome variables were used for assessing recovery of hemiplegia (eg, muscle strength and synergism).
In one study the effects of three treatment conditions were compared.25 Only the comparisons of the two experimental conditions with the control condition were included. Including two of three possible comparisons in one overall summary effect size would have introduced “dependency of findings,” which would violate further statistics. Therefore, summary effect sizes were also calculated including each comparison from the same study separately.
The homogeneity (or heterogeneity) test statistic (Q statistic) of each set of effect sizes was examined to determine whether studies shared a common effect size of which the variance could be explained by sampling error alone.21 26 The fixed effects model was used to decide whether a summary effect size was statistically significant.27 28 If a significant heterogeneity in summary effect sizes was found, a random effects model was applied.
Post hoc analyses were performed for the organizational setting of rehabilitation management,13 the amount of rehabilitation given in the experimental and control group, and the effects of blinding. The post hoc analysis for organizational setting was performed by comparing the summary effect sizes of studies in which all patients were treated in the same setting of rehabilitation with those studies in which experimental and control groups were treated in different settings. In a similar vein, a post hoc analysis for blinding was performed.
A sensitivity analysis was performed on the amount of rehabilitation by weighting the additional time spent (ti) on PT and OT in the experimental group compared with the control group by dividing the difference by the total amount of rehabilitation in the experimental group. In this way, studies with a proportionally greater difference in amount of rehabilitation between groups were given more weight than studies in which this contrast was small. For all outcome variables, the critical value for rejecting H0 was set at .05.
- The effect size gi (Hedges’s g) of each individual study was calculated by the difference between means of experimental and control group divided by the average population standard deviation (SDi).21 To estimate SDi for gi’s, baseline estimate standard deviations of control and experimental groups were pooled.21 22Alternatively, Hedges’s gi estimates were obtained from probability and t values.23Since the gi’s tend to overestimate the population effect size in studies with a small number of patients, a correction was made to obtain an unbiased estimator gu.2123 The impact of sample size was addressed by estimating a weighting factor wi for each study, assigning larger weights to effect sizes from studies with larger study samples and thus smaller variances. Subsequently, gu’s of individual studies were averaged, resulting in a weighted summary effect size (¯Tu).21 22 23 24 Finally, the wi’s were combined to estimate the variance of the summary effect size T̄u.24On the disability level, effect size T̄u was computed for global outcome of ADL (eg, Barthel Index score) as well as separate domains of functional outcome (eg, dexterity, walking performance, and walking velocity). On the impairment level, neuromuscular outcome variables were used for assessing recovery of hemiplegia (eg, muscle strength and synergism).In one study the effects of three treatment conditions were compared.25 Only the comparisons of the two experimental conditions with the control condition were included. Including two of three possible comparisons in one overall summary effect size would have introduced “dependency of findings,” which would violate further statistics. Therefore, summary effect sizes were also calculated including each comparison from the same study separately.The homogeneity (or heterogeneity) test statistic (Q statistic) of each set of effect sizes was examined to determine whether studies shared a common effect size of which the variance could be explained by sampling error alone.21 26 The fixed effects model was used to decide whether a summary effect size was statistically significant.27 28 If a significant heterogeneity in summary effect sizes was found, a random effects model was applied.Post hoc analyses were performed for the organizational setting of rehabilitation management,13 the amount of rehabilitation given in the experimental and control group, and the effects of blinding. The post hoc analysis for organizational setting was performed by comparing the summary effect sizes of studies in which all patients were treated in the same setting of rehabilitation with those studies in which experimental and control groups were treated in different settings. In a similar vein, a post hoc analysis for blinding was performed.A sensitivity analysis was performed on the amount of rehabilitation by weighting the additional time spent (ti) on PT and OT in the experimental group compared with the control group by dividing the difference by the total amount of rehabilitation in the experimental group. In this way, studies with a proportionally greater difference in amount of rehabilitation between groups were given more weight than studies in which this contrast was small. For all outcome variables, the critical value for rejecting H0 was set at .05.
Results
Critical Review
-
The results of the methodological quality score of the 9 trials involving 1051 patients are presented in Table 2⇓. Initially there was disagreement between two independent reviewers on 10 (7%) of the 144 criteria scored. Cohen’s κ was 0.86. The methodological quality score varied from 13%32 33 34 to 47%35 37 38 of the maximum feasible score.
View this table:
Eight studies were classified as randomized trials,25 32 33 34 35 36 37 38 of which only 2 studies reported the method of randomization.33 34 The observers were blinded in 4 studies,35 36 37 38 and dropouts were described for experimental and control groups separately in 7 studies.25 33 34 35 36 37 38 However, none of these controlled trials reported intercurrent dropouts or applied an intention-to-treat analysis. Three studies were confounded by the organizational setting. In 3 studies the patients in the experimental and control groups were comparable for age, initial ADL index, and type of stroke.35 36 37
On average, the intensive rehabilitation group received daily almost twice as much PT (ie, 48.4 minutes) and OT (ie, 44 minutes) as the control group (ie, 23.4 and 18.5 minutes, respectively) (Table 1⇑). However, this difference in amount of rehabilitation was larger in the unconfounded studies (n=6)25 32 35 36 37 38 than in confounded studies (n=3).31 33 34 The contrast ti between rehabilitation intensities of unconfounded and confounded studies was 0.68 and 0.34, respectively.
- The results of the methodological quality score of the 9 trials involving 1051 patients are presented in Table 2⇓. Initially there was disagreement between two independent reviewers on 10 (7%) of the 144 criteria scored. Cohen’s κ was 0.86. The methodological quality score varied from 13%32 33 34 to 47%35 37 38 of the maximum feasible score.View this table:Eight studies were classified as randomized trials,25 32 33 34 35 36 37 38 of which only 2 studies reported the method of randomization.33 34 The observers were blinded in 4 studies,35 36 37 38 and dropouts were described for experimental and control groups separately in 7 studies.25 33 34 35 36 37 38 However, none of these controlled trials reported intercurrent dropouts or applied an intention-to-treat analysis. Three studies were confounded by the organizational setting. In 3 studies the patients in the experimental and control groups were comparable for age, initial ADL index, and type of stroke.35 36 37On average, the intensive rehabilitation group received daily almost twice as much PT (ie, 48.4 minutes) and OT (ie, 44 minutes) as the control group (ie, 23.4 and 18.5 minutes, respectively) (Table 1⇑). However, this difference in amount of rehabilitation was larger in the unconfounded studies (n=6)25 32 35 36 37 38 than in confounded studies (n=3).31 33 34 The contrast ti between rehabilitation intensities of unconfounded and confounded studies was 0.68 and 0.34, respectively.
Meta-analysis
ADL
-
The unbiased overall summary effect size T̄u for the 9 studies was 0.28 (CI, ±0.12) SDU (Fig 1⇓ and Table 3⇓). The test statistic for heterogeneity was not statistically significant. Controlling for dependency of estimated effect sizes as a result of including two multiple comparisons from the study of Smith et al25 showed almost similar summary effect sizes (0.28 to 0.26 SDU) as well as standard errors (±0.13). Weighting each study by factor ti resulted in a slight increase of overall summary effect size T̄u from 0.28 (CI, ±0.12) to 0.34 (CI, ±0.15) SDU (Table 3⇓).
View this table:
The summary effect sizes T̄u for unconfounded and confounded studies were 0.19 (CI, ±0.17) and 0.40 (CI, ±0.19) SDU, respectively (Table 3⇑). No significant heterogeneity was found for confounded and unconfounded studies separately or between these two groups of studies. Controlling for dependency of estimated effect sizes as a result of including two multiple comparisons from the study of Smith et al25 decreased the summary effect sizes of unconfounded studies to 0.12 (CI, ±0.18) and 0.19 SDU (CI, ±0.17), respectively.
A marked difference was observed in overall effect size for ADL in studies with and without blind assessment recording, that is, 0.05 (CI, ±0.23) versus 0.38 (CI, ±0.23). Three of 5 studies in which assessment was not blinded were confounded by management of experimental and control groups at separate locations.
Weighting each study by a factor ti resulted for the unconfounded studies in a statistically significant summary effect size T̄u for ADL; that is, T̄u increased from 0.19 (CI, ±0.17) to 0.25 (CI, ±0.19) SDU (Table 3⇑). The unbiased summary effect size of confounded studies changed from 0.40 (CI, ±0.19) to 0.45 (CI, ±0.22) SDU. The homogeneity test statistic for ADL scores decreased slightly in both confounded and unconfounded studies by weighting the summary effect sizes by a factor ti, but remained not statistically significant. In addition, heterogeneity was not statistically significant between these two groups of studies. Weighting the four blinded studies by factor ti hardly changed the summary effect size (ie, 0.1 [CI, ±0.26]).
- The unbiased overall summary effect size T̄u for the 9 studies was 0.28 (CI, ±0.12) SDU (Fig 1⇓ and Table 3⇓). The test statistic for heterogeneity was not statistically significant. Controlling for dependency of estimated effect sizes as a result of including two multiple comparisons from the study of Smith et al25 showed almost similar summary effect sizes (0.28 to 0.26 SDU) as well as standard errors (±0.13). Weighting each study by factor ti resulted in a slight increase of overall summary effect size T̄u from 0.28 (CI, ±0.12) to 0.34 (CI, ±0.15) SDU (Table 3⇓).View this table:The summary effect sizes T̄u for unconfounded and confounded studies were 0.19 (CI, ±0.17) and 0.40 (CI, ±0.19) SDU, respectively (Table 3⇑). No significant heterogeneity was found for confounded and unconfounded studies separately or between these two groups of studies. Controlling for dependency of estimated effect sizes as a result of including two multiple comparisons from the study of Smith et al25 decreased the summary effect sizes of unconfounded studies to 0.12 (CI, ±0.18) and 0.19 SDU (CI, ±0.17), respectively.A marked difference was observed in overall effect size for ADL in studies with and without blind assessment recording, that is, 0.05 (CI, ±0.23) versus 0.38 (CI, ±0.23). Three of 5 studies in which assessment was not blinded were confounded by management of experimental and control groups at separate locations.Weighting each study by a factor ti resulted for the unconfounded studies in a statistically significant summary effect size T̄u for ADL; that is, T̄u increased from 0.19 (CI, ±0.17) to 0.25 (CI, ±0.19) SDU (Table 3⇑). The unbiased summary effect size of confounded studies changed from 0.40 (CI, ±0.19) to 0.45 (CI, ±0.22) SDU. The homogeneity test statistic for ADL scores decreased slightly in both confounded and unconfounded studies by weighting the summary effect sizes by a factor ti, but remained not statistically significant. In addition, heterogeneity was not statistically significant between these two groups of studies. Weighting the four blinded studies by factor ti hardly changed the summary effect size (ie, 0.1 [CI, ±0.26]).
Functional and Neuromuscular Outcome Parameters
-
Only one study investigating the effects of intensity of rehabilitation on neuromuscular outcome was confounded because patients were treated at separate locations34 (Fig 2⇓ and Table 4⇓). The studies in which functional outcome was assessed were not confounded by organizational setting. In addition, all studies were blindly recorded. The unbiased summary effect size T̅u’s for functional and neuromuscular scores were estimated at 0.10 (CI, ±0.21) and 0.37 (CI, ±0.24) SDU, respectively. No significant heterogeneity was found. To control for organizational setting and blinding procedures, a post hoc analysis for neuromuscular scores revealed comparable summary effect sizes at 0.35 (CI, ±0.30) and 0.36 (CI, ±0.31), respectively).
View this table:
Weighting each study by a time factor ti resulted in a significant summary effect size T̄u for variables defined on a functional level; that is, an increment from 0.10 (CI, ±0.25) to 0.36 (CI, ±0.22) SDU was found. On the neuromuscular level the summary effect size decreased from 0.37 (CI, ±0.23) to 0.32 (CI, ±0.25) SDU after weighting for ti. When we weighted the blindly recorded studies only (n=3), a slightly higher summary effect size was found (0.36; CI, ±0.33).
- Only one study investigating the effects of intensity of rehabilitation on neuromuscular outcome was confounded because patients were treated at separate locations34 (Fig 2⇓ and Table 4⇓). The studies in which functional outcome was assessed were not confounded by organizational setting. In addition, all studies were blindly recorded. The unbiased summary effect size T̅u’s for functional and neuromuscular scores were estimated at 0.10 (CI, ±0.21) and 0.37 (CI, ±0.24) SDU, respectively. No significant heterogeneity was found. To control for organizational setting and blinding procedures, a post hoc analysis for neuromuscular scores revealed comparable summary effect sizes at 0.35 (CI, ±0.30) and 0.36 (CI, ±0.31), respectively).View this table:Weighting each study by a time factor ti resulted in a significant summary effect size T̄u for variables defined on a functional level; that is, an increment from 0.10 (CI, ±0.25) to 0.36 (CI, ±0.22) SDU was found. On the neuromuscular level the summary effect size decreased from 0.37 (CI, ±0.23) to 0.32 (CI, ±0.25) SDU after weighting for ti. When we weighted the blindly recorded studies only (n=3), a slightly higher summary effect size was found (0.36; CI, ±0.33).
Discussion
-
In a number of critical reviews it has been suggested that an early start of intensive rehabilitation may be an important part of expert care in stroke patients.4 5 6 7 8 39In the present research synthesis small40 but statistically significant improvements in ADL as well as in neuromuscular and functional outcome variables were found as a result of higher intensities of rehabilitation. These results support the findings of Langhorne et al, who applied Fisher’s inverse χ2 test and odds ratios. After probability values of the 9 studies included in the present study were combined with Fisher’s inverse χ2 test,21 41 significant improvements in ADL as well as neuromuscular and functional outcome parameters as a result of higher intensities of rehabilitation were found.
In the present study Hedges’s g method was applied, because ADL, muscle strength,32 34 35 synergism,37 38 dexterity,35 38 and walking velocity36 37 are assessed on an ordinal or interval scale. This method allows for sensitivity analysis by weighting effect sizes for covariates such as amount of rehabilitation. The clinical relevance of obtained or reported summary effect sizes remains an important problem.
The overall unbiased summary effect size of 0.28 SDU for ADL is smaller than the summary effect size of 0.57 SDU found by Ottenbacher and Jarnell.12 Even the application of a random effects model in the present study resulted in almost comparable effect sizes (ie, 0.29; CI, ±0.13 SDU) (Table 5⇓).
View this table:
As in the study of Langhorne et al,13 a difference was found in summary effect size between studies in which experimental and control groups were managed in the same setting (unconfounded studies) compared with the management of experimental and control groups in different settings (confounded studies). However, in the present study the effect size was smaller in the unconfounded studies (0.19; CI, ±0.17) than in the confounded studies (0.40; CI, ±0.19). The higher summary effect size for confounded studies compared with that for unconfounded studies may be the result of comparing the effects of stroke rehabilitation wards with those of general medical wards in 2 of 3 studies.33 34Combining the findings of these 2 studies resulted in a summary effect size of 0.50 (CI, ±0.33) SDU. Smith et al42 and Kalra et al43 44 demonstrated improved ADL and reduced hospital stay in stroke rehabilitation wards compared with patients admitted to general medical wards, while the amount of therapy was almost similar in both experimental conditions. Factors such as early onset of therapy,42 43 44 45better education of staff members,45 better organization of stroke care,43 44 and family participation45 46 may explain differences in better outcome of stroke rehabilitation wards.6 7 8
The low to moderate summary effect size found for the studies investigating the effects of intensity of rehabilitation may be caused by heterogeneity of the patient population, limited responsiveness of used assessment instruments, and insufficient modulation of intensity of therapy.47 With respect to the heterogeneity of the patient population, it should be noted that the number of previous strokes, lesion size, and localization of stroke have an important impact on functional recovery.48 49 50 Only one study38 treated stroke as a single diagnostic category.
Ottenbacher and Jarnell12 were unable to find a statistically significant correlation coefficient (r=.11) between length or extent of therapy and effect size. In the present study, weighting the additional amount of rehabilitation in the experimental group compared with the control group in the individual studies increased the summary effect sizes for ADL in the unconfounded studies from 0.19 (±0.17) to 0.25 (±0.18) SDU. This finding provides further evidence for the presence of an intensity-effect relationship and suggests that this relationship is more likely to occur when the difference in intensity of rehabilitation between experimental conditions is sufficiently large.
Remarkable in our research synthesis was the observation that in studies conducted in North America31 32 37 38 the amount of rehabilitation in both experimental and control groups was twice that of European studies.25 33 34 35 36 It should be noted, however, that the actual amount of rehabilitation has been adequately controlled in only one study.37 Treatment days,34 frequency of treatment,34 or amount of treatment without correction for duration of admission25 are only rough indicators for intensity of therapy (see Reference 2929 for discussion).
Another important result is that the summary effect size for outcome variables defined on the neuromuscular level is almost three times as high (ie, 0.37 SDU) as the summary effect size for functional outcome parameters (ie, 0.10 SDU). This finding may reflect the higher responsiveness of assessment instruments for neuromuscular functioning and supports the assumption that improvements on an impairment level are not unequivocally related to improvements in disability.6 7 37However, weighting the additional amount of rehabilitation in the experimental group compared with the control group in individual studies resulted in a significant summary effect size for functional outcome variables, providing more evidence for the presence of an intensity-effect relationship. However, lower (weighted) summary effect sizes were obtained when the four blinded studies (with the highest methodological quality35 36 37 38 ) were analyzed separately (see also Reference 1212 ). The latter finding suggests that observation bias may also explain part of the reported differences in summary effect sizes.
In summary, the present research synthesis demonstrates small but statistically significant improvements in terms of ADL and functional outcome parameters. However, generalization of the results of the present research synthesis is difficult because of the low methodological quality of the included studies. Further research on the effects of intensity of physical and occupational therapy is necessary. These studies should experimentally control for (1) sufficient contrast in amount of therapy spent, (2) organizational setting of rehabilitation management, and (3) specification of patient characteristics such as type and localization of stroke, number of previous strokes, and initial ADL score. In addition, these studies should adhere to the methodological principles, especially blind recording, as described in the present research synthesis.
- In a number of critical reviews it has been suggested that an early start of intensive rehabilitation may be an important part of expert care in stroke patients.4 5 6 7 8 39In the present research synthesis small40 but statistically significant improvements in ADL as well as in neuromuscular and functional outcome variables were found as a result of higher intensities of rehabilitation. These results support the findings of Langhorne et al, who applied Fisher’s inverse χ2 test and odds ratios. After probability values of the 9 studies included in the present study were combined with Fisher’s inverse χ2 test,21 41 significant improvements in ADL as well as neuromuscular and functional outcome parameters as a result of higher intensities of rehabilitation were found.In the present study Hedges’s g method was applied, because ADL, muscle strength,32 34 35 synergism,37 38 dexterity,35 38 and walking velocity36 37 are assessed on an ordinal or interval scale. This method allows for sensitivity analysis by weighting effect sizes for covariates such as amount of rehabilitation. The clinical relevance of obtained or reported summary effect sizes remains an important problem.The overall unbiased summary effect size of 0.28 SDU for ADL is smaller than the summary effect size of 0.57 SDU found by Ottenbacher and Jarnell.12 Even the application of a random effects model in the present study resulted in almost comparable effect sizes (ie, 0.29; CI, ±0.13 SDU) (Table 5⇓).View this table:As in the study of Langhorne et al,13 a difference was found in summary effect size between studies in which experimental and control groups were managed in the same setting (unconfounded studies) compared with the management of experimental and control groups in different settings (confounded studies). However, in the present study the effect size was smaller in the unconfounded studies (0.19; CI, ±0.17) than in the confounded studies (0.40; CI, ±0.19). The higher summary effect size for confounded studies compared with that for unconfounded studies may be the result of comparing the effects of stroke rehabilitation wards with those of general medical wards in 2 of 3 studies.33 34Combining the findings of these 2 studies resulted in a summary effect size of 0.50 (CI, ±0.33) SDU. Smith et al42 and Kalra et al43 44 demonstrated improved ADL and reduced hospital stay in stroke rehabilitation wards compared with patients admitted to general medical wards, while the amount of therapy was almost similar in both experimental conditions. Factors such as early onset of therapy,42 43 44 45better education of staff members,45 better organization of stroke care,43 44 and family participation45 46 may explain differences in better outcome of stroke rehabilitation wards.6 7 8The low to moderate summary effect size found for the studies investigating the effects of intensity of rehabilitation may be caused by heterogeneity of the patient population, limited responsiveness of used assessment instruments, and insufficient modulation of intensity of therapy.47 With respect to the heterogeneity of the patient population, it should be noted that the number of previous strokes, lesion size, and localization of stroke have an important impact on functional recovery.48 49 50 Only one study38 treated stroke as a single diagnostic category.Ottenbacher and Jarnell12 were unable to find a statistically significant correlation coefficient (r=.11) between length or extent of therapy and effect size. In the present study, weighting the additional amount of rehabilitation in the experimental group compared with the control group in the individual studies increased the summary effect sizes for ADL in the unconfounded studies from 0.19 (±0.17) to 0.25 (±0.18) SDU. This finding provides further evidence for the presence of an intensity-effect relationship and suggests that this relationship is more likely to occur when the difference in intensity of rehabilitation between experimental conditions is sufficiently large.Remarkable in our research synthesis was the observation that in studies conducted in North America31 32 37 38 the amount of rehabilitation in both experimental and control groups was twice that of European studies.25 33 34 35 36 It should be noted, however, that the actual amount of rehabilitation has been adequately controlled in only one study.37 Treatment days,34 frequency of treatment,34 or amount of treatment without correction for duration of admission25 are only rough indicators for intensity of therapy (see Reference 2929 for discussion).Another important result is that the summary effect size for outcome variables defined on the neuromuscular level is almost three times as high (ie, 0.37 SDU) as the summary effect size for functional outcome parameters (ie, 0.10 SDU). This finding may reflect the higher responsiveness of assessment instruments for neuromuscular functioning and supports the assumption that improvements on an impairment level are not unequivocally related to improvements in disability.6 7 37However, weighting the additional amount of rehabilitation in the experimental group compared with the control group in individual studies resulted in a significant summary effect size for functional outcome variables, providing more evidence for the presence of an intensity-effect relationship. However, lower (weighted) summary effect sizes were obtained when the four blinded studies (with the highest methodological quality35 36 37 38 ) were analyzed separately (see also Reference 1212 ). The latter finding suggests that observation bias may also explain part of the reported differences in summary effect sizes.In summary, the present research synthesis demonstrates small but statistically significant improvements in terms of ADL and functional outcome parameters. However, generalization of the results of the present research synthesis is difficult because of the low methodological quality of the included studies. Further research on the effects of intensity of physical and occupational therapy is necessary. These studies should experimentally control for (1) sufficient contrast in amount of therapy spent, (2) organizational setting of rehabilitation management, and (3) specification of patient characteristics such as type and localization of stroke, number of previous strokes, and initial ADL score. In addition, these studies should adhere to the methodological principles, especially blind recording, as described in the present research synthesis.
Acknowledgments
-
This study is part of a research project supported by a grant from The Netherlands Heart Foundation (project reference No. 93.134). The authors thank Walter Cambach and Erwin van Wegen for their contribution to the data analysis.
- This study is part of a research project supported by a grant from The Netherlands Heart Foundation (project reference No. 93.134). The authors thank Walter Cambach and Erwin van Wegen for their contribution to the data analysis.
Selected Abbreviations and Acronyms
-
ADL = activities of daily living
CI = confidence interval
OT = occupational therapy
PT = physical therapy
SDU = standard deviation units
ADL = activities of daily living CI = confidence interval OT = occupational therapy PT = physical therapy SDU = standard deviation units
Footnotes
-
-
Reprint requests to Gert Kwakkel, MSc, Department of Physical Therapy, University Hospital Vrije Universiteit, PO Box 7057, 1007 MB Amsterdam, Netherlands.
- Received March 24, 1997.
- Revision received May 13, 1997.
- Accepted May 13, 1997.
- Copyright © 1997 by American Heart Association
- Reprint requests to Gert Kwakkel, MSc, Department of Physical Therapy, University Hospital Vrije Universiteit, PO Box 7057, 1007 MB Amsterdam, Netherlands.
- Received March 24, 1997.
- Revision received May 13, 1997.
- Accepted May 13, 1997.
- Copyright © 1997 by American Heart Association
Nenhum comentário:
Postar um comentário