| HOME | ARCHIVE | SEARCH | TABLE OF CONTENTS |
|---|
| ||||||||||||||||||||||||||||||||
1 Department of Medical and Surgical Sciences
2 Department of Preventive and Social Medicine, University of Otago Medical School, Dunedin, New Zealand.
Address correspondence to Dr. M. Clare Robertson, Department of Medical and Surgical Sciences, University of Otago Medical School, P.O. Box 913, Dunedin, New Zealand. E-mail: clare.robertson{at}stonebow.otago.ac.nz
| Abstract |
|---|
|
|
|---|
Methods. We used raw data from two randomized controlled trials of a home exercise program to compare the number of falls in the exercise and control groups during the trials. We developed two different survival analysis models (Andersen-Gill and marginal Cox regression) and a negative binomial regression model for each trial. These techniques a) allow for the fact that falls are frequent, recurrent events with a non-normal distribution; b) adjust for the follow-up time of individual participants; and c) allow the addition of covariates.
Results. In one trial, the three different statistical techniques gave surprisingly similar results for the efficacy of the intervention but, in a second trial, underlying assumptions were violated for the two Cox regression models. Negative binomial regression models were easier to use.
Conclusion. We recommend negative binomial regression models for evaluating the efficacy of falls prevention programs.
A recent systematic review identified over 60 randomized controlled trials of interventions to prevent falls in elderly people, and these studies used a variety of approaches to test the efficacy of the intervention (1). Some were clinically and methodologically unsound.
Many studies reported recording all falls during the trial, but the results were presented using information on the first fall only. The clinical justification for a falls prevention program is to prevent as many falls as possible because all falls are likely to cause injury or have other adverse consequences. Using the time to a first fall or the proportion of fallers as the sole outcome ignores multiple events and wastes relevant information. For both clinical and statistical reasons, it is important to include all falls for each person as the outcome measure when evaluating the efficacy of a falls prevention program.
Some studies incorrectly assumed a normal distribution for falls and used Student's t test, linear regression, or analysis of variance. Survival analysis models make no assumption about the distribution of the outcome being compared, but there are other underlying assumptions that need to be tested, and the results of these tests were not often reported.
Intention-to-treat analysis requires that participants in randomized controlled trials are compared in the groups to which they were originally, randomly assigned. Even with the best attempts to minimize withdrawals, participants do die or withdraw during the trial; this is a particular issue in trials of elderly people. For some trials in the recent systematic review, the analysis was only for those completing the trial, whereas in others it remained unclear whether information from all participants was included (1). Dropouts in each group were usually reported, but whether dropouts resulted in the intervention or control groups being monitored for a significantly different length of time, and the possibility of bias resulting from this, was rarely discussed. There are statistical methods available for analyzing recurrent events which could take into account the individual follow-up time of participants in an intervention trial (2).
In this article we describe three different statistical modeling techniques in which all falls are considered, which allow for the fact that falls are frequent, recurrent events with a non-normal distribution over time, and which adjust for a) the variable follow-up time of participants during the trial and b) confounding variables, if appropriate. We used data from two of our four controlled trials of the Otago Exercise Programme to illustrate the use of the different techniques. We make a recommendation for future analyses of efficacy in falls prevention trials.
| METHODS |
|---|
|
|
|---|
In both trials, participants were recruited through general practices and were randomly assigned to receive either the home exercise program or usual healthcare for 1 year. Dunedin Study A (3), a trial of women aged 80 years or older (n = 233) in a research setting, was subsequently extended for another year (7), but the later results are not presented here. In Dunedin Study A, the control group also received social visits at home to match the home visits to exercise group participants by the physiotherapist delivering the exercise program (3). The West Auckland trial (n = 240) tested the home exercise program in men and women aged 75 years or older in a routine healthcare settinga community home health serviceand the program was delivered by a nurse who was trained and supervised by a physiotherapist (4).
Falls were the main outcome and were defined as "unintentionally coming to rest on the ground, floor or other lower level" (8). Fall events were monitored in all participants by asking them to return preaddressed, prepaid postcard calendars each month for 1 year.
For the present article, we analyzed the efficacy of the exercise program in each trial by using two survival-analysis techniques, Andersen-Gill (9) and marginal models (10), which are extensions of the Cox proportional hazards regression model. In these models, the dependent variable was time to each fall for each participant measured in days. The third technique was a negative binomial regression model where the dependent variable was the total number of falls for each participant during the trial (11). Group allocation was the independent variable in all the models, and each of the three techniques allowed the rate of falls in the exercise program compared with those in the control group to be expressed as a risk ratio. Robust estimates of variance were calculated for all models. A schematic representation of the three techniques is provided (see Figure 1).
|
Cox Proportional Hazards Regression
Proportional hazards or Cox regression models can test whether several factors (for example, intervention group, baseline prognostic factors) are independently related to the rate of a specific event. The models calculate the hazard function, which is the instantaneous likelihood of falling on any particular day. The estimate of the relative hazard for the variable in the model indicating group allocation is a ratio of the risk of falling in the intervention group to that in the control group throughout the trial.
Using survival probabilities to analyze time to fall events assumes that, at any time, participants who are censored before the end of the trial (withdraw from the trial) have the same risk of falling as those who complete the trial (12). An assumption of proportional hazards models is that the ratio of the risks of the events in the two groups is constant over time and that the ratio is the same for different subgroups of the data, such as age and sex groups (13). This is known as the proportionality of hazards assumption. No particular distribution is assumed for the event times, that is, the time from the trial start date for the individual to the outcome of interest (in this case, a fall event).
Andersen-Gill and marginal Cox proportional hazards regression.-- The assumption required in traditional survival analysis that event times are independent is violated when there can be multiple events per person. The Andersen-Gill extension of the proportional hazards regression model and the marginal proportional hazards regression model are both recommended for analyzing these data (2,14). In both these techniques, the dependencies between failure times (fall events) are ignored, but robust estimates of variances are used to take into account the dependence of events.
For the Andersen-Gill and marginal Cox regression models that we developed, all trial participants were included in the analyses whether or not they withdrew from the exercise intervention during the trials. We censored data after 1 year or when the person withdrew completely from the trial.
The main difference between the two models is in the way that the risk sets are defined for each fall (15). The simplest method to implement is the Andersen-Gill, which follows a counting process approach and assumes that all falls are equal or indistinguishable. In contrast, the marginal model is based on the idea of marginal risk sets, and the data are treated as a competing risk data set, as if the fall events were unordered. Each event has its own stratum and each participant appears in all strata. Therefore for the two models, the data files for the analyses are set up in two different ways.
For the Andersen-Gill models the data records for each participant consisted of a separate record for each of that person's fall events plus a final record containing the time in days from the last fall until withdrawal from the trial or the end of the trial (15). If the person had no falls, there was one data record only which included the number of days the person was monitored.
The data files for the marginal models required, for each participant, a separate record for each fall for the maximum overall number of fall events in the trial (the most frequent faller in Dunedin Study A had 10 falls and in the West Auckland trial had 15 falls) plus a final record containing the total time of follow-up for that person (15). This resulted in 2563 and 3840 data records for the two trials, respectively.
We tested the Andersen-Gill and marginal models to determine whether they met the assumption that the hazard ratio is proportional over time. The test we used is equivalent to testing that the log hazard ratio function is constant over time; the result is a chi-square value which indicates deviation from the assumption (11).
A log likelihood value is associated with each Andersen-Gill model and each marginal model. If this value is close to zero, the model is considered to describe the data well, that is, it is a "good" model. When comparing models using the same data, the model best fitting the data is indicated by the log likelihood value nearer to zero. We compared the goodness-of-fit of the Andersen-Gill model with that of the marginal model developed from the same data by noting the model log likelihood value nearer to zero.
Negative Binomial Regression
The negative binomial regression model can also be used to compare recurrent event rates in different groups; it allows investigation of the treatment effect and confounding variables, and adjusts for variable follow-up times by using time at risk as the offset (16). These models are a generalization of the more commonly used Poisson regression model.
Poisson distributions assume that a) the recurrent events being counted are occurring independently of each other and randomly in time and b) the mean and variance are equal (13). Overdispersion is the term used when, as is the case with falls, there is more variation than would be expected in a Poisson process, that is, the variance of the count variable is greater than the mean (11). Thus overdispersion models attempt to take into account the dependence of events in the same individual. One alternative is to estimate a factor to correct the inferential statistics of the Poisson regression model; another alternative is to use the negative binomial regression model. Poisson is a special case of the negative binomial and corresponds to an overdispersion parameter of zero. The data files for the negative binomial regression analyses had one record per person containing simply the total number of falls during the trial and the total number of days the person was monitored (233 and 240 data records for the two trials, respectively). In our negative binomial models we noted whether the data best fitted a Poisson or a negative binomial model by testing whether the overdispersion parameter equalled zero [indicated by the p value associated with a likelihood ratio chi-square (11)].
| RESULTS |
|---|
|
|
|---|
|
|
In the West Auckland trial data there was evidence that the proportional hazards assumption was violated in the Andersen-Gill and marginal Cox regression models (proportional hazards test, group
= 0.084,
2 = 3.88, df = 1, p =.049, and proportional hazards test, group
= 0.079,
2 = 4.09, df = 1, p =.043, respectively). Therefore, a single hazard ratio describing the effect of the exercise program would be inappropriate.
In the West Auckland trial the negative binomial model indicated there was a 46% reduction in the number of falls during the trial for the exercise group compared with the control group (incidence rate ratio 0.54, 95% confidence interval 0.320.90). The likelihood ratio test of the overdispersion parameter for both the negative binomial models indicated that these models fitted the data significantly better than a Poisson model (
2 = 107.91, df = 1, p =.0001 for Dunedin Study A;
2 = 143.03, df = 1, p =.0001 for West Auckland trial).
| DISCUSSION |
|---|
|
|
|---|
Therneau and Hamilton (14) state that, when using survival analysis to analyze recurrent events, the most difficult to analyze are those in which the events are of the same type. Survival analysis is not valid if participants who are censored (withdraw before the end of the trial) do not have the same rate of outcome (risk of falling) as those who continue in the trial (18). Those not completing a falls prevention trial are more likely to be at higher risk of falling (7) and, if fewer from one group than another group withdraw, results may be biased. As the Andersen-Gill extension of the Cox model appears to be least affected by this problem, Therneau and Hamilton suggest it should be the first choice. They also favor the marginal Cox model if the proportional hazards assumption "does not show great departures." The negative binomial regression model has been recommended for analysis of counts and rates in psychology (16), for motor vehicle crashes (19), and for other right-skewed data with clumping at zero (20). Several falls prevention intervention trials which used this model for analysis of efficacy have now been reported (4,2123). The incidence rate ratios calculated in each model are easy to interpret, and this technique is now available in standard software (11).
There is no one statistical technique which is the "right" one for testing efficacy of falls prevention interventions, but some techniques have advantages over others. We found that using the negative binomial regression model for evaluating efficacy of the intervention in our falls prevention trials was simple, straightforward, and had several advantages over survival analysis techniques. Underlying assumptions were violated in some of the Cox regression models. In contrast to the complicated data files needed for the survival analysis models, the negative binomial regression analysis required only one data record per participant.
We found negative binomial regression models useful when analyzing efficacy in a controlled trial and in a meta-analysis of the individual level data from the four controlled trials of the Otago Exercise Programme (21,24). In the Southern New Zealand trial (21), centers rather than individuals were allocated to receive the exercise program or to act as controls, so that clustering on center was required in the analysis. In the meta-analysis, the main outcome was injuries resulting from falls; we clustered on the trial center and also added interaction terms to the models for subgroup analyses (24).
A number of successful falls prevention trials have now been reported (1). All interventions require healthcare resources, and healthcare planners need to be able to compare successful programs. Robust analytical techniques, using data for all falls, are essential for this. Similar techniques would certainly make comparison easier and would also enable pooling of data. We recommend using negative binomial regression models for evaluating the efficacy of falls prevention programs.
| Acknowledgments |
|---|
| Footnotes |
|---|
Received September 3, 2003
Accepted October 13, 2003
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. G. Donaldson, B. Sobolev, W. L. Cook, P. A. Janssen, and K. M. Khan Analysis of recurrent events: a systematic review of randomised controlled trials of interventions to prevent falls Age Ageing, December 23, 2008; (2008) afn279v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Kulmala, A. Viljanen, S. Sipila, S. Pajala, O. Parssinen, M. Kauppinen, M. Koskenvuo, J. Kaprio, and T. Rantanen Poor vision accompanied with other sensory impairments as a predictor of falls in older women Age Ageing, November 13, 2008; (2008) afn228v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Campbell and M. C. Robertson Reply Age Ageing, May 1, 2008; 37(3): 353 - 353. [Full Text] [PDF] |
||||
![]() |
S. Pajala, P. Era, M. Koskenvuo, J. Kaprio, T. Tormakangas, and T. Rantanen Force Platform Balance Measures as Predictors of Indoor and Outdoor Falls in Community-Dwelling Women Aged 63-76 Years J. Gerontol. A Biol. Sci. Med. Sci., February 1, 2008; 63(2): 171 - 178. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Campbell and M. C. Robertson Rethinking individual and community fall prevention strategies: a meta-regression comparing single and multifactorial interventions Age Ageing, November 1, 2007; 36(6): 656 - 662. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Donaldson, B. Sobolev, L. Kuramoto, W. L. Cook, K. M. Khan, and P. A. Janssen Utility of the Mean Cumulative Function in the Analysis of Fall Events J. Gerontol. A Biol. Sci. Med. Sci., April 1, 2007; 62(4): 415 - 419. [Abstract] [Full Text] [PDF] |
||||
![]() |
S J La Grow, M C Robertson, A J Campbell, G A Clarke, and N M Kerse Reducing hazard related falls in people 75 years and older with significant visual impairment: how did a successful program work? Inj. Prev., October 1, 2006; 12(5): 296 - 301. [Abstract] [Full Text] [PDF] |
||||
![]() |
A J. Campbell, M C. Robertson, S. J La Grow, N. M Kerse, G. F Sanderson, R. J Jacobs, D. M Sharp, and L. A Hale Randomised controlled trial of prevention of falls in people aged >=75 with severe visual impairment: the VIP trial BMJ, October 8, 2005; 331(7520): 817. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||
| HOME | ARCHIVE | SEARCH | TABLE OF CONTENTS |
|---|