Baron Gabriel, Ravaud Philippe, Samson Adeline, Giraudeau Bruno
AP-HP, Hôpital Bichat, Biostatistique et Recherche Clinique, INSERM, U738, and Université Paris, France.
Arthritis Rheum. 2008 Jan 15;59(1):25-31. doi: 10.1002/art.23253.
To assess the impact, in terms of statistical power and bias of treatment effect, of approaches to dealing with missing data in randomized controlled trials of rheumatoid arthritis with radiographic outcomes.
We performed a simulation study. The missingness mechanisms we investigated copied the process of withdrawal from trials due to lack of efficacy. We compared 3 methods of managing missing data: all available data (case-complete), last observation carried forward (LOCF), and multiple imputation. Data were then analyzed by classic t-test (comparing the mean absolute change between baseline and final visit) or F test (estimation of treatment effect with repeated measurements by a linear mixed-effects model).
With a missing data rate close to 15%, the treatment effect was underestimated by 18% as estimated by a linear mixed-effects model with a multiple imputation approach to missing data. This bias was lower than that obtained with the case-complete approach (-25%) or LOCF approach (-35%). This statistical approach (combination of multiple imputation and mixed-effects analysis) was moreover associated with a power of 70% (for a 90% nominal level), whereas LOCF was associated with a power of 55% and a case-complete power of 58%. Analysis with the t-test gave qualitatively equivalent but poorer quality results, except when multiple imputation was applied.
Our simulation study demonstrated multiple imputation, offering the smallest bias in treatment effect and the highest power. These results can help in planning trials, especially in choosing methods of imputation and data analysis.
从统计学效能和治疗效果偏倚的角度,评估在以放射学结果为指标的类风湿关节炎随机对照试验中,处理缺失数据的方法所产生的影响。
我们进行了一项模拟研究。我们所研究的缺失机制模仿了因疗效不佳而退出试验的过程。我们比较了3种处理缺失数据的方法:所有可用数据(病例完整法)、末次观察值结转(LOCF)和多重填补。然后通过经典t检验(比较基线和末次访视之间的平均绝对变化)或F检验(通过线性混合效应模型对重复测量的治疗效果进行估计)对数据进行分析。
当缺失数据率接近15%时,采用多重填补方法处理缺失数据的线性混合效应模型估计,治疗效果被低估了18%。这种偏倚低于病例完整法(-25%)或LOCF法(-35%)所产生的偏倚。此外,这种统计方法(多重填补和混合效应分析相结合)的效能为70%(名义水平为90%),而LOCF的效能为55%,病例完整法的效能为58%。除应用多重填补时外,t检验分析得出的结果在性质上相当,但质量较差。
我们的模拟研究表明,多重填补在治疗效果方面的偏倚最小,效能最高。这些结果有助于试验设计,特别是在选择填补方法和数据分析方法方面。