Nguyen Tri-Long, Collins Gary S, Lamy André, Devereaux Philip J, Daurès Jean-Pierre, Landais Paul, Le Manach Yannick
Laboratory of Biostatistics, Epidemiology, Clinical Research and Health Economics, UPRES EA2415, University of Montpellier, Montpellier, France; Department of Clinical Epidemiology and Biostatistics, Michael DeGroote School of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada; Department of Anesthesia, Michael DeGroote School of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada; Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Perioperative Medicine and Surgical Research Unit, Perioperative Research Group, Population Health Research Institute, McMaster University, Hamilton, Canada.
Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, University of Oxford, Windmill Road, Oxford, United Kingdom.
J Clin Epidemiol. 2017 Apr;84:105-113. doi: 10.1016/j.jclinepi.2017.02.010. Epub 2017 Feb 28.
By removing systematic differences across treatment groups, simple randomization is assumed to protect against bias. However, random differences may remain if the sample size is insufficiently large. We sought to determine the minimal sample size required to eliminate random differences, thereby allowing an unbiased estimation of the treatment effect.
We reanalyzed two published multicenter, large, and simple trials: the International Stroke Trial (IST) and the Coronary Artery Bypass Grafting (CABG) Off- or On-Pump Revascularization Study (CORONARY). We reiterated 1,000 times the analysis originally reported by the investigators in random samples of varying size. We measured the covariates balance across the treatment arms. We estimated the effect of aspirin and heparin on death or dependency at 30 days after stroke (IST), and the effect of off-pump CABG on a composite primary outcome of death, nonfatal stroke, nonfatal myocardial infarction, or new renal failure requiring dialysis at 30 days (CORONARY). In addition, we conducted a series of Monte Carlo simulations of randomized trials to supplement these analyses.
Randomization removes random differences between treatment groups when including at least 1,000 participants, thereby resulting in minimal bias in effects estimation. Later, substantial bias is observed. In a short review, we show such an enrollment is achieved in 41.5% of phase 3 trials published in the highest impact medical journals.
Conclusions drawn from completely randomized trials enrolling a few participants may not be reliable. In these circumstances, alternatives such as minimization or blocking should be considered for allocating the treatment.
通过消除各治疗组之间的系统差异,简单随机化被认为可以防止偏倚。然而,如果样本量不够大,随机差异可能仍然存在。我们试图确定消除随机差异所需的最小样本量,从而能够对治疗效果进行无偏估计。
我们重新分析了两项已发表的多中心、大型且简单的试验:国际卒中试验(IST)和冠状动脉搭桥术(CABG)非体外循环或体外循环血运重建研究(CORONARY)。我们在不同大小的随机样本中,将研究者最初报告的分析重复了1000次。我们测量了各治疗组之间协变量的平衡情况。我们估计了阿司匹林和肝素对卒中后30天死亡或依赖的影响(IST),以及非体外循环冠状动脉搭桥术对30天死亡、非致命性卒中、非致命性心肌梗死或需要透析的新肾衰竭这一复合主要结局的影响(CORONARY)。此外,我们进行了一系列随机试验的蒙特卡洛模拟以补充这些分析。
当纳入至少1000名参与者时,随机化消除了治疗组之间的随机差异,从而在效应估计中产生最小的偏倚。之后,观察到明显的偏倚。在一篇简短的综述中,我们表明在影响最大的医学期刊上发表的3期试验中,41.5%的试验实现了这样的入组人数。
从纳入少数参与者的完全随机试验得出的结论可能不可靠。在这些情况下,应考虑采用如最小化或区组化等替代方法来分配治疗。