McGrath Sean, Zhao XiaoFei, Steele Russell, Thombs Brett D, Benedetti Andrea
Respiratory Epidemiology and Clinical Research Unit (RECRU), McGill University Health Centre, Montreal, Quebec, Canada.
Department of Mathematics and Statistics, McGill University, Montreal, Quebec, Canada.
Stat Methods Med Res. 2020 Sep;29(9):2520-2537. doi: 10.1177/0962280219889080. Epub 2020 Jan 30.
Researchers increasingly use meta-analysis to synthesize the results of several studies in order to estimate a common effect. When the outcome variable is continuous, standard meta-analytic approaches assume that the primary studies report the sample mean and standard deviation of the outcome. However, when the outcome is skewed, authors sometimes summarize the data by reporting the sample median and one or both of (i) the minimum and maximum values and (ii) the first and third quartiles, but do not report the mean or standard deviation. To include these studies in meta-analysis, several methods have been developed to estimate the sample mean and standard deviation from the reported summary data. A major limitation of these widely used methods is that they assume that the outcome distribution is normal, which is unlikely to be tenable for studies reporting medians. We propose two novel approaches to estimate the sample mean and standard deviation when data are suspected to be non-normal. Our simulation results and empirical assessments show that the proposed methods often perform better than the existing methods when applied to non-normal data.
研究人员越来越多地使用荟萃分析来综合多项研究的结果,以估计一个共同效应。当结果变量是连续变量时,标准的荟萃分析方法假定原始研究报告了结果的样本均值和标准差。然而,当结果呈偏态分布时,作者有时会通过报告样本中位数以及(i)最小值和最大值中的一个或两个,和(ii)第一和第三四分位数来汇总数据,但不报告均值或标准差。为了将这些研究纳入荟萃分析,已经开发了几种方法来从报告的汇总数据中估计样本均值和标准差。这些广泛使用的方法的一个主要局限性是,它们假定结果分布是正态的,而对于报告中位数的研究来说这一假设不太可能成立。我们提出了两种新方法,用于在数据疑似非正态时估计样本均值和标准差。我们的模拟结果和实证评估表明,当应用于非正态数据时,所提出的方法通常比现有方法表现更好。