Department of Epidemiology and Biostatistics, School of Public Health, Medical College of Soochow University, Suzhou 215123, China.
Stat Appl Genet Mol Biol. 2023 Nov 29;22(1). doi: 10.1515/sagmb-2023-0031. eCollection 2023 Jan 1.
High-throughput technologies have made high-dimensional settings increasingly common, providing opportunities for the development of high-dimensional mediation methods. We aimed to provide useful guidance for researchers using high-dimensional mediation analysis and ideas for biostatisticians to develop it by summarizing and discussing recent advances in high-dimensional mediation analysis. The method still faces many challenges when extended single and multiple mediation analyses to high-dimensional settings. The development of high-dimensional mediation methods attempts to address these issues, such as screening true mediators, estimating mediation effects by variable selection, reducing the mediation dimension to resolve correlations between variables, and utilizing composite null hypothesis testing to test them. Although these problems regarding high-dimensional mediation have been solved to some extent, some challenges remain. First, the correlation between mediators are rarely considered when the variables are selected for mediation. Second, downscaling without incorporating prior biological knowledge makes the results difficult to interpret. In addition, a method of sensitivity analysis for the strict sequential ignorability assumption in high-dimensional mediation analysis is still lacking. An analyst needs to consider the applicability of each method when utilizing them, while a biostatistician could consider extensions and improvements in the methodology.
高通量技术使得高维设置变得越来越普遍,为高维中介分析方法的发展提供了机会。我们旨在通过总结和讨论高维中介分析的最新进展,为使用高维中介分析的研究人员提供有用的指导,并为生物统计学家提供开发思路。当将单一和多重中介分析扩展到高维设置时,该方法仍然面临许多挑战。高维中介方法的发展试图解决这些问题,例如筛选真正的中介,通过变量选择估计中介效应,减少中介维度以解决变量之间的相关性,以及利用复合零假设检验来检验它们。尽管这些高维中介问题在一定程度上得到了解决,但仍存在一些挑战。首先,在选择中介变量时很少考虑中介变量之间的相关性。其次,在不纳入先验生物学知识的情况下进行降尺度处理使得结果难以解释。此外,高维中介分析中严格的顺序可忽略性假设的敏感性分析方法仍然缺乏。分析师在使用这些方法时需要考虑每种方法的适用性,而生物统计学家可以考虑方法的扩展和改进。