Suppr超能文献

因果中介效应的大规模假设检验及其在全基因组表观遗传学研究中的应用

Large-Scale Hypothesis Testing for Causal Mediation Effects with Applications in Genome-wide Epigenetic Studies.

作者信息

Liu Zhonghua, Shen Jincheng, Barfield Richard, Schwartz Joel, Baccarelli Andrea A, Lin Xihong

机构信息

Department of Statistics and Actuarial Science, University of Hong Kong.

Department of Population Health Sciences, University of Utah School of Medicine.

出版信息

J Am Stat Assoc. 2022;117(537):67-81. doi: 10.1080/01621459.2021.1914634. Epub 2021 May 19.

Abstract

In genome-wide epigenetic studies, it is of great scientific interest to assess whether the effect of an exposure on a clinical outcome is mediated through DNA methylations. However, statistical inference for causal mediation effects is challenged by the fact that one needs to test a large number of composite null hypotheses across the whole epigenome. Two popular tests, the Wald-type Sobel's test and the joint significant test using the traditional null distribution are underpowered and thus can miss important scientific discoveries. In this paper, we show that the null distribution of Sobel's test is not the standard normal distribution and the null distribution of the joint significant test is not uniform under the composite null of no mediation effect, especially in finite samples and under the singular point null case that the exposure has no effect on the mediator and the mediator has no effect on the outcome. Our results explain why these two tests are underpowered, and more importantly motivate us to develop a more powerful Divide-Aggregate Composite-null Test (DACT) for the composite null hypothesis of no mediation effect by leveraging epigenome-wide data. We adopted Efron's empirical null framework for assessing statistical significance of the DACT test. We showed analytically that the proposed DACT method had improved power, and could well control type I error rate. Our extensive simulation studies showed that, in finite samples, the DACT method properly controlled the type I error rate and outperformed Sobel's test and the joint significance test for detecting mediation effects. We applied the DACT method to the US Department of Veterans Affairs Normative Aging Study, an ongoing prospective cohort study which included men who were aged 21 to 80 years at entry. We identified multiple DNA methylation CpG sites that might mediate the effect of smoking on lung function with effect sizes ranging from -0.18 to -0.79 and false discovery rate controlled at level 0.05, including the CpG sites in the genes AHRR and F2RL3. Our sensitivity analysis found small residual correlations (less than 0.01) of the error terms between the outcome and mediator regressions, suggesting that our results are robust to unmeasured confounding factors.

摘要

在全基因组表观遗传学研究中,评估暴露因素对临床结局的影响是否通过DNA甲基化介导具有重大科学意义。然而,因果中介效应的统计推断面临挑战,因为需要在整个表观基因组中检验大量复合零假设。两种常用检验方法,即Wald型Sobel检验和使用传统零分布的联合显著性检验,功效不足,因此可能错过重要的科学发现。在本文中,我们表明,在无中介效应的复合零假设下,Sobel检验的零分布不是标准正态分布,联合显著性检验的零分布也不是均匀分布,特别是在有限样本以及暴露因素对中介变量无影响且中介变量对结局无影响的奇点零假设情况下。我们的结果解释了为什么这两种检验功效不足,更重要的是,促使我们利用全表观基因组数据,为无中介效应的复合零假设开发一种功效更强的分-总复合零检验(DACT)。我们采用Efron的经验零框架来评估DACT检验的统计显著性。我们通过分析表明,所提出的DACT方法提高了功效,并且能够很好地控制I型错误率。我们广泛的模拟研究表明,在有限样本中,DACT方法能够正确控制I型错误率,并且在检测中介效应方面优于Sobel检验和联合显著性检验。我们将DACT方法应用于美国退伍军人事务部规范老化研究,这是一项正在进行的前瞻性队列研究,入组时年龄在21至80岁的男性。我们确定了多个可能介导吸烟对肺功能影响的DNA甲基化CpG位点,效应大小范围为-0.18至-0.79,错误发现率控制在0.05水平,包括AHRR和F2RL3基因中的CpG位点。我们的敏感性分析发现结局回归和中介回归误差项之间的残余相关性较小(小于0.01),这表明我们的结果对于未测量的混杂因素具有稳健性。

相似文献

1
Large-Scale Hypothesis Testing for Causal Mediation Effects with Applications in Genome-wide Epigenetic Studies.
J Am Stat Assoc. 2022;117(537):67-81. doi: 10.1080/01621459.2021.1914634. Epub 2021 May 19.
2
Methods for large-scale single mediator hypothesis testing: Possible choices and comparisons.
Genet Epidemiol. 2023 Mar;47(2):167-184. doi: 10.1002/gepi.22510. Epub 2022 Dec 8.
3
Testing for the indirect effect under the null for genome-wide mediation analyses.
Genet Epidemiol. 2017 Dec;41(8):824-833. doi: 10.1002/gepi.22084. Epub 2017 Oct 29.
4
Testing cell-type-specific mediation effects in genome-wide epigenetic studies.
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa131.
5
Adaptive bootstrap tests for composite null hypotheses in the mediation pathway analysis.
J R Stat Soc Series B Stat Methodol. 2023 Nov 14;86(2):411-434. doi: 10.1093/jrsssb/qkad129. eCollection 2024 Apr.
7
Variance component tests of multivariate mediation effects under composite null hypotheses.
Biometrics. 2019 Dec;75(4):1191-1204. doi: 10.1111/biom.13073. Epub 2019 Jun 17.
10
A multiple-testing procedure for high-dimensional mediation hypotheses.
J Am Stat Assoc. 2022;117(537):198-213. doi: 10.1080/01621459.2020.1765785. Epub 2020 Jun 24.

引用本文的文献

1
Post-selection inference for high-dimensional mediation analysis with survival outcomes.
Scand Stat Theory Appl. 2025 Jun;52(2):756-776. doi: 10.1111/sjos.12770. Epub 2025 Feb 9.
4
Debiased machine learning for ultra-high dimensional mediation analysis.
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf282.
6
DETECTING MULTIPLE REPLICATING SIGNALS USING ADAPTIVE FILTERING PROCEDURES.
Ann Stat. 2022 Aug;50(4):1890-1909. doi: 10.1214/21-aos2139. Epub 2022 Aug 25.
8
Mediation analysis in longitudinal study with high-dimensional methylation mediators.
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae496.
9
STAREG: Statistical replicability analysis of high throughput experiments with applications to spatial transcriptomic studies.
PLoS Genet. 2024 Oct 3;20(10):e1011423. doi: 10.1371/journal.pgen.1011423. eCollection 2024 Oct.
10
MASH: MEDIATION ANALYSIS OF SURVIVAL OUTCOME AND HIGH-DIMENSIONAL OMICS MEDIATORS WITH APPLICATION TO COMPLEX DISEASES.
Ann Appl Stat. 2024 Jun;18(2):1360-1377. doi: 10.1214/23-aoas1838. Epub 2024 Apr 5.

本文引用的文献

1
Mediation analysis for common binary outcomes.
Stat Med. 2019 Feb 20;38(4):512-529. doi: 10.1002/sim.7945. Epub 2018 Sep 6.
2
Causal effect of smoking on DNA methylation in peripheral blood: a twin and family study.
Clin Epigenetics. 2018 Feb 9;10:18. doi: 10.1186/s13148-018-0452-9. eCollection 2018.
3
Testing for the indirect effect under the null for genome-wide mediation analyses.
Genet Epidemiol. 2017 Dec;41(8):824-833. doi: 10.1002/gepi.22084. Epub 2017 Oct 29.
5
The effect of smoking on lung function: a clinical study of adult-onset asthma.
Eur Respir J. 2016 Nov;48(5):1298-1306. doi: 10.1183/13993003.00850-2016. Epub 2016 Sep 22.
6
Estimating and testing high-dimensional mediation effects in epigenetic studies.
Bioinformatics. 2016 Oct 15;32(20):3150-3154. doi: 10.1093/bioinformatics/btw351. Epub 2016 Jun 29.
8
DNA Methylation of the Aryl Hydrocarbon Receptor Repressor Associations With Cigarette Smoking and Subclinical Atherosclerosis.
Circ Cardiovasc Genet. 2015 Oct;8(5):707-16. doi: 10.1161/CIRCGENETICS.115.001097. Epub 2015 Aug 25.
9
Smoking-Associated DNA Methylation Biomarkers and Their Predictive Value for All-Cause and Cardiovascular Mortality.
Environ Health Perspect. 2016 Jan;124(1):67-74. doi: 10.1289/ehp.1409020. Epub 2015 May 27.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验