MRC (Medical Research Council) Integrative Epidemiology Unit, University of Bristol, Bristol, United Kingdom; email:
Annu Rev Biomed Data Sci. 2022 Aug 10;5:1-17. doi: 10.1146/annurev-biodatasci-122120-024910. Epub 2022 Apr 1.
statistics for genome-wide association studies (GWAS) are increasingly available for downstream analyses. Meanwhile, the popularity of causal inference methods has grown as we look to gather robust evidence for novel medical and public health interventions. This has led to the development of methods that use GWAS summary statistics for causal inference. Here, we describe these methods in order of their escalating complexity, from genetic associations to extensions of Mendelian randomization that consider thousands of phenotypes simultaneously. We also cover the assumptions and limitations of these approaches before considering the challenges faced by researchers performing causal inference using GWAS data. GWAS summary statistics constitute an important data source for causal inference research that offers a counterpoint to nongenetic methods when triangulating evidence. Continued efforts to address the challenges in using GWAS data for causal inference will allow the full impact of these approaches to be realized.
全基因组关联研究(GWAS)的统计数据越来越多地可用于下游分析。与此同时,随着我们寻求为新的医学和公共卫生干预措施收集有力证据,因果推断方法的普及程度也在不断提高。这导致了使用 GWAS 汇总统计数据进行因果推断的方法的发展。在这里,我们按照其复杂程度的顺序描述这些方法,从遗传关联到同时考虑数千种表型的孟德尔随机化扩展。我们还介绍了这些方法的假设和局限性,然后考虑了使用 GWAS 数据进行因果推断的研究人员所面临的挑战。GWAS 汇总统计数据是因果推断研究的重要数据来源,当对证据进行三角测量时,它为非遗传方法提供了一个对照。为了解决使用 GWAS 数据进行因果推断所面临的挑战,将继续努力,以使这些方法的全部影响得以实现。