Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America.
Department of Oncology, Johns Hopkins School of Medicine, Baltimore, Maryland, United States of America.
PLoS Genet. 2018 Oct 5;14(10):e1007549. doi: 10.1371/journal.pgen.1007549. eCollection 2018 Oct.
Genome-wide association studies have shown that pleiotropy is a common phenomenon that can potentially be exploited for enhanced detection of susceptibility loci. We propose heritability informed power optimization (HIPO) for conducting powerful pleiotropic analysis using summary-level association statistics. We find optimal linear combinations of association coefficients across traits that are expected to maximize non-centrality parameter for the underlying test statistics, taking into account estimates of heritability, sample size variations and overlaps across the traits. Simulation studies show that the proposed method has correct type I error, robust to population stratification and leads to desired genome-wide enrichment of association signals. Application of the proposed method to publicly available data for three groups of genetically related traits, lipids (N = 188,577), psychiatric diseases (Ncase = 33,332, Ncontrol = 27,888) and social science traits (N ranging between 161,460 to 298,420 across individual traits) increased the number of genome-wide significant loci by 12%, 200% and 50%, respectively, compared to those found by analysis of individual traits. Evidence of replication is present for many of these loci in subsequent larger studies for individual traits. HIPO can potentially be extended to high-dimensional phenotypes as a way of dimension reduction to maximize power for subsequent genetic association testing.
全基因组关联研究表明,多效性是一种常见现象,可用于增强对易感基因座的检测。我们提出了基于遗传力的功效优化(HIPO)方法,用于使用汇总关联统计数据进行强大的多效性分析。我们找到了跨性状的关联系数的最优线性组合,这些组合预计将最大化基础检验统计量的非中心参数,同时考虑遗传力、样本量变化和性状之间的重叠的估计。模拟研究表明,所提出的方法具有正确的Ⅰ型错误率,对群体分层稳健,并且导致关联信号在全基因组范围内得到期望的富集。将所提出的方法应用于三个遗传相关性状的公开可用数据,脂质(N = 188,577)、精神疾病(Ncase = 33,332,Ncontrol = 27,888)和社会科学性状(N 在每个性状之间的 161,460 到 298,420 范围内),与分析单个性状相比,全基因组显著基因座的数量分别增加了 12%、200%和 50%。在随后的个体性状的更大研究中,这些基因座中有许多都有复制的证据。HIPO 可以作为一种降维方法扩展到高维表型,以最大化随后的遗传关联测试的功效。