Murphy Amy, Weiss Scott T, Lange Christoph
Channing Laboratory, Brigham and Women's Hospital, Boston, Massachusetts, United States of America.
PLoS Genet. 2008 Sep 19;4(9):e1000197. doi: 10.1371/journal.pgen.1000197.
For genome-wide association studies in family-based designs, we propose a powerful two-stage testing strategy that can be applied in situations in which parent-offspring trio data are available and all offspring are affected with the trait or disease under study. In the first step of the testing strategy, we construct estimators of genetic effect size in the completely ascertained sample of affected offspring and their parents that are statistically independent of the family-based association/transmission disequilibrium tests (FBATs/TDTs) that are calculated in the second step of the testing strategy. For each marker, the genetic effect is estimated (without requiring an estimate of the SNP allele frequency) and the conditional power of the corresponding FBAT/TDT is computed. Based on the power estimates, a weighted Bonferroni procedure assigns an individually adjusted significance level to each SNP. In the second stage, the SNPs are tested with the FBAT/TDT statistic at the individually adjusted significance levels. Using simulation studies for scenarios with up to 1,000,000 SNPs, varying allele frequencies and genetic effect sizes, the power of the strategy is compared with standard methodology (e.g., FBATs/TDTs with Bonferroni correction). In all considered situations, the proposed testing strategy demonstrates substantial power increases over the standard approach, even when the true genetic model is unknown and must be selected based on the conditional power estimates. The practical relevance of our methodology is illustrated by an application to a genome-wide association study for childhood asthma, in which we detect two markers meeting genome-wide significance that would not have been detected using standard methodology.
对于基于家系设计的全基因组关联研究,我们提出了一种强大的两阶段检验策略,该策略可应用于有亲子三联体数据且所有后代都受所研究性状或疾病影响的情况。在检验策略的第一步中,我们在完全确定的受影响后代及其父母样本中构建遗传效应大小的估计量,这些估计量在统计上独立于在检验策略第二步中计算的基于家系的关联/传递不平衡检验(FBATs/TDTs)。对于每个标记,估计遗传效应(无需估计SNP等位基因频率)并计算相应FBAT/TDT的条件功效。基于功效估计,加权邦费罗尼程序为每个SNP分配一个单独调整的显著性水平。在第二阶段,使用FBAT/TDT统计量在单独调整的显著性水平下对SNP进行检验。通过对多达1,000,000个SNP、不同等位基因频率和遗传效应大小的模拟研究,将该策略的功效与标准方法(例如,经过邦费罗尼校正的FBATs/TDTs)进行比较。在所有考虑的情况下,即使真实遗传模型未知且必须根据条件功效估计进行选择,所提出的检验策略也比标准方法显示出显著的功效提升。我们通过将其应用于儿童哮喘的全基因组关联研究来说明我们方法的实际相关性,在该研究中,我们检测到两个达到全基因组显著性的标记,而使用标准方法则无法检测到。