Liu Changning, Xuan Zhenyu
Department of Biological Sciences, Center for Systems Biology, University of Texas at Dallas, Richardson, Texas, USA. ; Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China.
Department of Biological Sciences, Center for Systems Biology, University of Texas at Dallas, Richardson, Texas, USA.
Cancer Inform. 2015 Apr 1;14(Suppl 2):57-70. doi: 10.4137/CIN.S17288. eCollection 2015.
We have developed a general framework to construct an association network of single nucleotide polymorphisms (SNPs) (SNP association network, SAN) based on the functional interactions of genes located in the flanking regions of SNPs. SAN, which was constructed based on protein-protein interactions in the Human Protein Reference Database (HPRD), showed significantly enriched signals in both linkage disequilibrium (LD) and long-range chromatin interaction (Hi-C). We used this network to further develop two methods for predicting and prioritizing disease-associated genes from genome-wide association studies (GWASs). We found that random walk with restart (RWR) using SAN (RWR-SAN) can greatly improve the prediction of lung-cancer-associated genes by comparing RWR with the use of network in HPRD (AUC 0.81 vs 0.66). In a reanalysis of the GWAS dataset of age-related macular degeneration (AMD), SAN could identify more potential AMD-associated genes that were previously ranked lower in the GWAS study. The interactions in SAN could facilitate the study of complex diseases.
我们基于单核苷酸多态性(SNP)侧翼区域基因的功能相互作用,开发了一个构建SNP关联网络(SNP association network,SAN)的通用框架。基于人类蛋白质参考数据库(HPRD)中的蛋白质-蛋白质相互作用构建的SAN,在连锁不平衡(LD)和远程染色质相互作用(Hi-C)中均显示出显著富集的信号。我们利用这个网络进一步开发了两种从全基因组关联研究(GWAS)中预测疾病相关基因并对其进行优先级排序的方法。我们发现,与使用HPRD中的网络进行随机游走重启(RWR)相比,使用SAN的随机游走重启(RWR-SAN)能极大地提高对肺癌相关基因的预测能力(AUC分别为0.81和0.66)。在对年龄相关性黄斑变性(AMD)的GWAS数据集进行重新分析时,SAN能够识别出更多在GWAS研究中先前排名较低的潜在AMD相关基因。SAN中的相互作用有助于复杂疾病的研究。