Wang Xu, Alshawaqfeh Mustafa, Dang Xuan, Wajid Bilal, Noor Amina, Qaraqe Marwa, Serpedin Erchin
Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843, USA.
Institute of Genomic Medicine, University of California San Diego, La Jolla, CA 92093, USA.
Microarrays (Basel). 2015 Nov 16;4(4):596-617. doi: 10.3390/microarrays4040596.
In systems biology, the regulation of gene expressions involves a complex network of regulators. Transcription factors (TFs) represent an important component of this network: they are proteins that control which genes are turned on or off in the genome by binding to specific DNA sequences. Transcription regulatory networks (TRNs) describe gene expressions as a function of regulatory inputs specified by interactions between proteins and DNA. A complete understanding of TRNs helps to predict a variety of biological processes and to diagnose, characterize and eventually develop more efficient therapies. Recent advances in biological high-throughput technologies, such as DNA microarray data and next-generation sequence (NGS) data, have made the inference of transcription factor activities (TFAs) and TF-gene regulations possible. Network component analysis (NCA) represents an efficient computational framework for TRN inference from the information provided by microarrays, ChIP-on-chip and the prior information about TF-gene regulation. However, NCA suffers from several shortcomings. Recently, several algorithms based on the NCA framework have been proposed to overcome these shortcomings. This paper first overviews the computational principles behind NCA, and then, it surveys the state-of-the-art NCA-based algorithms proposed in the literature for TRN reconstruction.
在系统生物学中,基因表达的调控涉及一个复杂的调控网络。转录因子(TFs)是这个网络的重要组成部分:它们是通过与特定DNA序列结合来控制基因组中哪些基因开启或关闭的蛋白质。转录调控网络(TRNs)将基因表达描述为蛋白质与DNA之间相互作用所指定的调控输入的函数。对TRNs的全面理解有助于预测各种生物学过程,并诊断、表征并最终开发出更有效的治疗方法。生物高通量技术的最新进展,如DNA微阵列数据和下一代测序(NGS)数据,使得转录因子活性(TFAs)和TF-基因调控的推断成为可能。网络组件分析(NCA)是一种从微阵列、芯片上染色质免疫沉淀(ChIP-on-chip)提供的信息以及关于TF-基因调控的先验信息推断TRN的有效计算框架。然而,NCA存在几个缺点。最近,已经提出了几种基于NCA框架的算法来克服这些缺点。本文首先概述了NCA背后的计算原理,然后,综述了文献中为TRN重建提出的基于NCA的最新算法。