Chen Wei, Chang Chunqi, Hung Y S
Department of Electrical and Electronic Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong.
Annu Int Conf IEEE Eng Med Biol Soc. 2010;2010:1061-4. doi: 10.1109/IEMBS.2010.5627641.
Transcription factors (TFs) play an important role in regulating the expression of genes. The accurate measurement of transcription factor activities (TFAs) depends on a series of experimental technologies of molecular biology and is intractable in most practical situations. Some signal processing methods for blind source separation have been applied in the prediction of TFAs from gene expression data. Most of such methods make use of statistical properties of the gene expression data only, leading to the inaccurate detection of TFAs. In contrast, network component analysis (NCA) can provide much improved result through utilizing the structural information of the gene regulatory network. However, the structure of the gene regulatory network, required by NCA, is not available in most practical cases so that NCA is not directly applicable. In this paper, we propose to use particle swarm optimization (PSO) to find the most plausible network structure iteratively from the gene expression data, with the assistance of recently developed fast algorithm for network component analysis (FastNCA). This novel approach to TFA inference can thus take advantage of NCA, even when the required network structure is unknown. The effectiveness of our novel approach has been demonstrated by applications to both simulated data and real gene expression microarray data, in the sense that TFAs can be inferred with high accuracy.
转录因子(TFs)在调控基因表达中发挥着重要作用。转录因子活性(TFAs)的准确测量依赖于一系列分子生物学实验技术,并且在大多数实际情况下都难以处理。一些用于盲源分离的信号处理方法已被应用于从基因表达数据预测TFAs。大多数此类方法仅利用基因表达数据的统计特性,导致TFAs检测不准确。相比之下,网络成分分析(NCA)通过利用基因调控网络的结构信息可以提供更好的结果。然而,NCA所需的基因调控网络结构在大多数实际情况下并不存在,因此NCA不能直接应用。在本文中,我们建议使用粒子群优化(PSO),借助最近开发的网络成分分析快速算法(FastNCA),从基因表达数据中迭代地找到最合理的网络结构。即使所需的网络结构未知,这种用于TFA推断的新方法也能够利用NCA。通过应用于模拟数据和真实基因表达微阵列数据,我们的新方法的有效性得到了证明,即可以高精度地推断TFAs。