Max Planck Institute for Molecular Genetics, Otto Warburg Laboratory, Berlin, Germany.
Zhengzhou Tobacco Research Institute of China National Tobacco Corporation, Zhengzhou, China.
IET Syst Biol. 2024 Feb;18(1):14-22. doi: 10.1049/syb2.12085. Epub 2024 Jan 9.
The transforming growth factor-β (TGF-β) superfamily, including Nodal and Activin, plays a critical role in various cellular processes. Understanding the intricate regulation and gene expression dynamics of TGF-β signalling is of interest due to its diverse biological roles. A machine learning approach is used to predict gene expression patterns induced by Activin using features, such as histone modifications, RNA polymerase II binding, SMAD2-binding, and mRNA half-life. RNA sequencing and ChIP sequencing datasets were analysed and differentially expressed SMAD2-binding genes were identified. These genes were classified into activated and repressed categories based on their expression patterns. The predictive power of different features and combinations was evaluated using logistic regression models and their performances were assessed. Results showed that RNA polymerase II binding was the most informative feature for predicting the expression patterns of SMAD2-binding genes. The authors provide insights into the interplay between transcriptional regulation and Activin signalling and offers a computational framework for predicting gene expression patterns in response to cell signalling.
转化生长因子-β(TGF-β)超家族,包括 Nodal 和 Activin,在各种细胞过程中发挥着关键作用。由于 TGF-β 信号转导具有多种生物学作用,因此研究其复杂的调控和基因表达动力学具有重要意义。本文使用机器学习方法,利用组蛋白修饰、RNA 聚合酶 II 结合、SMAD2 结合和 mRNA 半衰期等特征,预测 Activin 诱导的基因表达模式。对 RNA 测序和 ChIP 测序数据集进行了分析,并确定了差异表达的 SMAD2 结合基因。根据其表达模式,这些基因被分为激活和抑制两类。使用逻辑回归模型评估了不同特征和组合的预测能力,并评估了它们的性能。结果表明,RNA 聚合酶 II 结合是预测 SMAD2 结合基因表达模式的最具信息量的特征。作者深入探讨了转录调控与 Activin 信号之间的相互作用,并提供了一个用于预测细胞信号响应基因表达模式的计算框架。