Fattore Matteo, Arrigo Patrizio
CNR ISMAC, Section of Genoa, Via De Marini 6, 16149 Genova, Italy.
In Silico Biol. 2005;5(2):199-208.
The possibility to study an organism in terms of system theory has been proposed in the past, but only the advancement of molecular biology techniques allow us to investigate the dynamical properties of a biological system in a more quantitative and rational way than before . These new techniques can gave only the basic level view of an organisms functionality. The comprehension of its dynamical behaviour depends on the possibility to perform a multiple level analysis. Functional genomics has stimulated the interest in the investigation the dynamical behaviour of an organism as a whole. These activities are commonly known as System Biology, and its interests ranges from molecules to organs. One of the more promising applications is the 'disease modeling'. The use of experimental models is a common procedure in pharmacological and clinical researches; today this approach is supported by 'in silico' predictive methods. This investigation can be improved by a combination of experimental and computational tools. The Machine Learning (ML) tools are able to process different heterogeneous data sources, taking into account this peculiarity, they could be fruitfully applied to support a multilevel data processing (molecular, cellular and morphological) that is the prerequisite for the formal model design; these techniques can allow us to extract the knowledge for mathematical model development. The aim of our work is the development and implementation of a system that combines ML and dynamical models simulations. The program is addressed to the virtual analysis of the pathways involved in neurodegenerative diseases. These pathologies are multifactorial diseases and the relevance of the different factors has not yet been well elucidated. This is a very complex task; in order to test the integrative approach our program has been limited to the analysis of the effects of a specific protein, the Cyclin dependent kinase 5 (CDK5) which relies on the induction of neuronal apoptosis. The system has a modular structure centred on a textual knowledge discovery approach. The text mining is the only way to enhance the capability to extract ,from multiple data sources, the information required for the dynamical simulator. The user may access the publically available modules through the following site: http://biocomp.ge.ismac.cnr.it.
过去曾有人提出从系统理论的角度研究生物体的可能性,但只有分子生物学技术的进步使我们能够以比以往更定量、更合理的方式研究生物系统的动态特性。这些新技术只能提供生物体功能的基本层面的视图。对其动态行为的理解取决于进行多层次分析的可能性。功能基因组学激发了人们对研究生物体整体动态行为的兴趣。这些活动通常被称为系统生物学,其研究范围从分子到器官。其中一个更有前景的应用是“疾病建模”。使用实验模型是药理学和临床研究中的常见程序;如今这种方法得到了“计算机模拟”预测方法的支持。通过实验和计算工具的结合可以改进这项研究。机器学习(ML)工具能够处理不同的异构数据源,考虑到这一特性,它们可以有效地应用于支持多层次数据处理(分子、细胞和形态学),这是形式模型设计的前提条件;这些技术可以使我们提取用于数学模型开发所需的知识。我们工作的目的是开发和实现一个结合机器学习和动态模型模拟的系统。该程序旨在对神经退行性疾病所涉及的通路进行虚拟分析。这些疾病是多因素疾病,不同因素的相关性尚未得到很好的阐明。这是一项非常复杂的任务;为了测试这种综合方法,我们的程序仅限于分析一种特定蛋白质——细胞周期蛋白依赖性激酶5(CDK5)的作用,该蛋白依赖于神经元凋亡的诱导。该系统具有以文本知识发现方法为中心的模块化结构。文本挖掘是提高从多个数据源提取动态模拟器所需信息能力的唯一途径。用户可以通过以下网站访问公开可用的模块:http://biocomp.ge.ismac.cnr.it。