Suppr超能文献

基于伪氨基酸组成特征的枯草芽孢杆菌木聚糖酶活力预测的计算方法。

A computational method for prediction of xylanase enzymes activity in strains of Bacillus subtilis based on pseudo amino acid composition features.

机构信息

Department of Systems Biology, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research Education and Extension Organization (AREO), Karaj, Iran.

Department of Microbial Biotechnology, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research Education and Extension Organization (AREO), Karaj, Iran.

出版信息

PLoS One. 2018 Oct 22;13(10):e0205796. doi: 10.1371/journal.pone.0205796. eCollection 2018.

Abstract

Xylanases are hydrolytic enzymes which based on physicochemical properties, structure, mode of action and substrate specificities are classified into various glycoside hydrolase (GH) families. The purpose of this study is to show that the activity of the members of the xylanase family in the specified pH and temperature conditions can be computationally predicted. The proposed computational regression model was trained and tested with the Pseudo Amino Acid Composition (PseAAC) features extracted solely from the amino acid sequences of enzymes. The xylanases with experimentally determined activities were used as the training dataset to adjust the model parameters. To develop the model, 41 strains of Bacillus subtilis isolated from field soil were screened. From them, 28 strains with the highest halo diameter were selected for further studies. The performance of the model for prediction of xylanase activity was evaluated in three different temperature and pH conditions using stratified cross-validation and jackknife methods. The trained model can be used for determining the activity of newly found xylanases in the specified condition. Such computational models help to scale down the experimental costs and save time by identifying enzymes with appropriate activity for scientific and industrial usage. Our methodology for activity prediction of xylanase enzymes can be potentially applied to the members of the other enzyme families. The availability of sufficient experimental data in specified pH and temperature conditions is a prerequisite for training the learning model and to achieve high accuracy.

摘要

木聚糖酶是水解酶,根据理化性质、结构、作用方式和底物特异性,可分为各种糖苷水解酶(GH)家族。本研究旨在表明,在特定 pH 值和温度条件下,木聚糖酶家族成员的活性可以通过计算进行预测。所提出的计算回归模型是使用仅从酶的氨基酸序列中提取的伪氨基酸组成(PseAAC)特征进行训练和测试的。用实验确定的活性木聚糖酶作为训练数据集来调整模型参数。为了开发该模型,从野外土壤中筛选了 41 株枯草芽孢杆菌。从中选择了 28 株具有最高晕圈直径的菌株进行进一步研究。使用分层交叉验证和 Jackknife 方法在三种不同的温度和 pH 条件下评估模型预测木聚糖酶活性的性能。训练好的模型可用于在指定条件下确定新发现的木聚糖酶的活性。此类计算模型有助于通过鉴定具有适当活性的酶来降低实验成本和节省时间,从而用于科学和工业用途。我们预测木聚糖酶活性的方法可以潜在地应用于其他酶家族的成员。在指定的 pH 值和温度条件下,有足够的实验数据是训练学习模型和获得高精度的前提。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e087/6197662/957af5a90a6d/pone.0205796.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验