Suppr超能文献

机器学习辅助的抗菌肽预测与生成

Machine Learning-Assisted Prediction and Generation of Antimicrobial Peptides.

作者信息

Bhangu Sukhvir Kaur, Welch Nicholas, Lewis Morgan, Li Fanyi, Gardner Brint, Thissen Helmut, Kowalczyk Wioleta

机构信息

CSIRO Manufacturing Research Way Clayton Victoria 3168 Australia.

CSIRO Information Management & Technology Kensington Western Australia 6151 Australia.

出版信息

Small Sci. 2025 Mar 6;5(6):2400579. doi: 10.1002/smsc.202400579. eCollection 2025 Jun.

Abstract

Antimicrobial peptides (AMPs) offer a highly potent alternative solution due to their broad-spectrum activity and minimum resistance development against the rapidly evolving antibiotic-resistant pathogens. Herein, to accelerate the discovery process of new AMPs, a predictive and generative algorithm is build, which constructs new peptide sequences, scores their antimicrobial activity using a machine learning (ML) model, identifies amino acid motifs, and assembles high-ranking motifs into new peptide sequences. The eXtreme Gradient Boosting model achieves an accuracy of ≈87% in distinguishing between AMPs and non-AMPs. The generated peptide sequences are experimentally validated against the bacterial pathogens, and an accuracy of ≈60% is achieved. To refine the algorithm, the physicochemical features are analyzed, particularly charge and hydrophobicity of experimentally validated peptides. The peptides with specific range of charge and hydrophobicity are then removed, which lead to a substantial increase in an experimental accuracy, from ≈60% to ≈80%. Furthermore, generated peptides are active against different fungal strains with minimal off-target toxicity. In summary, in silico predictive and generative models for functional motif and AMP discovery are powerful tools for engineering highly effective AMPs to combat multidrug resistant pathogens.

摘要

抗菌肽(AMPs)由于其广谱活性以及对快速演变的抗生素耐药病原体产生耐药性的可能性最小,提供了一种高效的替代解决方案。在此,为了加速新型抗菌肽的发现过程,构建了一种预测性和生成性算法,该算法可构建新的肽序列,使用机器学习(ML)模型对其抗菌活性进行评分,识别氨基酸基序,并将高级基序组装成新的肽序列。极端梯度提升模型在区分抗菌肽和非抗菌肽方面的准确率约为87%。生成的肽序列针对细菌病原体进行了实验验证,准确率约为60%。为了优化该算法,分析了物理化学特征,特别是经实验验证的肽的电荷和疏水性。然后去除具有特定电荷和疏水性范围的肽,这使得实验准确率大幅提高,从约60%提高到约80%。此外,生成的肽对不同的真菌菌株具有活性,脱靶毒性最小。总之,用于功能基序和抗菌肽发现的计算机预测和生成模型是设计高效抗菌肽以对抗多药耐药病原体的有力工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c154/12168616/cf105367513c/SMSC-5-2400579-g003.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验