Bhangu Sukhvir Kaur, Welch Nicholas, Lewis Morgan, Li Fanyi, Gardner Brint, Thissen Helmut, Kowalczyk Wioleta
CSIRO Manufacturing Research Way Clayton Victoria 3168 Australia.
CSIRO Information Management & Technology Kensington Western Australia 6151 Australia.
Small Sci. 2025 Mar 6;5(6):2400579. doi: 10.1002/smsc.202400579. eCollection 2025 Jun.
Antimicrobial peptides (AMPs) offer a highly potent alternative solution due to their broad-spectrum activity and minimum resistance development against the rapidly evolving antibiotic-resistant pathogens. Herein, to accelerate the discovery process of new AMPs, a predictive and generative algorithm is build, which constructs new peptide sequences, scores their antimicrobial activity using a machine learning (ML) model, identifies amino acid motifs, and assembles high-ranking motifs into new peptide sequences. The eXtreme Gradient Boosting model achieves an accuracy of ≈87% in distinguishing between AMPs and non-AMPs. The generated peptide sequences are experimentally validated against the bacterial pathogens, and an accuracy of ≈60% is achieved. To refine the algorithm, the physicochemical features are analyzed, particularly charge and hydrophobicity of experimentally validated peptides. The peptides with specific range of charge and hydrophobicity are then removed, which lead to a substantial increase in an experimental accuracy, from ≈60% to ≈80%. Furthermore, generated peptides are active against different fungal strains with minimal off-target toxicity. In summary, in silico predictive and generative models for functional motif and AMP discovery are powerful tools for engineering highly effective AMPs to combat multidrug resistant pathogens.
抗菌肽(AMPs)由于其广谱活性以及对快速演变的抗生素耐药病原体产生耐药性的可能性最小,提供了一种高效的替代解决方案。在此,为了加速新型抗菌肽的发现过程,构建了一种预测性和生成性算法,该算法可构建新的肽序列,使用机器学习(ML)模型对其抗菌活性进行评分,识别氨基酸基序,并将高级基序组装成新的肽序列。极端梯度提升模型在区分抗菌肽和非抗菌肽方面的准确率约为87%。生成的肽序列针对细菌病原体进行了实验验证,准确率约为60%。为了优化该算法,分析了物理化学特征,特别是经实验验证的肽的电荷和疏水性。然后去除具有特定电荷和疏水性范围的肽,这使得实验准确率大幅提高,从约60%提高到约80%。此外,生成的肽对不同的真菌菌株具有活性,脱靶毒性最小。总之,用于功能基序和抗菌肽发现的计算机预测和生成模型是设计高效抗菌肽以对抗多药耐药病原体的有力工具。