Suppr超能文献

ACP-ML:一种基于序列的抗癌肽预测方法。

ACP-ML: A sequence-based method for anticancer peptide prediction.

机构信息

Northeast Forestry University, College of Computer and Control Engineering, Harbin, Heilongjiang, China.

Northeast Forestry University, College of Computer and Control Engineering, Harbin, Heilongjiang, China.

出版信息

Comput Biol Med. 2024 Mar;170:108063. doi: 10.1016/j.compbiomed.2024.108063. Epub 2024 Jan 28.

Abstract

Cancer is a serious malignant tumor and is difficult to cure. Chemotherapy, as a primary treatment for cancer, causes significant harm to normal cells in the body and is often accompanied by serious side effects. Recently, anti-cancer peptides (ACPs) as a type of protein for treating cancers dominated research into the development of new anti-tumor drugs because of their ability to specifically target and destroy cancer cells. The screening of proteins with cancer-inhibiting properties from a large pool of proteins is key to the development of anti-tumor drugs. However, it is expensive and inefficient to accurately identify protein functions only through biological experiments due to their complex structure. Therefore, we propose a new prediction model ACP-ML to effectively predict ACPs. In terms of feature extraction, DPC, PseAAC, CTDC, CTDT and CS-Pse-PSSM features were used and the most optimal feature set was selected by comparing combinations of these features. Then, a two-step feature selection process using MRMD and RFE algorithms was performed to determine the most crucial features from the most optimal feature set for identifying ACPs. Furthermore, we assessed the classification accuracy of single learning models and different strategies-based ensemble models through ten-fold cross-validation. Ultimately, a voting-based ensemble learning method is developed to predict ACPs. To validate its effectiveness, two independent test sets were used to perform tests, achieving accuracy of 90.891 % and 92.578 % respectively. Compared with existing anticancer peptide prediction algorithms, the proposed feature processing method is more effective, and the proposed ensemble model ACP-ML exhibits stronger generalization capability and higher accuracy.

摘要

癌症是一种严重的恶性肿瘤,难以治愈。化疗作为癌症的主要治疗方法,会对体内正常细胞造成严重伤害,并且常伴有严重的副作用。最近,抗癌肽 (ACP) 作为治疗癌症的一种蛋白质类型,因其能够特异性地靶向和破坏癌细胞而成为开发新型抗肿瘤药物的研究热点。从大量蛋白质中筛选具有抗癌特性的蛋白质是开发抗肿瘤药物的关键。然而,由于其复杂的结构,仅通过生物实验来准确识别蛋白质功能既昂贵又低效。因此,我们提出了一种新的预测模型 ACP-ML 来有效预测 ACP。在特征提取方面,使用了 DPC、PseAAC、CTDC、CTDT 和 CS-Pse-PSSM 特征,并通过比较这些特征的组合来选择最佳特征集。然后,使用 MRMD 和 RFE 算法进行两步特征选择过程,从最佳特征集中确定识别 ACP 的最关键特征。此外,我们通过十折交叉验证评估了单个学习模型和不同策略的集成模型的分类准确性。最终,采用投票式集成学习方法预测 ACP。为了验证其有效性,我们使用两个独立的测试集进行测试,分别达到了 90.891%和 92.578%的准确率。与现有的抗癌肽预测算法相比,所提出的特征处理方法更有效,所提出的集成模型 ACP-ML 具有更强的泛化能力和更高的准确性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验