Chen Jiarui, Cheong Hong Hin, Siu Shirley W I
Department of Computer and Information Science, University of Macau, Avenida da Universidade, Taipa, Macau 999078, China.
School of Pharmaceutical Sciences, Universiti Sains Malaysia, 11800 USM, Penang, Malaysia.
J Chem Inf Model. 2021 Aug 23;61(8):3789-3803. doi: 10.1021/acs.jcim.1c00181. Epub 2021 Jul 30.
Cancer is one of the leading causes of death worldwide. Conventional cancer treatment relies on radiotherapy and chemotherapy, but both methods bring severe side effects to patients, as these therapies not only attack cancer cells but also damage normal cells. Anticancer peptides (ACPs) are a promising alternative as therapeutic agents that are efficient and selective against tumor cells. Here, we propose a deep learning method based on convolutional neural networks to predict biological activity (EC50, LC50, IC50, and LD50) against six tumor cells, including breast, colon, cervix, lung, skin, and prostate. We show that models derived with multitask learning achieve better performance than conventional single-task models. In repeated 5-fold cross validation using the CancerPPD data set, the best models with the applicability domain defined obtain an average mean squared error of 0.1758, Pearson's correlation coefficient of 0.8086, and Kendall's correlation coefficient of 0.6156. As a step toward model interpretability, we infer the contribution of each residue in the sequence to the predicted activity by means of feature importance weights derived from the convolutional layers of the model. The present method, referred to as xDeep-AcPEP, will help to identify effective ACPs in rational peptide design for therapeutic purposes. The data, script files for reproducing the experiments, and the final prediction models can be downloaded from http://github.com/chen709847237/xDeep-AcPEP. The web server to directly access this prediction method is at https://app.cbbio.online/acpep/home.
癌症是全球主要死因之一。传统的癌症治疗依赖于放疗和化疗,但这两种方法都会给患者带来严重的副作用,因为这些疗法不仅会攻击癌细胞,还会损害正常细胞。抗癌肽(ACPs)作为一种对肿瘤细胞高效且有选择性的治疗剂,是一种很有前景的替代方案。在此,我们提出一种基于卷积神经网络的深度学习方法,用于预测针对六种肿瘤细胞(包括乳腺癌、结肠癌、宫颈癌、肺癌、皮肤癌和前列腺癌)的生物活性(半数有效浓度(EC50)、半数致死浓度(LC50)、半数抑制浓度(IC50)和半数致死剂量(LD50))。我们表明,通过多任务学习得出的模型比传统的单任务模型具有更好的性能。在使用CancerPPD数据集进行的重复5折交叉验证中,定义了适用域的最佳模型的平均均方误差为0.1758,皮尔逊相关系数为0.8086,肯德尔相关系数为0.6156。作为迈向模型可解释性的一步,我们通过从模型卷积层导出的特征重要性权重来推断序列中每个残基对预测活性的贡献。本方法称为xDeep - AcPEP,将有助于在用于治疗目的的合理肽设计中识别有效的抗癌肽。数据、用于重现实验的脚本文件以及最终预测模型可从http://github.com/chen709847237/xDeep - AcPEP下载。直接访问此预测方法的网络服务器位于https://app.cbbio.online/acpep/home。