Suppr超能文献

基于 QSAR 模型的化合物与人血浆蛋白结合的计算预测。

In Silico Prediction of Compounds Binding to Human Plasma Proteins by QSAR Models.

机构信息

Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.

出版信息

ChemMedChem. 2018 Mar 20;13(6):572-581. doi: 10.1002/cmdc.201700582. Epub 2017 Nov 10.

Abstract

Plasma protein binding (PPB) is a significant pharmacokinetic property of compounds in drug discovery and design. Due to the high cost and time-consuming nature of experimental assays, in silico approaches have been developed to assess the binding profiles of chemicals. However, because of unambiguity and the lack of uniform experimental data, most available predictive models are far from satisfactory. In this study, an elaborately curated training set containing 967 diverse pharmaceuticals with plasma-protein-bound fractions (f ) was used to construct quantitative structure-activity relationship (QSAR) models by six machine learning algorithms with 26 molecular descriptors. Furthermore, we combined all of the individual learners to yield consensus prediction, marginally improving the accuracy of the consensus model. The model performance was estimated by tenfold cross validation and three external validation sets comprising 242 pharmaceutical, 397 industrial, and 231 newly designed chemicals, respectively. The models showed excellent performance for the entire test set, with mean absolute error (MAE) ranging from 0.126 to 0.178, demonstrating that our models could be used by a chemist when drawing a molecular structure from scratch. Meanwhile, structural descriptors contributing significantly to the predictive power of the models were related to the binding mechanisms, and the trend in terms of their effects on PPB can serve as guidance for the structural modification of chemicals. The applicability domain was also defined to distinguish favorable predictions from unfavorable predictions.

摘要

血浆蛋白结合(PPB)是药物发现和设计中化合物的一个重要药代动力学特性。由于实验测定的成本高且耗时,因此开发了计算方法来评估化学物质的结合特征。然而,由于不明确性和缺乏统一的实验数据,大多数可用的预测模型远不能令人满意。在这项研究中,使用精心编辑的训练集,其中包含 967 种具有不同的血浆蛋白结合分数(f)的药物,通过 6 种机器学习算法和 26 个分子描述符构建定量构效关系(QSAR)模型。此外,我们将所有个体学习者结合起来,产生共识预测,略微提高了共识模型的准确性。通过十折交叉验证和三个外部验证集(分别包含 242 种药物、397 种工业化学品和 231 种新设计的化学品)来评估模型性能。模型对整个测试集的性能表现出色,平均绝对误差(MAE)范围为 0.126 至 0.178,表明我们的模型可以帮助化学家在从头开始绘制分子结构时使用。同时,对模型预测能力有显著贡献的结构描述符与结合机制有关,这些描述符对 PPB 的影响趋势可以为化学物质的结构修饰提供指导。还定义了适用性域,以区分有利的预测和不利的预测。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验