Davila Ana, Xu Zichang, Li Songling, Rozewicki John, Wilamowski Jan, Kotelnikov Sergei, Kozakov Dima, Teraguchi Shunsuke, Standley Daron M
Research Institute for Microbial Diseases, Department of Genome Informatics, Osaka University, Suita 565-0871, Japan.
Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794-5252, USA.
Bioinform Adv. 2022 Mar 7;2(1):vbac015. doi: 10.1093/bioadv/vbac015. eCollection 2022.
The scoring of antibody-antigen docked poses starting from unbound homology models has not been systematically optimized for a large and diverse set of input sequences.
To address this need, we have developed AbAdapt, a webserver that accepts antibody and antigen sequences, models their 3D structures, predicts epitope and paratope, and then docks the modeled structures using two established docking engines (Piper and Hex). Each of the key steps has been optimized by developing and training new machine-learning models. The sequences from a diverse set of 622 antibody-antigen pairs with known structure were used as inputs for leave-one-out cross-validation. The final set of cluster representatives included at least one 'Adequate' pose for 550/622 (88.4%) of the queries. The median (interquartile range) ranks of these 'Adequate' poses were 22 (5-77). Similar results were obtained on a holdout set of 100 unrelated antibody-antigen pairs. When epitopes were repredicted using docking-derived features for specific antibodies, the median ROC AUC increased from 0.679 to 0.720 in cross-validation and from 0.694 to 0.730 in the holdout set.
AbAdapt and related data are available at https://sysimm.org/abadapt/.
Supplementary data are available at online.
从未结合的同源模型开始对抗体 - 抗原对接构象进行评分,尚未针对大量不同的输入序列进行系统优化。
为满足这一需求,我们开发了AbAdapt,这是一个网络服务器,它接受抗体和抗原序列,对它们的三维结构进行建模,预测表位和互补位,然后使用两个既定的对接引擎(Piper和Hex)对接建模结构。每个关键步骤都通过开发和训练新的机器学习模型进行了优化。来自622对具有已知结构的不同抗体 - 抗原对的序列用作留一法交叉验证的输入。最终的聚类代表集对于550/622(88.4%)的查询至少包含一个“合适”构象。这些“合适”构象的中位数(四分位间距)排名为22(5 - 77)。在一组100个不相关抗体 - 抗原对组成的验证集上也获得了类似结果。当使用针对特定抗体的对接衍生特征重新预测表位时,交叉验证中中位数ROC AUC从0.679增加到0.720,在验证集中从0.694增加到0.730。
AbAdapt及相关数据可在https://sysimm.org/abadapt/获取。
补充数据可在网上获取。