Department of Pharmaceutical Sciences, University of Michigan, Ann Arbor, MI, 48109, USA.
Biointerfaces Institute, University of Michigan, Ann Arbor, MI, 48109, USA.
Nat Commun. 2022 Jul 1;13(1):3788. doi: 10.1038/s41467-022-31457-3.
Therapeutic antibody development requires selection and engineering of molecules with high affinity and other drug-like biophysical properties. Co-optimization of multiple antibody properties remains a difficult and time-consuming process that impedes drug development. Here we evaluate the use of machine learning to simplify antibody co-optimization for a clinical-stage antibody (emibetuzumab) that displays high levels of both on-target (antigen) and off-target (non-specific) binding. We mutate sites in the antibody complementarity-determining regions, sort the antibody libraries for high and low levels of affinity and non-specific binding, and deep sequence the enriched libraries. Interestingly, machine learning models trained on datasets with binary labels enable predictions of continuous metrics that are strongly correlated with antibody affinity and non-specific binding. These models illustrate strong tradeoffs between these two properties, as increases in affinity along the co-optimal (Pareto) frontier require progressive reductions in specificity. Notably, models trained with deep learning features enable prediction of novel antibody mutations that co-optimize affinity and specificity beyond what is possible for the original antibody library. These findings demonstrate the power of machine learning models to greatly expand the exploration of novel antibody sequence space and accelerate the development of highly potent, drug-like antibodies.
治疗性抗体的开发需要选择和工程化具有高亲和力和其他类似药物的物理化学性质的分子。多个抗体性质的共同优化仍然是一个困难和耗时的过程,阻碍了药物的开发。在这里,我们评估了使用机器学习来简化临床阶段抗体(emibetuzumab)的抗体共同优化,该抗体显示出高水平的靶标(抗原)和非靶标(非特异性)结合。我们突变抗体互补决定区的位点,对高亲和力和非特异性结合的抗体文库进行分类,并对富集的文库进行深度测序。有趣的是,基于二进制标签的数据集训练的机器学习模型能够预测与抗体亲和力和非特异性结合具有强相关性的连续指标。这些模型说明了这两种特性之间的强烈权衡,因为沿着共同优化(Pareto)前沿的亲和力增加需要特异性的逐步降低。值得注意的是,使用深度学习特征训练的模型能够预测新的抗体突变,这些突变能够在原始抗体文库可能的范围内共同优化亲和力和特异性。这些发现证明了机器学习模型的强大功能,可以大大扩展新型抗体序列空间的探索,并加速高效、类似药物的抗体的开发。