Biomedical Informatics Training Program, Stanford University, Stanford, California, United States of America.
Department of Biomedical and Pharmaceutical Sciences, University of Montana, Missoula, Montana, United States of America.
PLoS Comput Biol. 2020 Nov 2;16(11):e1008399. doi: 10.1371/journal.pcbi.1008399. eCollection 2020 Nov.
Cytochrome P450 2D6 (CYP2D6) is a highly polymorphic gene whose protein product metabolizes more than 20% of clinically used drugs. Genetic variations in CYP2D6 are responsible for interindividual heterogeneity in drug response that can lead to drug toxicity and ineffective treatment, making CYP2D6 one of the most important pharmacogenes. Prediction of CYP2D6 phenotype relies on curation of literature-derived functional studies to assign a functional status to CYP2D6 haplotypes. As the number of large-scale sequencing efforts grows, new haplotypes continue to be discovered, and assignment of function is challenging to maintain. To address this challenge, we have trained a convolutional neural network to predict functional status of CYP2D6 haplotypes, called Hubble.2D6. Hubble.2D6 predicts haplotype function from sequence data and was trained using two pre-training steps with a combination of real and simulated data. We find that Hubble.2D6 predicts CYP2D6 haplotype functional status with 88% accuracy in a held-out test set and explains 47.5% of the variance in in vitro functional data among star alleles with unknown function. Hubble.2D6 may be a useful tool for assigning function to haplotypes with uncurated function, and used for screening individuals who are at risk of being poor metabolizers.
细胞色素 P450 2D6(CYP2D6)是一个高度多态性的基因,其蛋白产物代谢超过 20%的临床使用药物。CYP2D6 的遗传变异导致药物反应的个体间异质性,可能导致药物毒性和治疗无效,使 CYP2D6 成为最重要的药物代谢基因之一。CYP2D6 表型的预测依赖于对文献来源的功能研究的整理,以赋予 CYP2D6 单倍型的功能状态。随着大规模测序工作的数量增加,新的单倍型不断被发现,功能的分配难以维持。为了解决这一挑战,我们训练了一个卷积神经网络来预测 CYP2D6 单倍型的功能状态,称为 Hubble.2D6。Hubble.2D6 从序列数据预测单倍型功能,并通过使用真实和模拟数据的组合进行两个预训练步骤进行训练。我们发现,Hubble.2D6 在一个独立的测试集中预测 CYP2D6 单倍型功能状态的准确率为 88%,并解释了未知功能的星等位基因中体外功能数据的 47.5%的方差。Hubble.2D6 可能是一种有用的工具,用于分配未经过整理的功能的单倍型,并用于筛选可能是代谢不良的个体。