Genome Center, University of California, Davis, Davis, California, USA.
Department of Cell Biology and Molecular genetics, Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, Maryland, USA.
Proteins. 2021 Dec;89(12):1987-1996. doi: 10.1002/prot.26231. Epub 2021 Oct 5.
Critical Assessment of Structure Prediction (CASP) is an organization aimed at advancing the state of the art in computing protein structure from sequence. In the spring of 2020, CASP launched a community project to compute the structures of the most structurally challenging proteins coded for in the SARS-CoV-2 genome. Forty-seven research groups submitted over 3000 three-dimensional models and 700 sets of accuracy estimates on 10 proteins. The resulting models were released to the public. CASP community members also worked together to provide estimates of local and global accuracy and identify structure-based domain boundaries for some proteins. Subsequently, two of these structures (ORF3a and ORF8) have been solved experimentally, allowing assessment of both model quality and the accuracy estimates. Models from the AlphaFold2 group were found to have good agreement with the experimental structures, with main chain GDT_TS accuracy scores ranging from 63 (a correct topology) to 87 (competitive with experiment).
结构预测评估 (Critical Assessment of Structure Prediction, CASP) 是一个旨在推动从序列计算蛋白质结构的最新技术的组织。2020 年春天,CASP 发起了一个社区项目,旨在计算编码在 SARS-CoV-2 基因组中的最具挑战性的蛋白质的结构。47 个研究小组提交了超过 3000 个三维模型和 700 组关于 10 种蛋白质的准确性估计。这些模型被发布到了公众面前。CASP 社区成员还共同努力,提供了对局部和全局准确性的估计,并确定了一些蛋白质的基于结构的结构域边界。随后,这两种结构(ORF3a 和 ORF8)已经通过实验得到解决,从而可以评估模型质量和准确性估计。来自 AlphaFold2 小组的模型与实验结构吻合较好,主链 GDT_TS 准确性得分范围从 63(正确的拓扑结构)到 87(与实验相当)。