Suppr超能文献

蛋白质数据库中碳水化合物分子的现代化统一表示。

Modernized uniform representation of carbohydrate molecules in the Protein Data Bank.

机构信息

Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA.

Rutgers Cancer Institute of New Jersey, Robert Wood Johnson Medical School, New Brunswick, NJ 08903, USA.

出版信息

Glycobiology. 2021 Sep 20;31(9):1204-1218. doi: 10.1093/glycob/cwab039.

Abstract

Since 1971, the Protein Data Bank (PDB) has served as the single global archive for experimentally determined 3D structures of biological macromolecules made freely available to the global community according to the FAIR principles of Findability-Accessibility-Interoperability-Reusability. During the first 50 years of continuous PDB operations, standards for data representation have evolved to better represent rich and complex biological phenomena. Carbohydrate molecules present in more than 14,000 PDB structures have recently been reviewed and remediated to conform to a new standardized format. This machine-readable data representation for carbohydrates occurring in the PDB structures and the corresponding reference data improves the findability, accessibility, interoperability and reusability of structural information pertaining to these molecules. The PDB Exchange MacroMolecular Crystallographic Information File data dictionary now supports (i) standardized atom nomenclature that conforms to International Union of Pure and Applied Chemistry-International Union of Biochemistry and Molecular Biology (IUPAC-IUBMB) recommendations for carbohydrates, (ii) uniform representation of branched entities for oligosaccharides, (iii) commonly used linear descriptors of carbohydrates developed by the glycoscience community and (iv) annotation of glycosylation sites in proteins. For the first time, carbohydrates in PDB structures are consistently represented as collections of standardized monosaccharides, which precisely describe oligosaccharide structures and enable improved carbohydrate visualization, structure validation, robust quantitative and qualitative analyses, search for dendritic structures and classification. The uniform representation of carbohydrate molecules in the PDB described herein will facilitate broader usage of the resource by the glycoscience community and researchers studying glycoproteins.

摘要

自 1971 年以来,蛋白质数据银行(PDB)一直是生物大分子实验确定的三维结构的全球唯一存档库,根据可发现性-可访问性-互操作性-可重用性(FAIR)原则免费向全球社区提供。在 PDB 连续运行的头 50 年中,数据表示标准不断发展,以更好地表示丰富而复杂的生物现象。最近,对 PDB 结构中存在的超过 14000 个碳水化合物分子进行了审查和修复,以符合新的标准化格式。这种用于 PDB 结构中碳水化合物的可机读数据表示以及相应的参考数据,提高了与这些分子相关的结构信息的可发现性、可访问性、互操作性和可重用性。PDB Exchange 大分子晶体学信息文件数据字典现在支持 (i) 符合国际纯粹与应用化学联合会-国际生物化学与分子生物学联合会(IUPAC-IUBMB)碳水化合物建议的标准化原子命名法,(ii) 寡糖分支实体的统一表示,(iii) 糖科学社区开发的常用线性碳水化合物描述符,以及 (iv) 蛋白质中糖基化位点的注释。这是第一次,PDB 结构中的碳水化合物被一致地表示为标准化单糖的集合,这些单糖精确描述了寡糖结构,并能够改善碳水化合物可视化、结构验证、稳健的定量和定性分析、树突状结构搜索和分类。本文所述 PDB 中碳水化合物分子的统一表示将促进糖科学社区和研究糖蛋白的研究人员更广泛地使用该资源。

相似文献

1
Modernized uniform representation of carbohydrate molecules in the Protein Data Bank.
Glycobiology. 2021 Sep 20;31(9):1204-1218. doi: 10.1093/glycob/cwab039.
2
The Protein Data Bank Archive.
Methods Mol Biol. 2021;2305:3-21. doi: 10.1007/978-1-0716-1406-8_1.
3
Enhanced validation of small-molecule ligands and carbohydrates in the Protein Data Bank.
Structure. 2021 Apr 1;29(4):393-400.e1. doi: 10.1016/j.str.2021.02.004. Epub 2021 Mar 2.
5
Data mining the protein data bank: automatic detection and assignment of carbohydrate structures.
Carbohydr Res. 2004 Apr 2;339(5):1015-20. doi: 10.1016/j.carres.2003.09.038.
6
Analysis and validation of carbohydrate three-dimensional structures.
Acta Crystallogr D Biol Crystallogr. 2009 Feb;65(Pt 2):156-68. doi: 10.1107/S0907444909001905. Epub 2009 Jan 20.
7
PDB explorer -- a web based algorithm for protein annotation viewer and 3D visualization.
Interdiscip Sci. 2014 Dec;6(4):279-84. doi: 10.1007/s12539-012-0044-x. Epub 2014 Aug 9.
8
Protein Data Bank Japan (PDBj): maintaining a structural data archive and resource description framework format.
Nucleic Acids Res. 2012 Jan;40(Database issue):D453-60. doi: 10.1093/nar/gkr811. Epub 2011 Oct 5.
10
IHMCIF: An Extension of the PDBx/mmCIF Data Standard for Integrative Structure Determination Methods.
J Mol Biol. 2024 Sep 1;436(17):168546. doi: 10.1016/j.jmb.2024.168546. Epub 2024 Mar 18.

引用本文的文献

1
Surface Plasmon Resonance for the Interaction of Capsular Polysaccharide (CPS) With KpACE.
Bio Protoc. 2025 Jun 20;15(12):e5346. doi: 10.21769/BioProtoc.5346.
2
Generating 3D Models of Carbohydrates with GLYCAM-Web.
bioRxiv. 2025 May 9:2025.05.08.652828. doi: 10.1101/2025.05.08.652828.
3
PDBe tools for an in-depth analysis of small molecules in the Protein Data Bank.
Protein Sci. 2025 Apr;34(4):e70084. doi: 10.1002/pro.70084.
6
Announcing the launch of Protein Data Bank China as an Associate Member of the Worldwide Protein Data Bank Partnership.
Acta Crystallogr D Struct Biol. 2023 Sep 1;79(Pt 9):792-795. doi: 10.1107/S2059798323006381. Epub 2023 Aug 10.
7
The catalytic domains of Streptococcus mutans glucosyltransferases: a structural analysis.
Acta Crystallogr F Struct Biol Commun. 2023 May 1;79(Pt 5):119-127. doi: 10.1107/S2053230X23003199. Epub 2023 May 5.

本文引用的文献

1
Enhanced validation of small-molecule ligands and carbohydrates in the Protein Data Bank.
Structure. 2021 Apr 1;29(4):393-400.e1. doi: 10.1016/j.str.2021.02.004. Epub 2021 Mar 2.
4
Cross-neutralization of SARS-CoV-2 by a human monoclonal SARS-CoV antibody.
Nature. 2020 Jul;583(7815):290-295. doi: 10.1038/s41586-020-2349-y. Epub 2020 May 18.
5
Impact of the Protein Data Bank on antineoplastic approvals.
Drug Discov Today. 2020 May;25(5):837-850. doi: 10.1016/j.drudis.2020.02.002. Epub 2020 Feb 14.
6
Current Status of Carbohydrates Information in the Protein Data Bank.
J Chem Inf Model. 2020 Feb 24;60(2):684-699. doi: 10.1021/acs.jcim.9b00874. Epub 2020 Jan 28.
7
Cryo-electron microscopy structures of human oligosaccharyltransferase complexes OST-A and OST-B.
Science. 2019 Dec 13;366(6471):1372-1375. doi: 10.1126/science.aaz3505.
8
Oligosaccharyltransferase: A Gatekeeper of Health and Tumor Progression.
Int J Mol Sci. 2019 Dec 2;20(23):6074. doi: 10.3390/ijms20236074.
9
PDBe: improved findability of macromolecular structure data in the PDB.
Nucleic Acids Res. 2020 Jan 8;48(D1):D335-D343. doi: 10.1093/nar/gkz990.
10
GlyGen: Computational and Informatics Resources for Glycoscience.
Glycobiology. 2020 Jan 28;30(2):72-73. doi: 10.1093/glycob/cwz080.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验