Ruepp Andreas, Brauner Barbara, Dunger-Kaltenbach Irmtraud, Frishman Goar, Montrone Corinna, Stransky Michael, Waegele Brigitte, Schmidt Thorsten, Doudieu Octave Noubibou, Stümpflen Volker, Mewes H Werner
Institute for Bioinformatics (MIPS), German Research Center for Environmental Health, Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany.
Nucleic Acids Res. 2008 Jan;36(Database issue):D646-50. doi: 10.1093/nar/gkm936. Epub 2007 Oct 26.
Protein complexes are key molecular entities that integrate multiple gene products to perform cellular functions. The CORUM (http://mips.gsf.de/genre/proj/corum/index.html) database is a collection of experimentally verified mammalian protein complexes. Information is derived manually by expert annotators through critical reading of the scientific literature. For each protein complex, the database records its name, subunits, literature references and function. For functional annotation, we use the FunCat catalogue, which makes it possible to organize the protein complex space into biologically meaningful subsets. The database contains more than 1750 protein complexes that are built from 2400 different genes, thus representing 12% of the protein-coding genes in humans. A web-based system is available to query, view and download the data. CORUM provides a comprehensive dataset of protein complexes for discoveries in systems biology, analyses of protein networks and protein complex-associated diseases. Comparable to the MIPS reference dataset of protein complexes from yeast, CORUM is intended to serve as a reference for mammalian protein complexes.
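To make the annotation model described above concrete, the following is a minimal sketch, not the actual CORUM schema or download format. The class `ComplexRecord`, its field names, and the helper functions are hypothetical, and the sample values are placeholders rather than real database entries. The sketch shows one way a complex with its name, subunits, references and FunCat labels could be represented, how complexes could be grouped into FunCat subsets, and how the gene coverage figure relates to the implied total (2400 genes at 12% corresponds to roughly 20,000 protein-coding genes).

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable, List, Set, Tuple


@dataclass(frozen=True)
class ComplexRecord:
    """Hypothetical record type; field names are illustrative only."""
    name: str                    # protein complex name
    subunits: Tuple[str, ...]    # gene identifiers of the subunits
    references: Tuple[str, ...]  # literature references
    funcat: Tuple[str, ...]      # FunCat category labels


def group_by_funcat(records: Iterable[ComplexRecord]) -> Dict[str, List[str]]:
    """Map each FunCat label to the names of the complexes annotated with it."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for rec in records:
        for label in rec.funcat:
            groups[label].append(rec.name)
    return dict(groups)


def distinct_genes(records: Iterable[ComplexRecord]) -> Set[str]:
    """Collect the distinct genes that occur as subunits in any complex."""
    return {gene for rec in records for gene in rec.subunits}


def implied_gene_total(covered_genes: int, percent: int) -> int:
    """Total gene count implied when covered_genes makes up percent of it."""
    return covered_genes * 100 // percent


# Placeholder data, invented purely for illustration.
records = [
    ComplexRecord("complex A", ("GENE1", "GENE2"), ("ref1",), ("category X",)),
    ComplexRecord("complex B", ("GENE2", "GENE3"), ("ref2",),
                  ("category X", "category Y")),
]

print(group_by_funcat(records))
print(sorted(distinct_genes(records)))
print(implied_gene_total(2400, 12))  # figures taken from the abstract
```

Counting distinct genes over all subunit lists, rather than summing subunit counts, matters here because one gene product can take part in several complexes.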