Suppr超能文献

通用蛋白质知识库/瑞士蛋白质数据库

UniProtKB/Swiss-Prot.

作者信息

Boutet Emmanuel, Lieberherr Damien, Tognolli Michael, Schneider Michel, Bairoch Amos

机构信息

Swiss Institute of Bioinformatics, Centre Medical Universitaire, Geneva, Switzerland.

出版信息

Methods Mol Biol. 2007;406:89-112. doi: 10.1007/978-1-59745-535-0_4.

Abstract

The Swiss Institute of Bioinformatics (SIB), the European Bioinformatics Institute (EBI), and the Protein Information Resource (PIR) form the Universal Protein Resource (UniProt) consortium. Its main goal is to provide the scientific community with a central resource for protein sequences and functional information. The UniProt consortium maintains the UniProt KnowledgeBase (UniProtKB) and several supplementary databases including the UniProt Reference Clusters (UniRef) and the UniProt Archive (UniParc). (1) UniProtKB is a comprehensive protein sequence knowledgebase that consists of two sections: UniProtKB/Swiss-Prot, which contains manually annotated entries, and UniProtKB/TrEMBL, which contains computer-annotated entries. UniProtKB/Swiss-Prot entries contain information curated by biologists and provide users with cross-links to about 100 external databases and with access to additional information or tools. (2) The UniRef databases (UniRef100, UniRef90, and UniRef50) define clusters of protein sequences that share 100, 90, or 50% identity. (3) The UniParc database stores and maps all publicly available protein sequence data, including obsolete data excluded from UniProtKB. The UniProt databases can be accessed online (http://www.uniprot.org/) or downloaded in several formats (ftp://ftp.uniprot.org/pub). New releases are published every 2 weeks. The purpose of this chapter is to present a guided tour of a UniProtKB/Swiss-Prot entry, paying particular attention to the specificities of plant protein annotation. We will also present some of the tools and databases that are linked to each entry.

摘要

瑞士生物信息学研究所(SIB)、欧洲生物信息学研究所(EBI)和蛋白质信息资源库(PIR)共同组成了通用蛋白质资源库(UniProt)联盟。其主要目标是为科学界提供一个蛋白质序列和功能信息的核心资源库。UniProt联盟维护着UniProt知识库(UniProtKB)以及几个补充数据库,包括UniProt参考簇(UniRef)和UniProt存档库(UniParc)。(1)UniProtKB是一个全面的蛋白质序列知识库,由两部分组成:UniProtKB/瑞士蛋白质数据库(UniProtKB/Swiss-Prot),其中包含人工注释的条目;以及UniProtKB/翻译后修饰数据库(UniProtKB/TrEMBL),其中包含计算机注释的条目。UniProtKB/瑞士蛋白质数据库条目包含由生物学家精心策划的信息,并为用户提供与约100个外部数据库的交叉链接,以及获取其他信息或工具的途径。(2)UniRef数据库(UniRef100、UniRef90和UniRef50)定义了具有100%、90%或50%序列同一性的蛋白质序列簇。(3)UniParc数据库存储并映射所有公开可用的蛋白质序列数据,包括从UniProtKB中排除的过时数据。UniProt数据库可在线访问(http://www.uniprot.org/)或以多种格式下载(ftp://ftp.uniprot.org/pub)。每两周发布一次新版本。本章的目的是对UniProtKB/瑞士蛋白质数据库条目进行一次引导式浏览,特别关注植物蛋白质注释的特点。我们还将介绍与每个条目相关联的一些工具和数据库。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验