Suppr超能文献

美国国立生物技术信息中心的参考序列(RefSeq)数据库:当前状态、分类扩展及功能注释。

Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation.

作者信息

O'Leary Nuala A, Wright Mathew W, Brister J Rodney, Ciufo Stacy, Haddad Diana, McVeigh Rich, Rajput Bhanu, Robbertse Barbara, Smith-White Brian, Ako-Adjei Danso, Astashyn Alexander, Badretdin Azat, Bao Yiming, Blinkova Olga, Brover Vyacheslav, Chetvernin Vyacheslav, Choi Jinna, Cox Eric, Ermolaeva Olga, Farrell Catherine M, Goldfarb Tamara, Gupta Tripti, Haft Daniel, Hatcher Eneida, Hlavina Wratko, Joardar Vinita S, Kodali Vamsi K, Li Wenjun, Maglott Donna, Masterson Patrick, McGarvey Kelly M, Murphy Michael R, O'Neill Kathleen, Pujar Shashikant, Rangwala Sanjida H, Rausch Daniel, Riddick Lillian D, Schoch Conrad, Shkeda Andrei, Storz Susan S, Sun Hanzhen, Thibaud-Nissen Francoise, Tolstoy Igor, Tully Raymond E, Vatsan Anjana R, Wallin Craig, Webb David, Wu Wendy, Landrum Melissa J, Kimchi Avi, Tatusova Tatiana, DiCuccio Michael, Kitts Paul, Murphy Terence D, Pruitt Kim D

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA.

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA

出版信息

Nucleic Acids Res. 2016 Jan 4;44(D1):D733-45. doi: 10.1093/nar/gkv1189. Epub 2015 Nov 8.

Abstract

The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management.

相似文献

1
Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation.
Nucleic Acids Res. 2016 Jan 4;44(D1):D733-45. doi: 10.1093/nar/gkv1189. Epub 2015 Nov 8.
2
NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy.
Nucleic Acids Res. 2012 Jan;40(Database issue):D130-5. doi: 10.1093/nar/gkr1079. Epub 2011 Nov 24.
3
NCBI Reference Sequences: current status, policy and new initiatives.
Nucleic Acids Res. 2009 Jan;37(Database issue):D32-6. doi: 10.1093/nar/gkn721. Epub 2008 Oct 16.
4
NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D501-4. doi: 10.1093/nar/gki025.
5
RefSeq: an update on mammalian reference sequences.
Nucleic Acids Res. 2014 Jan;42(Database issue):D756-63. doi: 10.1093/nar/gkt1114. Epub 2013 Nov 19.
6
Comparison of RefSeq protein-coding regions in human and vertebrate genomes.
BMC Genomics. 2013 Sep 25;14:654. doi: 10.1186/1471-2164-14-654.
7
NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.
Nucleic Acids Res. 2007 Jan;35(Database issue):D61-5. doi: 10.1093/nar/gkl842. Epub 2006 Nov 27.
8
RefSeq microbial genomes database: new representation and annotation strategy.
Nucleic Acids Res. 2014 Jan;42(Database issue):D553-9. doi: 10.1093/nar/gkt1274. Epub 2013 Dec 6.
9
RefSeq: an update on prokaryotic genome annotation and curation.
Nucleic Acids Res. 2018 Jan 4;46(D1):D851-D860. doi: 10.1093/nar/gkx1068.
10
Gene: a gene-centered information resource at NCBI.
Nucleic Acids Res. 2015 Jan;43(Database issue):D36-42. doi: 10.1093/nar/gku1055. Epub 2014 Oct 29.

引用本文的文献

1
Mutual Antagonism Between PRC1 Condensates and SWI/SNF in Chromatin Regulation.
bioRxiv. 2025 Aug 26:2025.08.25.672128. doi: 10.1101/2025.08.25.672128.
2
Deciphering enzymatic potential in metagenomic reads through DNA language models.
Nucleic Acids Res. 2025 Aug 27;53(16). doi: 10.1093/nar/gkaf836.
4
Loss of multiple micro-RNAs uncovers multi-level restructuring of gene regulation in rodents.
BMC Genomics. 2025 Sep 2;26(1):800. doi: 10.1186/s12864-025-11815-3.
6
RNADecayCafe, a uniformly processed atlas of RNA half-life estimates across multiple human cell lines.
bioRxiv. 2025 Aug 21:2025.08.19.671151. doi: 10.1101/2025.08.19.671151.
7
Pathogenic variation underlying rare diseases in an Arab population: Implications for screening programs.
Genet Med Open. 2025 Jul 19;3:103446. doi: 10.1016/j.gimo.2025.103446. eCollection 2025.
8
MitoCOMON: whole mitochondrial DNA sequencing by primer design and long overlapping amplicon assembly.
BMC Genomics. 2025 Aug 30;26(1):787. doi: 10.1186/s12864-025-12010-0.

本文引用的文献

1
Long non-coding RNA HOTAIR: A novel oncogene (Review).
Mol Med Rep. 2015 Oct;12(4):5611-8. doi: 10.3892/mmr.2015.4161. Epub 2015 Jul 31.
2
Mouse genome annotation by the RefSeq project.
Mamm Genome. 2015 Oct;26(9-10):379-90. doi: 10.1007/s00335-015-9585-8. Epub 2015 Jul 28.
3
RefSeq curation and annotation of antizyme and antizyme inhibitor genes in vertebrates.
Nucleic Acids Res. 2015 Sep 3;43(15):7270-9. doi: 10.1093/nar/gkv713. Epub 2015 Jul 13.
5
SCIENTIFIC STANDARDS. Promoting an open research culture.
Science. 2015 Jun 26;348(6242):1422-5. doi: 10.1126/science.aab2374.
6
ZFIN, The zebrafish model organism database: Updates and new directions.
Genesis. 2015 Aug;53(8):498-509. doi: 10.1002/dvg.22868. Epub 2015 Jul 8.
7
Past, present, and future of arenavirus taxonomy.
Arch Virol. 2015 Jul;160(7):1851-74. doi: 10.1007/s00705-015-2418-y.
8
Ratification vote on taxonomic proposals to the International Committee on Taxonomy of Viruses (2015).
Arch Virol. 2015 Jul;160(7):1837-50. doi: 10.1007/s00705-015-2425-z.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验