Kaviar: an accessible system for testing SNV novelty.

Author affiliation

Institute for Systems Biology, Seattle, WA 98109, USA.

Publication information

Bioinformatics. 2011 Nov 15;27(22):3216-7. doi: 10.1093/bioinformatics/btr540. Epub 2011 Sep 28.

Abstract

With the rapidly expanding availability of data from personal genomes, exomes and transcriptomes, medical researchers will frequently need to test whether observed genomic variants are novel or known. This task requires downloading and handling large and diverse datasets from a variety of sources, and processing them with bioinformatics tools and pipelines. Alternatively, researchers can upload data to online tools, which may conflict with privacy requirements. We present here Kaviar, a tool that greatly simplifies the assessment of novel variants. Kaviar includes: (i) an integrated and growing database of genomic variation from diverse sources, including over 55 million variants from personal genomes, family genomes, transcriptomes, SNV databases and population surveys; and (ii) software for querying the database efficiently.
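The novelty test described in the abstract can be illustrated with a minimal sketch. This is not Kaviar's implementation or data format, which the abstract does not describe. It assumes a variant can be identified by chromosome, 1-based position and alternate allele, and that the known-variant collection fits in an in-memory dictionary. The names `build_index` and `classify` and all records are hypothetical. Because the lookup runs locally, observed variants never leave the researcher's machine.

```python
from typing import Dict, Iterable, List, Set, Tuple

# Assumption: a variant is keyed by (chromosome, 1-based position, alternate allele).
Variant = Tuple[str, int, str]


def build_index(known: Iterable[Tuple[str, int, str, str]]) -> Dict[Variant, Set[str]]:
    """Map each known variant to the set of sources that report it."""
    index: Dict[Variant, Set[str]] = {}
    for chrom, pos, alt, source in known:
        index.setdefault((chrom, pos, alt.upper()), set()).add(source)
    return index


def classify(
    observed: Iterable[Variant], index: Dict[Variant, Set[str]]
) -> List[Tuple[Variant, List[str]]]:
    """Pair each observed variant with the sorted sources reporting it.

    An empty source list means the variant is novel with respect to the index.
    """
    results = []
    for chrom, pos, alt in observed:
        key = (chrom, pos, alt.upper())
        results.append((key, sorted(index.get(key, set()))))
    return results


# Hypothetical records, invented purely for illustration.
known_records = [
    ("chr1", 12345, "A", "source_A"),
    ("chr1", 12345, "A", "source_B"),
    ("chr2", 67890, "T", "source_A"),
]
index = build_index(known_records)
report = classify([("chr1", 12345, "a"), ("chr3", 111, "G")], index)

for variant, sources in report:
    status = "known" if sources else "novel"
    print(variant, status, sources)
```

Recording the reporting sources alongside each variant, rather than a plain yes/no flag, lets a researcher judge how well supported a "known" call is. A collection of tens of millions of variants would likely need an on-disk or indexed store instead of a plain dictionary.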
