一种非模式腹足动物（黑唇蜒螺）转录组的组装与注释：从头组装器的比较

Assembly and annotation of a non-model gastropod (Nerita melanotragus) transcriptome: a comparison of de novo assemblers.

作者信息

Amin Shorash, Prentis Peter J, Gilding Edward K, Pavasovic Ana

机构信息

School of Biomedical Sciences, Faculty of Health, Queensland University of Technology, GPO Box 2434, Brisbane, Qld 4001, Australia.

出版信息

BMC Res Notes. 2014 Aug 1;7:488. doi: 10.1186/1756-0500-7-488.

DOI:10.1186/1756-0500-7-488

PMID:25084827

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4124492/

Abstract

BACKGROUND

The sequencing, de novo assembly and annotation of transcriptome datasets generated with next generation sequencing (NGS) has enabled biologists to answer genomic questions in non-model species with unprecedented ease. Reliable and accurate de novo assembly and annotation of transcriptomes, however, is a critically important step for transcriptome assemblies generated from short read sequences. Typical benchmarks for assembly and annotation reliability have been performed with model species. To address the reliability and accuracy of de novo transcriptome assembly in non-model species, we generated an RNAseq dataset for an intertidal gastropod mollusc species, Nerita melanotragus, and compared the assembly produced by four different de novo transcriptome assemblers; Velvet, Oases, Geneious and Trinity, for a number of quality metrics and redundancy.

RESULTS

Transcriptome sequencing on the Ion Torrent PGM™ produced 1,883,624 raw reads with a mean length of 133 base pairs (bp). Both the Trinity and Oases de novo assemblers produced the best assemblies based on all quality metrics including fewer contigs, increased N50 and average contig length and contigs of greater length. Overall the BLAST and annotation success of our assemblies was not high with only 15-19% of contigs assigned a putative function.

CONCLUSIONS

We believe that any improvement in annotation success of gastropod species will require more gastropod genome sequences, but in particular an increase in mollusc protein sequences in public databases. Overall, this paper demonstrates that reliable and accurate de novo transcriptome assemblies can be generated from short read sequencers with the right assembly algorithms.

摘要

背景

利用新一代测序（NGS）生成的转录组数据集进行测序、从头组装和注释，使生物学家能够以前所未有的轻松方式回答非模式物种中的基因组问题。然而，对于从短读长序列生成的转录组组装而言，可靠且准确的从头组装和注释是至关重要的一步。典型的组装和注释可靠性基准测试是在模式物种上进行的。为了评估非模式物种中从头转录组组装的可靠性和准确性，我们为一种潮间带腹足纲软体动物黑凹螺（Nerita melanotragus）生成了一个RNAseq数据集，并比较了四种不同的从头转录组组装程序（Velvet、Oases、Geneious和Trinity）产生的组装结果，涉及多个质量指标和冗余情况。

结果

在Ion Torrent PGM™上进行的转录组测序产生了1,883,624条原始读段，平均长度为133个碱基对（bp）。基于所有质量指标，包括更少的重叠群、增加的N50和平均重叠群长度以及更长的重叠群，Trinity和Oases从头组装程序都产生了最佳组装结果。总体而言，我们组装结果的BLAST和注释成功率不高，只有15 - 19%的重叠群被赋予了推定功能。

结论

我们认为，腹足纲物种注释成功率的任何提高都将需要更多的腹足纲基因组序列，特别是公共数据库中软体动物蛋白质序列的增加。总体而言，本文表明，使用合适的组装算法，可以从短读长测序仪生成可靠且准确的从头转录组组装。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4109/4124492/2e328b192c4d/1756-0500-7-488-1.jpg

相似文献

Assembly and annotation of a non-model gastropod (Nerita melanotragus) transcriptome: a comparison of de novo assemblers.

BMC Res Notes. 2014 Aug 1;7:488. doi: 10.1186/1756-0500-7-488.

Combining transcriptome assemblies from multiple de novo assemblers in the allo-tetraploid plant Nicotiana benthamiana.

PLoS One. 2014 Mar 10;9(3):e91776. doi: 10.1371/journal.pone.0091776. eCollection 2014.

Short read Illumina data for the de novo assembly of a non-model snail species transcriptome (Radix balthica, Basommatophora, Pulmonata), and a comparison of assembler performance.

BMC Genomics. 2011 Jun 16;12:317. doi: 10.1186/1471-2164-12-317.

Comparison of De Novo Transcriptome Assemblers and k-mer Strategies Using the Killifish, Fundulus heteroclitus.

PLoS One. 2016 Apr 7;11(4):e0153104. doi: 10.1371/journal.pone.0153104. eCollection 2016.

Inferring bona fide transfrags in RNA-Seq derived-transcriptome assemblies of non-model organisms.

BMC Bioinformatics. 2015 Feb 21;16(1):58. doi: 10.1186/s12859-015-0492-5.

Challenges and advances for transcriptome assembly in non-model species.

PLoS One. 2017 Sep 20;12(9):e0185020. doi: 10.1371/journal.pone.0185020. eCollection 2017.

Optimization of de novo transcriptome assembly from high-throughput short read sequencing data improves functional annotation for non-model organisms.

BMC Bioinformatics. 2012 Jul 18;13:170. doi: 10.1186/1471-2105-13-170.

Comparative performance of transcriptome assembly methods for non-model organisms.

BMC Genomics. 2016 Jul 27;17:523. doi: 10.1186/s12864-016-2923-8.

Comparison of assembly algorithms for improving rate of metatranscriptomic functional annotation.

Microbiome. 2014 Oct 28;2:39. doi: 10.1186/2049-2618-2-39. eCollection 2014.

Comparisons of de novo transcriptome assemblers in diploid and polyploid species using peanut (Arachis spp.) RNA-Seq data.

PLoS One. 2014 Dec 31;9(12):e115055. doi: 10.1371/journal.pone.0115055. eCollection 2014.

引用本文的文献

De novo assembly of plasmodium interspersed repeat (pir) genes from Plasmodium vivax RNAseq data suggests geographic conservation of sub-family transcription.

BMC Genomics. 2025 May 29;26(1):544. doi: 10.1186/s12864-025-11752-1.

Normalized Workflow to Optimize Hybrid De Novo Transcriptome Assembly for Non-Model Species: A Case Study in (Baker) Boiss.

Plants (Basel). 2022 Sep 10;11(18):2365. doi: 10.3390/plants11182365.

Transcriptional Analyses of Acute Exposure to Methylmercury on Erythrocytes of Loggerhead Sea Turtle.

Toxics. 2021 Mar 29;9(4):70. doi: 10.3390/toxics9040070.

Evaluation of Seven Different RNA-Seq Alignment Tools Based on Experimental Data from the Model Plant .

Int J Mol Sci. 2020 Mar 3;21(5):1720. doi: 10.3390/ijms21051720.

Comparative Analysis of Strategies for Transcriptome Assembly in Prokaryotes: as a Case Study.

High Throughput. 2019 Nov 30;8(4):20. doi: 10.3390/ht8040020.

Auxin controls circadian flower opening and closure in the waterlily.

BMC Plant Biol. 2018 Jul 11;18(1):143. doi: 10.1186/s12870-018-1357-7.

Comparative molecular analyses of select pH- and osmoregulatory genes in three freshwater crayfish , and .

PeerJ. 2017 Aug 24;5:e3623. doi: 10.7717/peerj.3623. eCollection 2017.

The transcriptome of a "sleeping" invader: de novo assembly and annotation of the transcriptome of aestivating Cornu aspersum.

BMC Genomics. 2017 Jun 28;18(1):491. doi: 10.1186/s12864-017-3885-1.

Combining independent de novo assemblies optimizes the coding transcriptome for nonconventional model eukaryotic organisms.

BMC Bioinformatics. 2016 Dec 9;17(1):525. doi: 10.1186/s12859-016-1406-x.

Transcriptomic Analysis of the Endangered Neritid Species Clithon retropictus: De Novo Assembly, Functional Annotation, and Marker Discovery.

Genes (Basel). 2016 Jul 22;7(7):35. doi: 10.3390/genes7070035.

本文引用的文献

Transcriptome analyses and differential gene expression in a non-model fish species with alternative mating tactics.

BMC Genomics. 2014 Feb 28;15:167. doi: 10.1186/1471-2164-15-167.

454 pyrosequencing-based analysis of gene expression profiles in the amphipod Melita plumulosa: transcriptome assembly and toxicant induced changes.

Aquat Toxicol. 2014 Aug;153:73-88. doi: 10.1016/j.aquatox.2013.11.022. Epub 2013 Dec 12.

De novo assembly of the transcriptome of the non-model plant Streptocarpus rexii employing a novel heuristic to recover locus-specific transcript clusters.

PLoS One. 2013 Dec 6;8(12):e80961. doi: 10.1371/journal.pone.0080961. eCollection 2013.

Illumina-based de novo transcriptome sequencing and analysis of Amanita exitialis basidiocarps.

Gene. 2013 Dec 10;532(1):63-71. doi: 10.1016/j.gene.2013.09.014. Epub 2013 Sep 17.

SNP detection from de novo transcriptome sequencing in the bivalve Macoma balthica: marker development for evolutionary studies.

PLoS One. 2012;7(12):e52302. doi: 10.1371/journal.pone.0052302. Epub 2012 Dec 26.

Transcriptomic responses to salinity stress in the Pacific oyster Crassostrea gigas.

PLoS One. 2012;7(9):e46244. doi: 10.1371/journal.pone.0046244. Epub 2012 Sep 27.

De novo sequencing and transcriptome analysis of the central nervous system of mollusc Lymnaea stagnalis by deep RNA sequencing.

PLoS One. 2012;7(8):e42546. doi: 10.1371/journal.pone.0042546. Epub 2012 Aug 1.

Transcriptome profiles link environmental variation and physiological response of Mytilus californianus between Pacific tides.

Funct Ecol. 2012 Feb 1;26(1):144-155. doi: 10.1111/j.1365-2435.2011.01924.x. Epub 2011 Oct 13.

Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data.

Bioinformatics. 2012 Jun 15;28(12):1647-9. doi: 10.1093/bioinformatics/bts199. Epub 2012 Apr 27.

Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels.

Bioinformatics. 2012 Apr 15;28(8):1086-92. doi: 10.1093/bioinformatics/bts094. Epub 2012 Feb 24.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种非模式腹足动物（黑唇蜒螺）转录组的组装与注释：从头组装器的比较

Assembly and annotation of a non-model gastropod (Nerita melanotragus) transcriptome: a comparison of de novo assemblers.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献