Sampling the Arabidopsis transcriptome with massively parallel pyrosequencing.

Author information

Weber Andreas P M, Weber Katrin L, Carr Kevin, Wilkerson Curtis, Ohlrogge John B

Affiliation

Department of Plant Biology, Michigan State University, East Lansing, MI 48824-1312, USA.

Publication information

Plant Physiol. 2007 May;144(1):32-42. doi: 10.1104/pp.107.096677. Epub 2007 Mar 9.

Abstract

Massively parallel sequencing of DNA by pyrosequencing technology offers much higher throughput and lower cost than conventional Sanger sequencing. Although extensively used already for sequencing of genomes, relatively few applications of massively parallel pyrosequencing to transcriptome analysis have been reported. To test the ability of this technology to provide unbiased representation of transcripts, we analyzed mRNA from Arabidopsis (Arabidopsis thaliana) seedlings. Two sequencing runs yielded 541,852 expressed sequence tags (ESTs) after quality control. Mapping of the ESTs to the Arabidopsis genome and to The Arabidopsis Information Resource 7.0 cDNA models indicated: (1) massively parallel pyrosequencing detected transcription of 17,449 gene loci providing very deep coverage of the transcriptome. Performing a second sequencing run only increased the number of genes identified by 10%, but increased the overall sequence coverage by 50%. (2) Mapping of the ESTs to their predicted full-length transcripts indicated that all regions of the transcript were well represented regardless of transcript length or expression level. Furthermore, short, medium, and long transcripts were equally represented. (3) Over 16,000 of the ESTs that mapped to the genome were not represented in the existing dbEST database. In some cases, the ESTs provide the first experimental evidence for transcripts derived from predicted genes, and, for at least 60 locations in the genome, pyrosequencing identified likely protein-coding sequences that are not now annotated as genes. Together, the results indicate massively parallel pyrosequencing provides novel information helpful to improve the annotation of the Arabidopsis genome. Furthermore, the unbiased representation of transcripts will be particularly useful for gene discovery and gene expression analysis of nonmodel plants with less complete genomic information. 
EST sequence accession numbers in GenBank are EH 795234 through EH 995233 and EL 000001 through EL 341852.
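
As a quick consistency check on the figures above (not part of the original abstract), the two GenBank accession ranges can be counted and compared with the 541,852 ESTs reported. This is a minimal sketch that assumes both ranges are inclusive and contiguous; the names `RANGES`, `range_size`, and `total_accessions` are illustrative only.

```python
# Accession ranges as quoted in the abstract: (prefix, first, last).
# Assumption: each range is inclusive and has no gaps.
RANGES = [("EH", 795234, 995233), ("EL", 1, 341852)]


def range_size(first: int, last: int) -> int:
    """Number of accessions in an inclusive numeric range."""
    return last - first + 1


def total_accessions(ranges) -> int:
    """Sum the sizes of all (prefix, first, last) ranges."""
    return sum(range_size(first, last) for _, first, last in ranges)


if __name__ == "__main__":
    for prefix, first, last in RANGES:
        print(f"{prefix} {first:06d} to {prefix} {last:06d}: {range_size(first, last):,}")
    print(f"total: {total_accessions(RANGES):,}")
```

Under those assumptions the EH range holds 200,000 accessions and the EL range 341,852, which together match the 541,852 ESTs stated in the abstract.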
