MetaPhOrs：使用基于一致性的置信分数，从多种系统发育证据预测直系同源和旁系同源。

MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score.

机构信息

Bioinformatics and Genomics Programme, Centre de Regulació Genòmica (CRG), Universitat Pompeu Fabra, Dr. Aiguader, 88. 08003, Barcelona, Spain.

出版信息

Nucleic Acids Res. 2011 Mar;39(5):e32. doi: 10.1093/nar/gkq953. Epub 2010 Dec 11.

DOI:10.1093/nar/gkq953

PMID:21149260

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3061081/

Abstract

Reliable prediction of orthology is central to comparative genomics. Approaches based on phylogenetic analyses closely resemble the original definition of orthology and paralogy and are known to be highly accurate. However, the large computational cost associated to these analyses is a limiting factor that often prevents its use at genomic scales. Recently, several projects have addressed the reconstruction of large collections of high-quality phylogenetic trees from which orthology and paralogy relationships can be inferred. This provides us with the opportunity to infer the evolutionary relationships of genes from multiple, independent, phylogenetic trees. Using such strategy, we combine phylogenetic information derived from different databases, to predict orthology and paralogy relationships for 4.1 million proteins in 829 fully sequenced genomes. We show that the number of independent sources from which a prediction is made, as well as the level of consistency across predictions, can be used as reliable confidence scores. A webserver has been developed to easily access these data (http://orthology.phylomedb.org), which provides users with a global repository of phylogeny-based orthology and paralogy predictions.

摘要

可靠的同源性预测是比较基因组学的核心。基于系统发育分析的方法与同源性和旁系同源性的原始定义非常相似，并且被证明具有高度的准确性。然而，这些分析所涉及的巨大计算成本是一个限制因素，常常阻止其在基因组规模上使用。最近，有几个项目致力于从大量高质量的系统发育树中重建，这些树可以推断出同源性和旁系同源性的关系。这为我们提供了从多个独立的系统发育树推断基因进化关系的机会。使用这种策略，我们结合了来自不同数据库的系统发育信息，为 829 个完全测序的基因组中的 410 万个蛋白质预测了同源性和旁系同源性的关系。我们表明，预测所来自的独立来源的数量以及预测之间的一致性水平可以用作可靠的置信分数。已经开发了一个网络服务器来方便地访问这些数据（http://orthology.phylomedb.org），该服务器为用户提供了基于系统发育的同源性和旁系同源性预测的全局存储库。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aead/3061081/36beb042d731/gkq953f1.jpg

相似文献

MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score.

Nucleic Acids Res. 2011 Mar;39(5):e32. doi: 10.1093/nar/gkq953. Epub 2010 Dec 11.

MetaPhOrs 2.0: integrative, phylogeny-based inference of orthology and paralogy across the tree of life.

Nucleic Acids Res. 2020 Jul 2;48(W1):W553-W557. doi: 10.1093/nar/gkaa282.

PhylomeDB v3.0: an expanding repository of genome-wide collections of trees, alignments and phylogeny-based orthology and paralogy predictions.

Nucleic Acids Res. 2011 Jan;39(Database issue):D556-60. doi: 10.1093/nar/gkq1109. Epub 2010 Nov 12.

PhylomeDB v4: zooming into the plurality of evolutionary histories of a genome.

Nucleic Acids Res. 2014 Jan;42(Database issue):D897-902. doi: 10.1093/nar/gkt1177. Epub 2013 Nov 25.

Phylogenetic reconstruction of orthology, paralogy, and conserved synteny for dog and human.

PLoS Comput Biol. 2006 Sep 29;2(9):e133. doi: 10.1371/journal.pcbi.0020133.

PhylomeDB: a database for genome-wide collections of gene phylogenies.

Nucleic Acids Res. 2008 Jan;36(Database issue):D491-6. doi: 10.1093/nar/gkm899. Epub 2007 Oct 25.

Inferring Orthology and Paralogy.

Methods Mol Biol. 2019;1910:149-175. doi: 10.1007/978-1-4939-9074-0_5.

PhylomeDB V5: an expanding repository for genome-wide catalogues of annotated gene phylogenies.

Nucleic Acids Res. 2022 Jan 7;50(D1):D1062-D1068. doi: 10.1093/nar/gkab966.

QuartetS-DB: a large-scale orthology database for prokaryotes and eukaryotes inferred by evolutionary evidence.

BMC Bioinformatics. 2012 Jun 22;13:143. doi: 10.1186/1471-2105-13-143.

QuartetS: a fast and accurate algorithm for large-scale orthology detection.

Nucleic Acids Res. 2011 Jul;39(13):e88. doi: 10.1093/nar/gkr308. Epub 2011 May 13.

引用本文的文献

Cross-species meta-analysis of transcriptome changes during the morula-to-blastocyst transition: metabolic and physiological changes take center stage.

Am J Physiol Cell Physiol. 2021 Dec 1;321(6):C913-C931. doi: 10.1152/ajpcell.00318.2021. Epub 2021 Oct 20.

A Prioritized and Validated Resource of Mitochondrial Proteins in Identifies Unique Biology.

mSphere. 2021 Oct 27;6(5):e0061421. doi: 10.1128/mSphere.00614-21. Epub 2021 Sep 8.

Role of epigenetics in unicellular to multicellular transition in Dictyostelium.

Genome Biol. 2021 May 4;22(1):134. doi: 10.1186/s13059-021-02360-9.

A Workflow for Selection of Single Nucleotide Polymorphic Markers for Studying of Genetics of Ischemic Stroke Outcomes.

Genes (Basel). 2021 Feb 25;12(3):328. doi: 10.3390/genes12030328.

Structure and function of the vacuolar Ccc1/VIT1 family of iron transporters and its regulation in fungi.

Comput Struct Biotechnol J. 2020 Nov 23;18:3712-3722. doi: 10.1016/j.csbj.2020.10.044. eCollection 2020.

Benchmarking orthology methods using phylogenetic patterns defined at the base of Eukaryotes.

Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa206.

In a Pair of Paralogous Isozymes Catalyze the First Committed Step of Leucine Biosynthesis in Either the Mitochondria or the Cytosol.

Front Microbiol. 2020 Aug 4;11:1843. doi: 10.3389/fmicb.2020.01843. eCollection 2020.

Draft genome of the European medicinal leech Hirudo medicinalis (Annelida, Clitellata, Hirudiniformes) with emphasis on anticoagulants.

Sci Rep. 2020 Jun 18;10(1):9885. doi: 10.1038/s41598-020-66749-5.

Extending the small-molecule similarity principle to all levels of biology with the Chemical Checker.

Nat Biotechnol. 2020 Sep;38(9):1087-1096. doi: 10.1038/s41587-020-0502-7. Epub 2020 May 18.

MetaPhOrs 2.0: integrative, phylogeny-based inference of orthology and paralogy across the tree of life.

Nucleic Acids Res. 2020 Jul 2;48(W1):W553-W557. doi: 10.1093/nar/gkaa282.

本文引用的文献

ETE: a python Environment for Tree Exploration.

BMC Bioinformatics. 2010 Jan 13;11:24. doi: 10.1186/1471-2105-11-24.

eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations.

Nucleic Acids Res. 2010 Jan;38(Database issue):D190-5. doi: 10.1093/nar/gkp951. Epub 2009 Nov 9.

Joining forces in the quest for orthologs.

Genome Biol. 2009;10(9):403. doi: 10.1186/gb-2009-10-9-403. Epub 2009 Sep 29.

trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses.

Bioinformatics. 2009 Aug 1;25(15):1972-3. doi: 10.1093/bioinformatics/btp348. Epub 2009 Jun 8.

Berkeley PHOG: PhyloFacts orthology group prediction web server.

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W84-9. doi: 10.1093/nar/gkp373. Epub 2009 May 12.

The tree versus the forest: the fungal tree of life and the topological diversity within the yeast phylome.

PLoS One. 2009;4(2):e4357. doi: 10.1371/journal.pone.0004357. Epub 2009 Feb 3.

Phylogenetic and functional assessment of orthologs inference projects and methods.

PLoS Comput Biol. 2009 Jan;5(1):e1000262. doi: 10.1371/journal.pcbi.1000262. Epub 2009 Jan 16.

EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates.

Genome Res. 2009 Feb;19(2):327-35. doi: 10.1101/gr.073585.107. Epub 2008 Nov 24.

Large-scale assignment of orthology: back to phylogenetics?

Genome Biol. 2008 Oct 30;9(10):235. doi: 10.1186/gb-2008-9-10-235.

The quest for orthologs: finding the corresponding gene across genomes.

Trends Genet. 2008 Nov;24(11):539-51. doi: 10.1016/j.tig.2008.08.009. Epub 2008 Sep 24.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

MetaPhOrs：使用基于一致性的置信分数，从多种系统发育证据预测直系同源和旁系同源。

MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献