EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates.

Author information

Vilella Albert J, Severin Jessica, Ureta-Vidal Abel, Heng Li, Durbin Richard, Birney Ewan

Affiliation

EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.

Publication information

Genome Res. 2009 Feb;19(2):327-35. doi: 10.1101/gr.073585.107. Epub 2008 Nov 24.

Abstract

We have developed a comprehensive gene-oriented phylogenetic resource, EnsemblCompara GeneTrees, based on a computational pipeline that handles clustering, multiple alignment, and tree generation, including the handling of large gene families. We developed two novel non-sequence-based metrics of gene tree correctness and benchmarked a number of tree methods. The TreeBeST method from TreeFam shows the best performance in our hands. We also compared this phylogenetic approach to clustering approaches for ortholog prediction, showing a large increase in coverage using the phylogenetic approach. All data are made available in a number of formats and will be kept up to date with the Ensembl project.
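
To illustrate what "duplication-aware" means for ortholog prediction, the sketch below applies the standard tree-based definition: two genes are orthologs if their last common ancestor in the gene tree is a speciation node, and paralogs if it is a duplication node. This is a minimal illustration under stated assumptions, not the TreeBeST or Ensembl implementation; the `Leaf` and `Node` classes, the `classify_pairs` function, and the gene names are all hypothetical, and the event labels are assumed to have been assigned already by some reconciliation step.

```python
from dataclasses import dataclass
from itertools import product
from typing import List, Union


@dataclass
class Leaf:
    gene: str  # hypothetical gene identifier


@dataclass
class Node:
    event: str  # assumed to be "speciation" or "duplication"
    children: List[Union["Node", Leaf]]


def classify_pairs(tree):
    """Return (orthologs, paralogs) as sets of frozenset gene pairs."""
    orthologs, paralogs = set(), set()

    def visit(node):
        # A leaf contributes only its own gene.
        if isinstance(node, Leaf):
            return [node.gene]
        child_genes = [visit(child) for child in node.children]
        # Genes under different children have this node as their
        # last common ancestor, so its event label classifies the pair.
        for i in range(len(child_genes)):
            for j in range(i + 1, len(child_genes)):
                for a, b in product(child_genes[i], child_genes[j]):
                    target = orthologs if node.event == "speciation" else paralogs
                    target.add(frozenset((a, b)))
        return [gene for genes in child_genes for gene in genes]

    visit(tree)
    return orthologs, paralogs


# Toy tree: one duplication predating a human/mouse speciation.
toy_tree = Node("duplication", [
    Node("speciation", [Leaf("HUMAN_A"), Leaf("MOUSE_A")]),
    Node("speciation", [Leaf("HUMAN_B"), Leaf("MOUSE_B")]),
])

orthologs, paralogs = classify_pairs(toy_tree)
for pair in sorted(sorted(p) for p in orthologs):
    print("ortholog pair:", pair)
```

In this toy tree only the within-copy pairs (A with A, B with B) come out as orthologs, while every cross-copy pair is a paralog. A similarity-only clustering method has no such event labels to draw on, which is the kind of distinction the tree-based approach makes explicit.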
