Vilella Albert J, Severin Jessica, Ureta-Vidal Abel, Heng Li, Durbin Richard, Birney Ewan
EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.
Genome Res. 2009 Feb;19(2):327-35. doi: 10.1101/gr.073585.107. Epub 2008 Nov 24.
We have developed a comprehensive gene orientated phylogenetic resource, EnsemblCompara GeneTrees, based on a computational pipeline to handle clustering, multiple alignment, and tree generation, including the handling of large gene families. We developed two novel non-sequence-based metrics of gene tree correctness and benchmarked a number of tree methods. The TreeBeST method from TreeFam shows the best performance in our hands. We also compared this phylogenetic approach to clustering approaches for ortholog prediction, showing a large increase in coverage using the phylogenetic approach. All data are made available in a number of formats and will be kept up to date with the Ensembl project.
我们基于一个用于处理聚类、多重比对和树生成(包括处理大型基因家族)的计算流程,开发了一个全面的基因导向系统发育资源EnsemblCompara GeneTrees。我们开发了两种基于非序列的全新基因树正确性度量方法,并对多种树构建方法进行了基准测试。在我们的测试中,来自TreeFam的TreeBeST方法表现最佳。我们还将这种系统发育方法与用于直系同源物预测的聚类方法进行了比较,结果表明使用系统发育方法时覆盖范围大幅增加。所有数据都以多种格式提供,并将与Ensembl项目保持同步更新。