Center for Bioinformatics ZBIT, Tübingen University, Tübingen, Germany.
ISME J. 2010 Oct;4(10):1236-42. doi: 10.1038/ismej.2010.51. Epub 2010 Apr 29.
Second-generation sequencing technologies are fueling a vast increase in the number and scope of metagenome projects. New methods are needed for visualizing the relationships between multiple metagenomic data sets. To address this, a novel approach is presented that combines taxonomic analysis, ecological indices and non-hierarchical clustering to provide a network representation of the relationships between different metagenome data sets. The approach is illustrated using several published data sets of different types, including metagenomes, metatranscriptomes and 16S ribosomal profiles. Applying the approach to the same data summarized at different taxonomic levels gives rise to remarkably similar networks, indicating that the analysis is robust to the choice of taxonomic level. Importantly, the networks provide both a visual representation and a metric quantification of the unrooted relationships between samples, combining the desirable characteristics of other tools into one.
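As a rough illustration of the general idea (taxonomic profiles, an ecological index, and a network of samples), the following minimal sketch computes Bray-Curtis dissimilarities between samples and keeps only the edges below a cutoff. It is not the authors' implementation: the choice of Bray-Curtis as the index, the thresholded graph used in place of the paper's non-hierarchical clustering, the function names, the sample names, and the counts are all assumptions made for illustration.

```python
from itertools import combinations


def normalize(counts):
    """Convert raw taxon counts into relative abundances."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample has no assigned reads")
    return {taxon: n / total for taxon, n in counts.items()}


def bray_curtis(p, q):
    """Bray-Curtis dissimilarity between two abundance profiles (0 = identical)."""
    taxa = set(p) | set(q)
    num = sum(abs(p.get(t, 0.0) - q.get(t, 0.0)) for t in taxa)
    den = sum(p.get(t, 0.0) + q.get(t, 0.0) for t in taxa)
    return num / den if den else 0.0


def distance_network(samples, max_distance):
    """Return (sample_a, sample_b, distance) edges with distance <= max_distance.

    A simple thresholded graph, used here only as a stand-in for a
    network representation of the sample relationships.
    """
    profiles = {name: normalize(c) for name, c in samples.items()}
    edges = []
    for a, b in combinations(sorted(profiles), 2):
        d = bray_curtis(profiles[a], profiles[b])
        if d <= max_distance:
            edges.append((a, b, d))
    return edges


# Hypothetical phylum-level read counts for three samples (made-up data).
samples = {
    "soil_A": {"Proteobacteria": 50, "Firmicutes": 30, "Actinobacteria": 20},
    "soil_B": {"Proteobacteria": 45, "Firmicutes": 35, "Actinobacteria": 20},
    "marine": {"Proteobacteria": 10, "Cyanobacteria": 80, "Bacteroidetes": 10},
}

edges = distance_network(samples, max_distance=0.5)
for a, b, d in edges:
    print(f"{a} -- {b}: {d:.2f}")
```

With these made-up counts, the two soil samples are joined by an edge while the marine sample stays unconnected at the chosen cutoff. Re-running the same comparison on counts summarized at another taxonomic level (for example, genus instead of phylum) is one way to probe the kind of robustness the abstract describes.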