Jang Yeongjun, Yu Namhee, Seo Jihae, Kim Sun, Lee Sanghyuk
Ewha Research Center for Systems Biology (ERCSB), Ewha Womans University, Seoul, Republic of Korea.
Interdisciplinary Program in Bioinformatics, College of Natural Science, Seoul National University, Seoul, Republic of Korea.
Biol Direct. 2016 Mar 18;11(1):10. doi: 10.1186/s13062-016-0112-y.
Network-based integrative analysis is a powerful technique for extracting biological insights from multilayered omics data such as somatic mutations, copy number variations, and gene expression data. However, integrated analysis of multi-omics data is quite complicated and can hardly be done in an automated way. Thus, a powerful interactive visual mining tool supporting diverse analysis algorithms for identification of driver genes and regulatory modules is much needed.
Here, we present a software platform that integrates network visualization with omics data analysis tools seamlessly. The visualization unit supports various options for displaying multi-omics data as well as unique network models for describing sophisticated biological networks such as complex biomolecular reactions. In addition, we implemented diverse in-house algorithms for network analysis including network clustering and over-representation analysis. Novel functions include facile definition and optimized visualization of subgroups, comparison of a series of data sets in an identical network by data-to-visual mapping and subsequent overlaying function, and management of custom interaction networks. Utility of MONGKIE for network-based visual data mining of multi-omics data was demonstrated by analysis of the TCGA glioblastoma data. MONGKIE was developed in Java based on the NetBeans plugin architecture, thus being OS-independent with intrinsic support of module extension by third-party developers.
We believe that MONGKIE would be a valuable addition to network analysis software by supporting many unique features and visualization options, especially for analysing multi-omics data sets in cancer and other diseases. .
基于网络的整合分析是一种从体细胞突变、拷贝数变异和基因表达数据等多层组学数据中提取生物学见解的强大技术。然而,多组学数据的整合分析相当复杂,几乎无法以自动化方式完成。因此,非常需要一个强大的交互式可视化挖掘工具,它支持多种分析算法来识别驱动基因和调控模块。
在此,我们展示了一个软件平台,该平台将网络可视化与组学数据分析工具无缝集成。可视化单元支持多种显示多组学数据的选项,以及用于描述复杂生物网络(如复杂生物分子反应)的独特网络模型。此外,我们实现了多种用于网络分析的内部算法,包括网络聚类和过表达分析。新功能包括轻松定义和优化亚组的可视化、通过数据到可视化映射以及后续叠加功能在同一网络中比较一系列数据集,以及管理自定义交互网络。通过对TCGA胶质母细胞瘤数据的分析,证明了MONGKIE在基于网络的多组学数据可视化挖掘中的实用性。MONGKIE是基于NetBeans插件架构用Java开发的,因此与操作系统无关,并且第三方开发者对模块扩展有内在支持。
我们相信,MONGKIE通过支持许多独特功能和可视化选项,将成为网络分析软件的一个有价值的补充,特别是在分析癌症和其他疾病的多组学数据集方面。