College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, People's Republic of China.
College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, People's Republic of China.
Brief Bioinform. 2019 Jan 18;20(1):168-177. doi: 10.1093/bib/bbx091.
Pathway enrichment analysis has been widely used to identify cancer risk pathways and contributes to elucidating the mechanism of tumorigenesis. However, most existing approaches use outdated pathway information and neglect the complex gene interactions within pathways. Here, we first briefly reviewed the widely used pathway enrichment analysis approaches, and then proposed a novel topology-based pathway enrichment analysis (TPEA) method, which integrates the topological properties and the global upstream/downstream positions of genes in pathways. We compared TPEA with four widely used pathway enrichment analysis tools, namely the database for annotation, visualization and integrated discovery (DAVID), gene set enrichment analysis (GSEA), centrality-based pathway enrichment (CePa) and signaling pathway impact analysis (SPIA), by analyzing six gene expression profiles of three tumor types (colorectal cancer, thyroid cancer and endometrial cancer). TPEA identified several well-known cancer risk pathways that the existing tools did not detect, and its results were more stable than those of the other tools across different data sets of the same cancer. Finally, we developed an R package that implements TPEA and can update KEGG pathway information online; it is available at the Comprehensive R Archive Network (CRAN): https://cran.r-project.org/web/packages/TPEA/.
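The abstract does not spell out how TPEA scores a pathway, so the following is only a generic illustration of the idea that topology can weight an enrichment statistic; it is not the TPEA algorithm. It assumes, purely for illustration, that each gene's weight is one plus its degree in the pathway graph, and it estimates significance by permuting the differentially expressed gene set over a background. The toy pathway, the gene names and all function names are hypothetical.

```python
import random

def degree_weights(edges, genes):
    """Weight each pathway gene by 1 + its degree (assumed weighting, not TPEA's)."""
    degree = {g: 0 for g in genes}
    for src, dst in edges:
        degree[src] += 1
        degree[dst] += 1
    return {g: 1 + d for g, d in degree.items()}

def weighted_score(weights, de_genes):
    """Fraction of the pathway's total weight carried by differentially expressed genes."""
    total = sum(weights.values())
    hit = sum(w for g, w in weights.items() if g in de_genes)
    return hit / total

def permutation_p_value(weights, de_genes, background, n_perm=2000, seed=0):
    """Empirical p-value: how often a random gene set of equal size scores at least as high."""
    rng = random.Random(seed)
    observed = weighted_score(weights, de_genes)
    n_de = len(de_genes)
    at_least = 0
    for _ in range(n_perm):
        sample = set(rng.sample(background, n_de))
        if weighted_score(weights, sample) >= observed:
            at_least += 1
    # Add-one correction keeps the estimate strictly above zero.
    return observed, (at_least + 1) / (n_perm + 1)

# Hypothetical toy pathway: A regulates B and C, both feed D, D feeds E.
pathway_genes = ["A", "B", "C", "D", "E"]
pathway_edges = [("A", "B"), ("A", "C"), ("B", "D"), ("C", "D"), ("D", "E")]
background = pathway_genes + [f"X{i}" for i in range(1, 21)]
de_genes = {"A", "D", "X1"}  # made-up differentially expressed genes

weights = degree_weights(pathway_edges, pathway_genes)
score, p_value = permutation_p_value(weights, de_genes, background)
print(f"weighted score = {score:.3f}, empirical p = {p_value:.4f}")
```

Compared with a plain over-representation count, a hub gene such as D contributes more to the score than a leaf gene such as E, which is the general motivation behind topology-aware methods; the actual weighting used by TPEA should be taken from the paper and the package documentation.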