Ahmed Rafsan, Erten Cesim, Houdjedj Aissa, Kazan Hilal, Yalcin Cansu
Electrical and Computer Engineering Graduate Program, Antalya Bilim University, Antalya, Turkey.
Department of Computer Engineering, Antalya Bilim University, Antalya, Turkey.
Front Genet. 2021 Nov 26;12:746495. doi: 10.3389/fgene.2021.746495. eCollection 2021.
One of the key concepts employed in cancer driver gene identification is that of mutual exclusivity (ME); a driver mutation is less likely to occur in case of an earlier mutation that has common functionality in the same molecular pathway. Several ME tests have been proposed recently, however the current protocols to evaluate ME tests have two main limitations. Firstly the evaluations are mostly with respect to simulated data and secondly the evaluation metrics lack a network-centric view. The latter is especially crucial as the notion of common functionality can be achieved through searching for interaction patterns in relevant networks. We propose a network-centric framework to evaluate the pairwise significances found by statistical ME tests. It has three main components. The first component consists of metrics employed in the network-centric ME evaluations. Such metrics are designed so that network knowledge and the reference set of known cancer genes are incorporated in ME evaluations under a careful definition of proper control groups. The other two components are designed as further mechanisms to avoid confounders inherent in ME detection on top of the network-centric view. To this end, our second objective is to dissect the side effects caused by mutation load artifacts where mutations driving tumor subtypes with low mutation load might be incorrectly diagnosed as mutually exclusive. Finally, as part of the third main component, the confounding issue stemming from the use of nonspecific interaction networks generated as combinations of interactions from different tissues is resolved through the creation and use of tissue-specific networks in the proposed framework. The data, the source code and useful scripts are available at: https://github.com/abu-compbio/NetCentric.
癌症驱动基因识别中使用的关键概念之一是互斥性(ME);如果在同一分子途径中具有共同功能的早期突变已经发生,那么驱动突变发生的可能性就较小。最近已经提出了几种ME测试方法,然而,目前评估ME测试的方案存在两个主要局限性。首先,评估大多是针对模拟数据,其次,评估指标缺乏以网络为中心的视角。后者尤为关键,因为共同功能的概念可以通过在相关网络中搜索相互作用模式来实现。我们提出了一个以网络为中心的框架来评估统计ME测试发现的成对显著性。它有三个主要组成部分。第一个组成部分包括以网络为中心的ME评估中使用的指标。这些指标的设计使得在仔细定义适当对照组的情况下,网络知识和已知癌症基因的参考集被纳入ME评估。另外两个组成部分被设计为进一步的机制,以在以网络为中心的视角之上避免ME检测中固有的混杂因素。为此,我们的第二个目标是剖析由突变负荷假象引起的副作用,其中驱动低突变负荷肿瘤亚型的突变可能被错误地诊断为互斥。最后,作为第三个主要组成部分的一部分,通过在所提出的框架中创建和使用组织特异性网络,解决了由于使用作为来自不同组织的相互作用组合而生成的非特异性相互作用网络所产生的混杂问题。数据、源代码和有用的脚本可在以下网址获取:https://github.com/abu-compbio/NetCentric。