Bleazard Thomas, Lamb Janine A, Griffiths-Jones Sam
Faculty of Medical and Human Sciences, and Faculty of Life Sciences, University of Manchester, UK.
Bioinformatics. 2015 May 15;31(10):1592-8. doi: 10.1093/bioinformatics/btv023. Epub 2015 Jan 20.
Many studies have investigated the differential expression of microRNAs (miRNAs) in disease states and between different treatments, tissues and developmental stages. Given a list of perturbed miRNAs, it is common to predict the shared pathways on which they act. The standard test for functional enrichment typically yields dozens of significantly enriched functional categories, many of which appear frequently in the analysis of apparently unrelated diseases and conditions.
We show that the most commonly used functional enrichment test is inappropriate for the analysis of sets of genes targeted by miRNAs. The hypergeometric distribution used by the standard method consistently results in significant P-values for functional enrichment for targets of randomly selected miRNAs, reflecting an underlying bias in the predicted gene targets of miRNAs as a whole. We developed an algorithm to measure enrichment using an empirical sampling approach, and applied this in a reanalysis of the gene ontology classes of targets of miRNA lists from 44 published studies. The vast majority of the miRNA target sets were not significantly enriched in any functional category after correction for bias. We therefore argue against continued use of the standard functional enrichment method for miRNA targets.
许多研究调查了疾病状态下以及不同治疗、组织和发育阶段中微小RNA(miRNA)的差异表达。给定一组受干扰的miRNA,预测它们共同作用的途径很常见。功能富集的标准测试通常会产生数十个显著富集的功能类别,其中许多在分析明显无关的疾病和病症时经常出现。
我们表明,最常用的功能富集测试不适用于分析miRNA靶向的基因集。标准方法使用的超几何分布始终会为随机选择的miRNA的靶标产生显著的功能富集P值,这反映了miRNA预测基因靶标整体上存在的潜在偏差。我们开发了一种使用经验抽样方法测量富集的算法,并将其应用于对44项已发表研究中miRNA列表靶标的基因本体类别进行的重新分析。在校正偏差后,绝大多数miRNA靶标集在任何功能类别中都没有显著富集。因此,我们反对继续对miRNA靶标使用标准功能富集方法。