Wang Jingshu, Gui Lin, Su Weijie J, Sabatti Chiara, Owen Art B
Department of Statistics, The University of Chicago.
Department of Statistics and Data Science, University of Pennsylvania.
Ann Stat. 2022 Aug;50(4):1890-1909. doi: 10.1214/21-aos2139. Epub 2022 Aug 25.
Replicability is a fundamental quality of scientific discoveries: we are interested in those signals that are detectable in different laboratories, different populations, across time etc. Unlike meta-analysis which accounts for experimental variability but does not guarantee replicability, testing a partial conjunction (PC) null aims specifically to identify the signals that are discovered in multiple studies. In many contemporary applications, for example, comparing multiple high-throughput genetic experiments, a large number of PC nulls need to be tested simultaneously, calling for a multiple comparisons correction. However, standard multiple testing adjustments on the PC -values can be severely conservative, especially when is large and the signals are sparse. We introduce AdaFilter, a new multiple testing procedure that increases power by adaptively filtering out unlikely candidates of PC nulls. We prove that AdaFilter can control FWER and FDR as long as data across studies are independent, and has much higher power than other existing methods. We illustrate the application of AdaFilter with three examples: microarray studies of Duchenne muscular dystrophy, single-cell RNA sequencing of T cells in lung cancer tumors and GWAS for metabolomics.
我们关注那些在不同实验室、不同人群以及不同时间等条件下都能检测到的信号。与荟萃分析不同,荟萃分析考虑了实验变异性但不保证可重复性,而检验部分合取(PC)原假设专门旨在识别在多项研究中发现的信号。例如,在许多当代应用中,比较多个高通量基因实验时,需要同时检验大量的PC原假设,这就需要进行多重比较校正。然而,对P值进行标准的多重检验调整可能会非常保守,尤其是当样本量很大且信号稀疏时。我们引入了AdaFilter,一种新的多重检验程序,它通过自适应地滤除不太可能的PC原假设候选者来提高检验效能。我们证明,只要各研究的数据是独立的,AdaFilter就能控制错误发现率(FWER)和错误发现比例(FDR),并且其检验效能比其他现有方法高得多。我们用三个例子说明了AdaFilter的应用:杜兴氏肌肉营养不良症的微阵列研究、肺癌肿瘤中T细胞的单细胞RNA测序以及代谢组学的全基因组关联研究(GWAS)。