DETECTING MULTIPLE REPLICATING SIGNALS USING ADAPTIVE FILTERING PROCEDURES.

Author information

Wang Jingshu, Gui Lin, Su Weijie J, Sabatti Chiara, Owen Art B

Affiliations

Department of Statistics, The University of Chicago.

Department of Statistics and Data Science, University of Pennsylvania.

Publication information

Ann Stat. 2022 Aug;50(4):1890-1909. doi: 10.1214/21-aos2139. Epub 2022 Aug 25.

Abstract

Replicability is a fundamental quality of scientific discoveries: we are interested in those signals that are detectable in different laboratories, different populations, across time, etc. Unlike meta-analysis, which accounts for experimental variability but does not guarantee replicability, testing a partial conjunction (PC) null aims specifically to identify the signals that are discovered in multiple studies. In many contemporary applications, for example, comparing multiple high-throughput genetic experiments, a large number of PC nulls need to be tested simultaneously, calling for a multiple comparisons correction. However, standard multiple testing adjustments on the PC p-values can be severely conservative, especially when the number of hypotheses is large and the signals are sparse. We introduce AdaFilter, a new multiple testing procedure that increases power by adaptively filtering out unlikely candidates of PC nulls. We prove that AdaFilter can control FWER and FDR as long as data across studies are independent, and has much higher power than other existing methods. We illustrate the application of AdaFilter with three examples: microarray studies of Duchenne muscular dystrophy, single-cell RNA sequencing of T cells in lung cancer tumors and GWAS for metabolomics.
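To make the "standard adjustment" baseline concrete, the sketch below computes a PC p-value for each hypothesis and then applies the Benjamini-Hochberg step-up rule. It is not AdaFilter and is not taken from the paper: it assumes the Bonferroni-style PC p-value (n - r + 1) * p_(r), where p_(r) is the r-th smallest of the n study p-values and r is the required number of replicating studies. The function names and the toy data are hypothetical.

```python
import numpy as np


def pc_pvalues_bonferroni(pvals, r):
    """Bonferroni-style PC p-values (an assumed baseline, not AdaFilter).

    pvals: array of shape (M, n), one row per hypothesis, one column per study.
    r: number of studies in which a signal must be present (1 <= r <= n).
    """
    pvals = np.asarray(pvals, dtype=float)
    n = pvals.shape[1]
    sorted_p = np.sort(pvals, axis=1)
    # Scale the r-th smallest p-value by (n - r + 1) and cap at 1.
    return np.minimum(1.0, (n - r + 1) * sorted_p[:, r - 1])


def benjamini_hochberg(p, alpha=0.05):
    """Boolean rejection mask from the Benjamini-Hochberg step-up rule."""
    p = np.asarray(p, dtype=float)
    m = p.size
    order = np.argsort(p)
    thresholds = alpha * np.arange(1, m + 1) / m
    below = p[order] <= thresholds
    reject = np.zeros(m, dtype=bool)
    if below.any():
        # Reject every hypothesis up to the largest rank meeting its threshold.
        k = np.nonzero(below)[0].max()
        reject[order[: k + 1]] = True
    return reject


# Toy data (made up for illustration): 3 hypotheses, 3 studies, r = 2.
toy = np.array([
    [0.001, 0.002, 0.5],
    [0.2, 0.3, 0.4],
    [0.0001, 0.0002, 0.0003],
])
pc = pc_pvalues_bonferroni(toy, r=2)
rejected = benjamini_hochberg(pc, alpha=0.05)
```

Because each PC p-value is driven by the r-th smallest study p-value, most hypotheses get a PC p-value near 1 when signals are sparse. That is one way to see why this baseline can be conservative, which is the issue the abstract says AdaFilter addresses.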



