Mansel Clayton O, Ghisays Valentina, Mahnken Jonathan D, Swerdlow Russell H, Reiman Eric M, Karnes Jason H, Denny Joshua C, Veatch Olivia J
Department of Cell Biology and Physiology, University of Kansas Medical Center, Kansas City, KS, USA.
Banner Alzheimer's Institute, Phoenix, AZ, USA.
medRxiv. 2025 May 23:2025.05.22.25328175. doi: 10.1101/2025.05.22.25328175.
Recent genome-wide association studies (GWAS) for Alzheimer's Disease and related dementias (ADRD) have increased statistical power via larger analysis datasets from biobanks by 1) including non-age-matched controls and prevalent cases, and/or 2) including individuals who report a family history of ADRD as proxy cases. However, these methods have the potential to increase noise and distort genetic associations which are important for genomic-informed prevention and treatment of ADRD. Here, we sought to understand how the effect sizes of genetic associations in ADRD could be sensitive to these methodological choices, using genotypes as an example.
Participants in the Research Program over the age of 49 at enrollment (n=258,693) were assigned one of four categories: incident ADRD (developed after enrollment in ), prevalent ADRD (present on enrollment), proxy ADRD (participant noted a family history of ADRD), and control (no history or diagnosis of ADRD). Dementia diagnoses were determined using available Electronic Health Records (EHR) and genotype was determined using whole-genome sequencing. Effect sizes for the associations between risk alleles and ADRD diagnoses were compared using polychotomous logistic regression.
The mean age of the cohort was 67±10 years, and it was 58% female; 63% clustered predominantly with European genetic reference populations. Among the participants, 3,107 (1.2%) had prevalent ADRD, 301 (0.1%) had incident ADRD, and 19,910 (7.7%) reported a family history of ADRD (proxy ADRD). Both prevalent and proxy ADRD had attenuated associations with genotype compared to incident ADRD. The adjusted generalized ratio (95% CI) (AGR) for incident ADRD for ε4 heterozygotes was 2.95 (2.31-3.74) compared to 2.10 (1.96-2.24) and 1.42 (1.32-1.55) for proxy and prevalent ADRD, respectively. For ε4 homozygotes, the effect sizes were even more different. Furthermore, association effect sizes increased when restricting the control (no ADRD) group to older age brackets.
Our study highlights how genetic associations with ADRD can be sensitive to how cases are defined in biobanks like , with effect sizes downwardly biased when using prevalent or by-proxy cases compared to incident cases.
近期针对阿尔茨海默病及相关痴呆症(ADRD)的全基因组关联研究(GWAS)通过生物样本库中更大的分析数据集提高了统计效力,方法包括:1)纳入非年龄匹配的对照和现患病例,和/或2)纳入报告有ADRD家族史的个体作为替代病例。然而,这些方法有可能增加噪声并扭曲基因关联,而这些关联对于基于基因组信息的ADRD预防和治疗至关重要。在此,我们以基因型为例,试图了解ADRD中基因关联的效应大小如何对这些方法选择敏感。
研究项目中年龄在49岁及以上的参与者(n = 258,693)被分为四类之一:新发ADRD(入组后发病)、现患ADRD(入组时存在)、替代ADRD(参与者报告有ADRD家族史)和对照(无ADRD病史或诊断)。使用可用的电子健康记录(EHR)确定痴呆症诊断,并使用全基因组测序确定基因型。使用多分类逻辑回归比较风险等位基因与ADRD诊断之间关联的效应大小。
队列的平均年龄为67±10岁,女性占58%;63%主要聚类于欧洲遗传参考人群。在参与者中,3107人(1.2%)患有现患ADRD,301人(0.1%)患有新发ADRD,19910人(7.7%)报告有ADRD家族史(替代ADRD)。与新发ADRD相比,现患和替代ADRD与基因型的关联均减弱。新发ADRD的ε4杂合子的调整广义比(95%CI)(AGR)为2.95(2.31 - 3.74),而替代和现患ADRD分别为2.10(1.96 - 2.24)和1.42(1.32 - 1.55)。对于ε4纯合子,效应大小差异更大。此外,将对照(无ADRD)组限制在较高年龄组时,关联效应大小增加。
我们的研究强调了ADRD的基因关联如何对像[生物样本库名称]这样的生物样本库中病例的定义敏感,与新发病例相比,使用现患或替代病例时效应大小存在向下偏差。