PCD Diagnostic Centre and Dept of Paediatric Respiratory Medicine, Royal Brompton and Harefield NHS Trust, London, UK.
Division of Molecular and Clinical Medicine, University of Dundee, Ninewells Hospital and Medical School, Dundee, UK.
Eur Respir J. 2021 Aug 5;58(2). doi: 10.1183/13993003.02359-2020. Print 2021 Aug.
Primary ciliary dyskinesia (PCD) is a heterogeneous inherited disorder caused by mutations in approximately 50 cilia-related genes. PCD genotype-phenotype relationships have mostly arisen from small case series because existing statistical approaches to investigating relationships have been unsuitable for rare diseases.
We applied a topological data analysis (TDA) approach to investigate genotype-phenotype relationships in PCD. Data from separate training and validation cohorts included 396 genetically defined individuals carrying pathogenic variants in PCD genes. To develop the TDA models, 12 clinical and diagnostic variables were included. TDA-driven hypotheses were subsequently tested using traditional statistics.
Disease severity at diagnosis, measured by forced expiratory volume in 1 s (FEV) z-score, was significantly worse in individuals with mutations (compared to other gene mutations) and better in those with mutations; the latter also reported less neonatal respiratory distress. Patients without neonatal respiratory distress had better preserved FEV at diagnosis. Individuals with mutations were phenotypically diverse. Cilia ultrastructure and beat pattern defects correlated closely to specific causative gene groups, confirming these tests can be used to support a genetic diagnosis.
This large scale, multi-national study presents PCD as a syndrome with overlapping symptoms and variations in phenotype according to genotype. TDA modelling confirmed genotype-phenotype relationships reported by smaller studies ( FEV worse with mutation) and identified new relationships, including FEV preservation with mutations and diversity of severity with mutations.
原发性纤毛运动障碍(PCD)是一种由大约 50 个纤毛相关基因的突变引起的异质性遗传性疾病。PCD 的基因型-表型关系主要来自于小的病例系列,因为现有的统计方法不适用于罕见疾病。
我们应用拓扑数据分析(TDA)方法来研究 PCD 的基因型-表型关系。来自单独的训练和验证队列的数据包括 396 名携带 PCD 基因突变的遗传定义个体。为了开发 TDA 模型,纳入了 12 个临床和诊断变量。随后使用传统统计学方法测试 TDA 驱动的假设。
以 1 秒用力呼气量(FEV)z 分数衡量的诊断时疾病严重程度,在 突变个体中明显较差(与其他基因突变相比),在 突变个体中较好;后者也报告新生儿呼吸窘迫较少。无新生儿呼吸窘迫的患者在诊断时的 FEV 保存更好。没有 突变的个体表型多样。纤毛超微结构和拍打模式缺陷与特定的致病基因群密切相关,证实这些测试可用于支持遗传诊断。
这项大规模的、多国家的研究表明 PCD 是一种具有重叠症状和表型变异的综合征,取决于基因型。TDA 模型证实了较小研究报告的基因型-表型关系(FEV 在 突变时更差),并确定了新的关系,包括 突变时 FEV 的保存和 突变时严重程度的多样性。