Icahn Institute for Genomics and Multiscale Biology and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
Division of Pulmonary and Critical Care Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA.
Sci Rep. 2018 Jun 11;8(1):8826. doi: 10.1038/s41598-018-27189-4.
Asthma is a common, under-diagnosed disease affecting all ages. We sought to identify a nasal brush-based classifier of mild/moderate asthma. 190 subjects with mild/moderate asthma and controls underwent nasal brushing and RNA sequencing of nasal samples. A machine learning-based pipeline identified an asthma classifier consisting of 90 genes interpreted via an L2-regularized logistic regression classification model. This classifier performed with strong predictive value and sensitivity across eight test sets, including (1) a test set of independent asthmatic and control subjects profiled by RNA sequencing (positive and negative predictive values of 1.00 and 0.96, respectively; AUC of 0.994), (2) two independent case-control cohorts of asthma profiled by microarray, and (3) five cohorts with other respiratory conditions (allergic rhinitis, upper respiratory infection, cystic fibrosis, smoking), where the classifier had a low to zero misclassification rate. Following validation in large, prospective cohorts, this classifier could be developed into a nasal biomarker of asthma.
哮喘是一种常见的、诊断不足的疾病,影响所有年龄段。我们试图确定一种基于鼻腔刷的轻度/中度哮喘分类器。190 名轻度/中度哮喘患者和对照组接受了鼻腔刷检和鼻腔样本的 RNA 测序。基于机器学习的管道确定了一个由 90 个基因组成的哮喘分类器,这些基因通过 L2 正则化逻辑回归分类模型进行解释。该分类器在包括(1)由 RNA 测序进行特征分析的独立哮喘和对照组测试集(阳性和阴性预测值分别为 1.00 和 0.96,AUC 为 0.994),(2)两个通过微阵列进行哮喘特征分析的独立病例对照队列,以及(3)五个具有其他呼吸道疾病(过敏性鼻炎、上呼吸道感染、囊性纤维化、吸烟)的队列在内的八个测试集中具有很强的预测价值和灵敏度,该分类器的错误分类率低至为零。在大型前瞻性队列中验证后,该分类器可开发为哮喘的鼻腔生物标志物。