School of Life Sciences, Shanghai University, Shanghai, 200444, China.
Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.
Comput Biol Med. 2024 Feb;169:107883. doi: 10.1016/j.compbiomed.2023.107883. Epub 2023 Dec 22.
COVID-19 is hypothesized to exert enduring effects on the immune systems of patients, leading to alterations in immune-related gene expression. This study aimed to scrutinize the persistent implications of SARS-CoV-2 infection on gene expression and its influence on subsequent immune activation responses. We designed a machine learning-based approach to analyze transcriptomic data from both healthy individuals and patients who had recovered from COVID-19. Patients were categorized based on their influenza vaccination status and then compared with healthy controls. The initial sample set encompassed 86 blood samples from healthy controls and 72 blood samples from recuperated COVID-19 patients prior to influenza vaccination. The second sample set included 123 blood samples from healthy controls and 106 blood samples from recovered COVID-19 patients who had been vaccinated against influenza. For each sample, the dataset captured expression levels of 17,060 genes. Above two sample sets were first analyzed by seven feature ranking algorithms, yielding seven feature lists for each dataset. Then, each list was fed into the incremental feature selection method, incorporating three classic classification algorithms, to extract essential genes, classification rules and build efficient classifiers. The genes and rules were analyzed in this study. The main findings included that NEXN and ZNF354A were highly expressed in recovered COVID-19 patients, whereas MKI67 and GZMB were highly expressed in patients with secondary immune activation post-COVID-19 recovery. These pivotal genes could provide valuable insights for future health monitoring of COVID-19 patients and guide the creation of continued treatment regimens.
COVID-19 被假设会对患者的免疫系统产生持久影响,导致免疫相关基因表达的改变。本研究旨在仔细研究 SARS-CoV-2 感染对基因表达的持续影响及其对随后的免疫激活反应的影响。我们设计了一种基于机器学习的方法来分析来自健康个体和从 COVID-19 中康复的患者的转录组数据。根据他们的流感疫苗接种情况对患者进行分类,然后与健康对照组进行比较。初始样本集包括 86 份来自健康对照者的血液样本和 72 份来自流感疫苗接种前 COVID-19 康复患者的血液样本。第二个样本集包括 123 份来自健康对照者的血液样本和 106 份已接种流感疫苗的 COVID-19 康复患者的血液样本。对于每个样本,数据集捕获了 17060 个基因的表达水平。以上两个样本集首先通过七种特征排序算法进行分析,为每个数据集生成了七个特征列表。然后,将每个列表输入到增量特征选择方法中,该方法结合了三种经典分类算法,以提取重要基因、分类规则和构建高效分类器。对基因和规则进行了分析。主要发现包括,在 COVID-19 康复患者中,NEXN 和 ZNF354A 高度表达,而在 COVID-19 康复后二次免疫激活的患者中,MKI67 和 GZMB 高度表达。这些关键基因可为未来 COVID-19 患者的健康监测提供有价值的见解,并指导制定持续治疗方案。