Suppr超能文献

脓毒症关键基因的鉴定及识别机器学习模型的开发

Identification of key genes and development of an identifying machine learning model for sepsis.

作者信息

Li Zhonghao, Chen Shengsong, Gao Nan, Chen Jie, Qin Ying, Zhang Guoqiang

机构信息

Department of Neurosurgery, Dongfang Hospital, Beijing University of Chinese Medicine, Beijing, China.

Emergency Department, China-Japan Friendship Hospital, Beijing, China.

出版信息

Inflamm Res. 2025 Jun 30;74(1):100. doi: 10.1007/s00011-025-02068-7.

Abstract

OBJECTIVE AND DESIGN

This study aims to identify key genes of sepsis and construct a model for sepsis identification through integrated multi-organ single-cell RNA sequencing (scRNA-seq) and machine learning.

MATERIAL OR SUBJECTS

Datasets downloaded from the Gene Expression Omnibus (GSE207363, GSE207651, GSE185263, GSE69063 and GSE134347) were used.

METHODS

ScRNA-seq data extracted from heart (GSE207363) and lung tissues (GSE207651) of septic mice were processed and analyzed using the Seurat package in R. Key genes were identified as present in both heart and lung tissues, resulting from the overlap of three analyses along with differential expression analyses. We then used support vector machine recursive feature elimination to construct a model for sepsis identification based on these key genes. The GSE185263 dataset was used for training, while GSE69063 and GSE134347 were used for testing. The accuracy of the model in identifying of sepsis was validated by analyzing the area under the receiver operating characteristic curve (AUROC) using the test datasets.

RESULTS

Thirteen genes were initially identified as key genes, and after translation to their human homologs, ten genes remained. The optimal SVM-RFE model incorporated eight of these genes (CAMP, CD74, HLA-DQA1, HLA-DQB1, HLA-DMA, HLA-DRB5, and LYZ). In the two test datasets, the AUROC value for the accuracy of the model in identifying of sepsis was 0.904 and 0.924, respectively.

CONCLUSIONS

We have identified several key genes and developed a machine learning model for sepsis identification. Further studies are needed to validate our findings.

摘要

目的与设计

本研究旨在通过整合多器官单细胞RNA测序(scRNA-seq)和机器学习来鉴定脓毒症的关键基因并构建脓毒症识别模型。

材料或研究对象

使用从基因表达综合数据库(GSE207363、GSE207651、GSE185263、GSE69063和GSE134347)下载的数据集。

方法

使用R语言中的Seurat软件包对从脓毒症小鼠的心脏(GSE207363)和肺组织(GSE207651)中提取的scRNA-seq数据进行处理和分析。通过三次分析的重叠以及差异表达分析,将同时存在于心脏和肺组织中的基因鉴定为关键基因。然后,我们使用支持向量机递归特征消除法,基于这些关键基因构建脓毒症识别模型。GSE185263数据集用于训练,而GSE69063和GSE134347用于测试。通过使用测试数据集分析受试者工作特征曲线下面积(AUROC)来验证模型在识别脓毒症方面的准确性。

结果

最初鉴定出13个基因作为关键基因,在转化为人同源基因后,剩下10个基因。最佳支持向量机递归特征消除(SVM-RFE)模型纳入了其中8个基因(CAMP、CD74、HLA-DQA1、HLA-DQB1、HLA-DMA、HLA-DRB5和LYZ)。在两个测试数据集中,该模型识别脓毒症准确性的AUROC值分别为0.904和0.924。

结论

我们已经鉴定出几个关键基因,并开发了一种用于脓毒症识别的机器学习模型。需要进一步研究来验证我们的发现。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验