Suppr超能文献

[机器学习与逻辑回归模型在预测心脏手术后急性肾损伤中的比较:基于MIMIC-III数据库的数据分析]

[Comparison of machine learning and Logistic regression model in predicting acute kidney injury after cardiac surgery: data analysis based on MIMIC-III database].

作者信息

Xiong Wei, Zhang Lifan, She Kai, Xu Guo, Bai Shanglin, Liu Xuan

机构信息

Department of Cardiovascular Surgery, Sichuan Mianyang 404 Hospital, Mianyang 621000, Sichuan, China.

Department of Clinical Medical College, West China Hospital of Sichuan University, Chengdu 610041, Sichuan, China. Corresponding author: Xiong Wei, Email:

出版信息

Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2022 Nov;34(11):1188-1193. doi: 10.3760/cma.j.cn121430-20210223-00279.

Abstract

OBJECTIVE

To establish an acute kidney injury (AKI) prediction model in patients after cardiac surgery by extreme gradient boosting (XGBoost) machine learning model, and to explore the risk and protective factors for AKI in patients after cardiac surgery.

METHODS

All patients who underwent cardiac surgery in Medical Information Mart for Intensive Care-III (MIMIC-III) database were enrolled, and they were divided into AKI group and non-AKI group according to whether AKI developed within 14 days after cardiac surgery. Their clinical characteristics were compared. Based on five-fold cross-validation, XGBoost and Logistic regression were used to establish the prediction model of AKI after cardiac surgery. And the area under the receiver operator characteristic curve (AUC) of the models was compared. The output model of XGBoost was interpreted by Shapley additive explanations (SHAP).

RESULTS

A total of 6 912 patients were included, of which 5 681 (82.2%) developed AKI within 14 days after the operation, and 1 231 (17.8%) did not. Compared with the non-AKI group, the main characteristics of AKI group included older age [years: 68.0 (59.0, 76.0) vs. 62.0 (52.0, 71.0)], higher incidence of emergency admission and complicated with obesity and diabetes (52.4% vs. 47.8%, 9.0% vs. 4.0%, 32.0% vs. 22.2%), lower respiratory rate [RR; bpm: times/min: 17.0 (14.0, 20.0) vs. 19.0 (15.0, 22.0)], lower heart rate [HR; bpm: 80.0 (67.0, 89.0) vs. 82.0 (71.5, 93.0)], higher blood pressure [mmHg (1 mmHg ≈ 0.133 kPa): 80.0 (70.7, 90.0) vs. 78.0 (70.0, 88.0)], higher hemoglobin (Hb), blood glucose, blood K level and serum creatinine [SCr; Hb (g/L): 122.0 (109.0, 136.0) vs. 120.0 (106.0, 135.0), blood glucose (mmol/L): 7.3 (6.1, 8.9) vs. 6.8 (5.7, 8.5), blood K level (mmol/L): 4.2 (3.9, 4.7) vs. 4.2 (3.8, 4.6), SCr (μmol/L): 88.4 (70.7, 106.1) vs. 79.6 (70.7, 97.2)], lower albumin (ALB) and triacylglycerol [TG; ALB (g/L): 38.0 (35.0, 41.0) vs. 39.0 (37.0, 42.0), TG (mmol/L): 1.4 (1.0, 2.0) vs. 1.5 (1.0, 2.2)] as well as higher incidence of multiple organ dysfunction syndrome (MODS) and sepsis (30.6% vs. 16.2%, 3.3% vs. 1.9%), with significant differences (all P < 0.05). In the output model of Logistic regression, important predictors were lactic acid [Lac; odds ratio (OR) = 1.062, 95% confidence interval (95%CI) was 1.030-1.100, P = 0.005], obesity (OR = 2.234, 95%CI was 1.900-2.640, P < 0.001), male (OR = 0.858, 95%CI was 0.794-0.928, P = 0.049), diabetes (OR = 1.820, 95%CI was 1.680-1.980, P < 0.001) and emergency admission (OR = 1.278, 95%CI was 1.190-1.380, P < 0.001). Receiver operator characteristic curve (ROC curve) analysis showed that the AUC of the Logistic regression model for predicting AKI after cardiac surgery was 0.62 (95%CI was 0.61-0.67). After optimizing the XGBoost model parameters by grid search combined with five-fold cross-validation, the model was trained well with no overfitting or overfitting. ROC analysis showed that the AUC of XGBoost model for predicting AKI after cardiac surgery was 0.77 (95%CI was 0.75-0.80), which was significantly higher than that of Logistic regression model (P < 0.01). After SHAP treatment, in the output model of XGBoost, age and ALB were the most important predictors of the final outcome, where age was the risk factor (average |SHAP value| was 0.434), and ALB was the protective factor (average |SHAP value| was 0.221).

CONCLUSIONS

Age is an important risk factor for AKI after cardiac surgery, and ALB is a protective factor. The performance of machine learning in predicting cardiac and vascular surgery-associated AKI is better than the traditional Logistic regression. XGBoost can analyze the more complex relationship between variables and outcomes, and can predict the risk of postoperative AKI more accurately and individually.

摘要

目的

通过极限梯度提升(XGBoost)机器学习模型建立心脏手术后急性肾损伤(AKI)预测模型,并探讨心脏手术后患者发生AKI的危险因素及保护因素。

方法

纳入重症监护医学信息数据库三期(MIMIC-III)中接受心脏手术的所有患者,根据术后14天内是否发生AKI分为AKI组和非AKI组,比较两组患者的临床特征。基于五折交叉验证,采用XGBoost和逻辑回归建立心脏手术后AKI的预测模型,并比较模型的受试者操作特征曲线下面积(AUC)。采用Shapley加性解释(SHAP)对XGBoost的输出模型进行解释。

结果

共纳入6912例患者,其中5681例(82.2%)术后14天内发生AKI,1231例(17.8%)未发生AKI。与非AKI组相比,AKI组的主要特征包括年龄较大[岁:68.0(59.0,76.0) vs. 62.0(52.0,71.0)]、急诊入院发生率较高且合并肥胖和糖尿病(52.4% vs. 47.8%,9.0% vs. 4.0%,32.0% vs. 22.2%)、呼吸频率较低[RR;次/分钟:17.0(14.0,20.0) vs. 19.0(15.0,22.0)]、心率较低[HR;次/分钟:80.0(67.0,89.0) vs. 82.0(71.5,93.0)]、血压较高[mmHg(1 mmHg≈0.133 kPa):80.0(70.7,90.0) vs. 78.0(70.0,88.0)]、血红蛋白(Hb)、血糖、血钾水平和血清肌酐[SCr;Hb(g/L):122.0(109.0,136.0) vs. 120.0(106.0,135.0),血糖(mmol/L):7.3(6.1,8.9) vs. 6.8(5.7,8.5),血钾水平(mmol/L):4.2(3.9,4.7) vs. 4.2(3.8,4.6),SCr(μmol/L):88.4(70.7,106.1) vs. 79.6(70.7,97.2)]、白蛋白(ALB)和三酰甘油[TG;ALB(g/L):38.0(35.0,41.0) vs. 39.0(37.0,42.0),TG(mmol/L):1.4(1.0,2.0) vs. 1.5(1.0,2.2)]以及多器官功能障碍综合征(MODS)和脓毒症发生率较高(30.6% vs. 16.2%,3.3% vs. 1.9%),差异均有统计学意义(均P<0.05)。在逻辑回归的输出模型中,重要预测因素为乳酸[Lac;比值比(OR)=1.062,95%置信区间(95%CI)为1.030 - 1.100,P = 0.005]、肥胖(OR = 2.234,95%CI为1.900 - 2.640,P<0.001)、男性(OR = 0.858,95%CI为0.794 - 0.928,P = 0.049)、糖尿病(OR = 1.820,95%CI为1.680 - 1.980,P<0.001)和急诊入院(OR = 1.278,95%CI为1.190 - 1.380,P<0.001)。受试者操作特征曲线(ROC曲线)分析显示,逻辑回归模型预测心脏手术后AKI的AUC为0.62(95%CI为0.61 - 0.67)。通过网格搜索结合五折交叉验证对XGBoost模型参数进行优化后,模型训练良好,无过拟合或欠拟合。ROC分析显示,XGBoost模型预测心脏手术后AKI的AUC为0.77(95%CI为0.75 - 0.80),显著高于逻辑回归模型(P<0.01)。经过SHAP处理后,在XGBoost的输出模型中,年龄和ALB是最终结局的最重要预测因素,其中年龄是危险因素(平均|SHAP值|为0.434),ALB是保护因素(平均|SHAP值|为0.221)。

结论

年龄是心脏手术后AKI的重要危险因素,ALB是保护因素。机器学习在预测心脏和血管手术相关AKI方面的性能优于传统逻辑回归。XGBoost能够分析变量与结局之间更复杂的关系,能更准确、个体化地预测术后AKI风险。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验