Shu Peng, Huang Ling, Wang Xia, Wen Zhuping, Luo Yiqi, Xu Fang
The Central Hospital of Wuhan, Tongji Medical College, Huazhong University of Science and Technology, No.26, Shengli Street, Jiang'an District, Wuhan, Hubei Province, China.
BMC Nephrol. 2025 Jul 1;26(1):304. doi: 10.1186/s12882-025-04201-4.
Thrombosis of arteriovenous fistulas represents a prevalent complication among patients undergoing hemodialysis, characterized by a notably high incidence rate. Presently, there is an absence of robust assessment tools capable of predicting thrombosis occurrence. This study seeks to develop an interpretable machine learning model to forecast the risk of arteriovenous fistula thrombosis.
Clinical data were retrospectively collected from 1,168 patients who received hemodialysis via arteriovenous fistulas at The Central Hospital of Wuhan between January 2017 and October 2024. A comprehensive analysis of 55 features was conducted utilizing Python. The dataset was partitioned into a training set and a test set, comprising 70% and 30% of the samples, respectively. Six distinct machine learning models-namely, Random Forest, Extreme Gradient Boosting, Decision Tree, Logistic Regression, K-Nearest Neighbors, and Naive Bayes-were constructed to predict the risk of thrombosis in arteriovenous fistulas. The performance of these models was assessed utilizing several metrics, including the F1 score, precision, specificity, accuracy, area under the receiver operating characteristic curve, and recall rate. The contribution of each feature within the most effective model was evaluated using SHAP values, and a specific case was selected to demonstrate the model's predictive capability.
The study encompassed a cohort of 974 patients, each characterized by 55 clinical data features. Among the six machine learning models evaluated, the Random Forest model demonstrated superior performance, achieving an AUC-ROC of 0.984. SHAP visualization analysis identified the number of surgeries, stenosis, free fatty acids, platelet count, and C-reactive protein as the five most significant features influencing the risk of arteriovenous fistula thrombosis.
We developed a Random Forest model based on patients' clinical data, which effectively predicts the risk of thrombosis in arteriovenous fistulas. SHAP analysis offers the potential to inform personalized and evidence-based nursing interventions for healthcare professionals.
动静脉内瘘血栓形成是血液透析患者中常见的并发症,发病率显著较高。目前,缺乏能够预测血栓形成的强大评估工具。本研究旨在开发一种可解释的机器学习模型,以预测动静脉内瘘血栓形成的风险。
回顾性收集2017年1月至2024年10月在武汉市中心医院通过动静脉内瘘进行血液透析的1168例患者的临床资料。利用Python对55个特征进行综合分析。数据集被分为训练集和测试集,分别包含70%和30%的样本。构建了六种不同的机器学习模型,即随机森林、极端梯度提升、决策树、逻辑回归、K近邻和朴素贝叶斯,以预测动静脉内瘘血栓形成的风险。使用包括F1分数、精确率、特异性、准确率、受试者工作特征曲线下面积和召回率等多个指标评估这些模型的性能。使用SHAP值评估最有效模型中每个特征的贡献,并选择一个具体案例来展示模型的预测能力。
该研究纳入了974例患者,每例患者具有55个临床数据特征。在评估的六种机器学习模型中,随机森林模型表现出卓越的性能,AUC-ROC达到0.984。SHAP可视化分析确定手术次数、狭窄、游离脂肪酸、血小板计数和C反应蛋白是影响动静脉内瘘血栓形成风险的五个最重要特征。
我们基于患者的临床数据开发了一种随机森林模型,该模型有效地预测了动静脉内瘘血栓形成的风险。SHAP分析为医疗保健专业人员提供了为个性化和循证护理干预提供信息的潜力。