Li Aimin, Liu Yueying, Luo Yufan, Xiao Xue, Xiao Wei, Xie Ruijin, Deng Xianhui, Chen Zhe, Zhou Qian, Gong Yue, Chen Zhen, Xu Hua
Yangzhou Polytechnic College, Yangzhou, China.
Department of Pediatrics, Affiliated Hospital of Jiangnan University, Wuxi, China.
Sci Rep. 2025 May 26;15(1):18336. doi: 10.1038/s41598-025-01874-7.
Tourette syndrome is a relatively prevalent neurological condition, particularly among children, characterized by sudden, involuntary, repetitive movements or vocalizations. Contemporary diagnostic approaches for Tourette syndrome (TS) primarily rely on behavioral assessments, which pose challenges due to symptom overlap with other psychiatric disorders and significant inter-individual variability. Establishing a machine learning-based predictive model for predicting the risk of TS could potentially enhance diagnostic precision and treatment effectiveness. The investigation was conducted at the Department of Pediatrics, Affiliated Hospital of Jiangnan University, spanning the period from January 2022 to October 2024. Clinical data, encompassing complete blood counts, liver and kidney function assessments, blood glucose levels, and serum electrolyte analyses, were collected. Feature selection was conducted using Boruta and multivariable logistic regression analyses, and model construction was undertaken employing 9 distinct machine learning algorithms. 10 distinct features were selected for machine learning algorithm development, and our results indicated that the Gradient Boosting Machine algorithm is the optimal model. Our study successfully established a predictive model for the risk of Tourette syndrome using Gradient Boosting Machine, and the SHAP method highlighted the key roles of β2-microglobulin and serum 25-hydroxyvitamin D levels in predicting TS risk.
抽动秽语综合征是一种相对常见的神经系统疾病,在儿童中尤为常见,其特征为突然的、不自主的、重复性动作或发声。抽动秽语综合征(TS)的当代诊断方法主要依赖行为评估,由于与其他精神疾病症状重叠以及个体间存在显著差异,这些评估面临挑战。建立基于机器学习的预测模型以预测TS风险,可能会提高诊断准确性和治疗效果。该研究在江南大学附属医院儿科进行,时间跨度为2022年1月至2024年10月。收集了包括全血细胞计数、肝肾功能评估、血糖水平和血清电解质分析在内的临床数据。使用Boruta和多变量逻辑回归分析进行特征选择,并采用9种不同的机器学习算法进行模型构建。为机器学习算法开发选择了10个不同的特征,我们的结果表明梯度提升机算法是最佳模型。我们的研究成功地使用梯度提升机建立了抽动秽语综合征风险的预测模型,SHAP方法突出了β2-微球蛋白和血清25-羟基维生素D水平在预测TS风险中的关键作用。