Lee Jeong Hoon, Kim Pyeong Hwa, Son Nak-Hoon, Han Kyunghwa, Kang Yeseul, Jeong Sejin, Kim Eun-Kyung, Yoon Haesung, Gatidis Sergios, Vasanawala Shreyas, Yoon Hee Mang, Shin Hyun Joo
Department of Radiology, Stanford Medicine, Stanford, CA, United States.
Department of Radiology and Research Institute of Radiology, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea.
J Med Internet Res. 2025 Jul 8;27:e72097. doi: 10.2196/72097.
Artificial intelligence (AI) is increasingly used in radiology, but its development in pediatric imaging remains limited, particularly for emergent conditions. Ileocolic intussusception is an important cause of acute abdominal pain in infants and toddlers and requires timely diagnosis to prevent complications such as bowel ischemia or perforation. While ultrasonography is the diagnostic standard due to its high sensitivity and specificity, its accessibility may be limited, especially outside tertiary centers. Abdominal radiographs (AXRs), despite their limited sensitivity, are often the first-line imaging modality in clinical practice. In this context, AI could support early screening and triage by analyzing AXRs and identifying patients who require further ultrasonography evaluation.
This study aimed to upgrade and externally validate an AI model for screening ileocolic intussusception using pediatric AXRs with multicenter data and to assess the diagnostic performance of the model in comparison with radiologists of varying experience levels with and without AI assistance.
This retrospective study included pediatric patients (≤5 years) who underwent both AXRs and ultrasonography for suspected intussusception. Based on the preliminary study from hospital A, the AI model was retrained using data from hospital B and validated with external datasets from hospitals C and D. Diagnostic performance of the upgraded AI model was evaluated using sensitivity, specificity, and the area under the receiver operating characteristic curve (AUC). A reader study was conducted with 3 radiologists, including 2 trainees and 1 pediatric radiologist, to evaluate diagnostic performance with and without AI assistance.
Based on the previously developed AI model trained on 746 patients from hospital A, an additional 431 patients from hospital B (including 143 intussusception cases) were used for further training to develop an upgraded AI model. External validation was conducted using data from hospital C (n=68; 19 intussusception cases) and hospital D (n=90; 30 intussusception cases). The upgraded AI model achieved a sensitivity of 81.7% (95% CI 68.6%-90%) and a specificity of 81.7% (95% CI 73.3%-87.8%), with an AUC of 86.2% (95% CI 79.2%-92.1%) in the external validation set. Without AI assistance, radiologists showed lower performance (overall AUC 64%; sensitivity 49.7%; specificity 77.1%). With AI assistance, radiologists' specificity improved to 93% (difference +15.9%; P<.001), and AUC increased to 79.2% (difference +15.2%; P=.05). The least experienced reader showed the largest improvement in specificity (+37.6%; P<.001) and AUC (+14.7%; P=.08).
The upgraded AI model improved diagnostic performance for screening ileocolic intussusception on pediatric AXRs. It effectively enhanced the specificity and overall accuracy of radiologists, particularly those with less experience in pediatric radiology. A user-friendly software platform was introduced to support broader clinical validation and underscores the potential of AI as a screening and triage tool in pediatric emergency settings.
人工智能(AI)在放射学中的应用日益广泛,但其在儿科影像学中的发展仍然有限,尤其是在紧急情况下。回结肠套叠是婴幼儿急性腹痛的重要原因,需要及时诊断以预防诸如肠缺血或穿孔等并发症。虽然超声检查因其高敏感性和特异性而成为诊断标准,但其可及性可能有限,尤其是在三级医疗中心之外。腹部X线平片(AXR)尽管敏感性有限,但在临床实践中通常是一线成像方式。在此背景下,AI可以通过分析AXR并识别需要进一步超声检查评估的患者来支持早期筛查和分诊。
本研究旨在使用多中心数据升级并外部验证用于筛查回结肠套叠的AI模型,并评估该模型与不同经验水平的放射科医生在有无AI辅助情况下的诊断性能。
这项回顾性研究纳入了因疑似套叠而接受AXR和超声检查的儿科患者(≤5岁)。基于医院A的初步研究,使用医院B的数据对AI模型进行再训练,并使用医院C和D的外部数据集进行验证。使用敏感性、特异性和受试者操作特征曲线下面积(AUC)评估升级后的AI模型的诊断性能。对3名放射科医生(包括2名实习医生和1名儿科放射科医生)进行了一项阅片者研究,以评估有无AI辅助时的诊断性能。
基于先前在医院A的746例患者上训练的AI模型,使用医院B的另外431例患者(包括143例套叠病例)进行进一步训练以开发升级后的AI模型。使用医院C(n = 68;19例套叠病例)和医院D(n = 90;30例套叠病例)的数据进行外部验证。升级后的AI模型在外部验证集中的敏感性为81.7%(95%CI 为68.6% - 90%),特异性为81.7%(95%CI 为73.3% - 87.8%),AUC为86.2%(95%CI 为79.2% - 92.1%)。在没有AI辅助的情况下,放射科医生的表现较低(总体AUC为64%;敏感性为49.7%;特异性为77.1%)。在有AI辅助的情况下,放射科医生的特异性提高到93%(差异 +15.9%;P <.001),AUC增加到79.2%(差异 +15.2%;P =.05)。经验最少的阅片者在特异性(+37.6%;P <.001)和AUC(+14.7%;P =.08)方面的改善最大。
升级后的AI模型提高了在儿科AXR上筛查回结肠套叠的诊断性能。它有效地提高了放射科医生的特异性和总体准确性,尤其是那些在儿科放射学方面经验较少的医生。引入了一个用户友好的软件平台以支持更广泛的临床验证,并强调了AI作为儿科急诊环境中筛查和分诊工具的潜力。