Department of Radiology, Sagara Perth Avenue Clinic, 26-13 Shinyashiki-cho, Kagoshima City, Kagoshima, 892-0838, Japan.
Department of Radiology, Sagara Hospital, 3-31 Matsubara-cho, Kagoshima City, Kagoshima, 892-0833, Japan.
Breast Cancer. 2020 Jul;27(4):642-651. doi: 10.1007/s12282-020-01061-8. Epub 2020 Feb 12.
To compare the breast cancer detection performance in digital mammograms of a panel of three unaided human readers (HR) versus a stand-alone artificial intelligence (AI)-based Transpara system in a population of Japanese women.
The subjects were 310 Japanese female outpatients who underwent digital mammographic examinations between January 2018 and October 2018. A panel of three HR provided a Breast Imaging Reporting and Data System (BI-RADS) score, and Transpara system provided an interactive decision support score and an examination-based cancer likelihood score. The area under the receiver operating characteristic curve (AUC), sensitivity, and specificity were compared under each of reading conditions.
The AUC was higher for human readers than with stand-alone Transpara system (human readers 0.816; Transpara system 0.706; difference 0.11; P < 0.001). The sensitivity of the unaided HR for diagnosis was 89% and specificity was 86%. The sensitivity of stand-alone Transpara system for cutoff scores of 4 and 7 were 93% and 85%, and specificities were 45% and 67%, respectively.
Although the diagnostic performance of Transpara system was statistically lower than that of HR, the recent advances in AI algorithms are expected to reduce the difference between computers and human experts in detecting breast cancer.
本研究旨在比较一组未经训练的三位人类读者(HR)与独立的人工智能(AI)辅助 Transpara 系统在日本女性人群中对乳腺癌检测性能的影响。
本研究共纳入 2018 年 1 月至 2018 年 10 月间进行数字乳腺 X 线摄影检查的 310 例日本女性门诊患者。一组三位 HR 提供了乳腺影像报告和数据系统(BI-RADS)评分,Transpara 系统提供了交互式决策支持评分和基于检查的癌症可能性评分。在每种阅读条件下比较了受试者工作特征曲线(ROC)下的面积(AUC)、敏感性和特异性。
与独立的 Transpara 系统相比,人类读者的 AUC 更高(人类读者 0.816;Transpara 系统 0.706;差异 0.11;P < 0.001)。未经训练的 HR 对诊断的敏感性为 89%,特异性为 86%。独立的 Transpara 系统的截断值为 4 和 7 时,敏感性分别为 93%和 85%,特异性分别为 45%和 67%。
尽管 Transpara 系统的诊断性能在统计学上低于 HR,但人工智能算法的最新进展有望缩小计算机与人类专家在检测乳腺癌方面的差异。