Suppr超能文献

Risk-based evaluation of machine learning-based classification methods used for medical devices.

作者信息

Haimerl Martin, Reich Christoph

机构信息

Furtwangen University of Applied Sciences, Furtwangen, Germany.

出版信息

BMC Med Inform Decis Mak. 2025 Mar 11;25(1):126. doi: 10.1186/s12911-025-02909-9.

Abstract

BACKGROUND

In the future, more medical devices will be based on machine learning (ML) methods. In general, the consideration of risks is a crucial aspect for evaluating medical devices. Accordingly, risks and their associated costs should be taken into account when assessing the performance of ML-based medical devices. This paper addresses the following three research questions towards a risk-based evaluation with a focus on ML-based classification models.

METHODS

First, we analyzed how often risk-based metrics are currently utilized in the context of ML-based classification models. This was performed using a literature research based on a sample of recent scientific publications. Second, we introduce an approach for evaluating such models where expected risks and associated costs are integrated into the corresponding performance metrics. Additionally, we analyze the impact of different risk ratios on the resulting overall performance. Third, we elaborate how such risk-based approaches relate to regulatory requirements in the field of medical devices. A set of use case scenarios were utilized to demonstrate necessities and practical implications, in this regard.

RESULTS

First, it was shown that currently most scientific publications do not include risk-based approaches for measuring performance. Second, it was demonstrated that risk-based considerations have a substantial impact on the outcome. The relative increase of the resulting overall risks can go up to 196% when the ratio between different types of risks (false negatives vs. false positives) changes by a factor of 10.0. Third, we elaborated that risk-based considerations need to be included into the assessment of ML-based medical devices, according to the relevant EU regulations and standards. In particular, this applies when a substantial impact on the clinical outcome / in terms of the risk-benefit relationship occurs.

CONCLUSION

In summary, we demonstrated the necessity of a risk-based approach for the evaluation of medical devices which include ML-based classification methods. We showed that currently many scientific papers in this area do not include risk considerations. We developed basic steps towards a risk-based assessment of ML-based classifiers and elaborated consequences that could occur, when these steps are neglected. And, we demonstrated the consistency of our approach with current regulatory requirements in the EU.

摘要

相似文献

1
Risk-based evaluation of machine learning-based classification methods used for medical devices.
BMC Med Inform Decis Mak. 2025 Mar 11;25(1):126. doi: 10.1186/s12911-025-02909-9.
4
The future of Cochrane Neonatal.
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
5
[Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)].
Zhonghua Jie He He Hu Xi Za Zhi. 2024 Feb 12;47(2):101-119. doi: 10.3760/cma.j.cn112147-20231019-00247.
7
10
Review of Machine Learning Techniques in Soft Tissue Biomechanics and Biomaterials.
Cardiovasc Eng Technol. 2024 Oct;15(5):522-549. doi: 10.1007/s13239-024-00737-y. Epub 2024 Jul 2.

本文引用的文献

1
Metrics reloaded: recommendations for image analysis validation.
Nat Methods. 2024 Feb;21(2):195-212. doi: 10.1038/s41592-023-02151-z. Epub 2024 Feb 12.
2
Comparison of Classification Success Rates of Different Machine Learning Algorithms in the Diagnosis of Breast Cancer.
Asian Pac J Cancer Prev. 2022 Oct 1;23(10):3287-3297. doi: 10.31557/APJCP.2022.23.10.3287.
3
Efficient Model for Coronary Artery Disease Diagnosis: A Comparative Study of Several Machine Learning Algorithms.
J Healthc Eng. 2022 Oct 18;2022:5359540. doi: 10.1155/2022/5359540. eCollection 2022.
4
Predictive Analysis of Diabetes-Risk with Class Imbalance.
Comput Intell Neurosci. 2022 Oct 11;2022:3078025. doi: 10.1155/2022/3078025. eCollection 2022.
5
In-hospital risk stratification algorithm of Asian elderly patients.
Sci Rep. 2022 Oct 20;12(1):17592. doi: 10.1038/s41598-022-18839-9.
6
Detecting and Analyzing Suicidal Ideation on Social Media Using Deep Learning and Machine Learning Models.
Int J Environ Res Public Health. 2022 Oct 3;19(19):12635. doi: 10.3390/ijerph191912635.
7
Automated assessment of balance: A neural network approach based on large-scale balance function data.
Front Public Health. 2022 Sep 21;10:882811. doi: 10.3389/fpubh.2022.882811. eCollection 2022.
9
Machine-learning-derived predictive score for early estimation of COVID-19 mortality risk in hospitalized patients.
PLoS One. 2022 Sep 22;17(9):e0274171. doi: 10.1371/journal.pone.0274171. eCollection 2022.
10
Application of machine learning algorithms in predicting HIV infection among men who have sex with men: Model development and validation.
Front Public Health. 2022 Aug 25;10:967681. doi: 10.3389/fpubh.2022.967681. eCollection 2022.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验