Zapata-Cortes Orlando, Arango-Serna Martin Darío, Zapata-Cortes Julian Andres, Restrepo-Carmona Jaime Alonso
Instituto Tecnológico Metropolitano, Medellín 050034, Colombia.
Facultad de Minas, Universidad Nacional de Colombia, Medellín 050034, Colombia.
Sensors (Basel). 2024 Jul 18;24(14):4678. doi: 10.3390/s24144678.
From the various perspectives of machine learning (ML) and the multiple models used in this discipline, there is an approach aimed at training models for the early detection (ED) of anomalies. The early detection of anomalies is crucial in multiple areas of knowledge since identifying and classifying them allows for early decision making and provides a better response to mitigate the negative effects caused by late detection in any system. This article presents a literature review to examine which machine learning models (MLMs) operate with a focus on ED in a multidisciplinary manner and, specifically, how these models work in the field of fraud detection. A variety of models were found, including Logistic Regression (LR), Support Vector Machines (SVMs), decision trees (DTs), Random Forests (RFs), naive Bayesian classifier (NB), K-Nearest Neighbors (KNNs), artificial neural networks (ANNs), and Extreme Gradient Boosting (XGB), among others. It was identified that MLMs operate as isolated models, categorized in this article as Single Base Models (SBMs) and Stacking Ensemble Models (SEMs). It was identified that MLMs for ED in multiple areas under SBMs' and SEMs' implementation achieved accuracies greater than 80% and 90%, respectively. In fraud detection, accuracies greater than 90% were reported by the authors. The article concludes that MLMs for ED in multiple applications, including fraud, offer a viable way to identify and classify anomalies robustly, with a high degree of accuracy and precision. MLMs for ED in fraud are useful as they can quickly process large amounts of data to detect and classify suspicious transactions or activities, helping to prevent financial losses.
从机器学习(ML)的不同视角以及该学科中使用的多种模型来看,有一种方法旨在训练用于异常早期检测(ED)的模型。异常的早期检测在多个知识领域都至关重要,因为识别和分类异常有助于早期决策,并能更好地应对,以减轻任何系统中因检测延迟而造成的负面影响。本文进行了文献综述,以研究哪些机器学习模型(MLMs)以多学科方式专注于早期检测进行运作,特别是这些模型在欺诈检测领域是如何工作的。研究发现了多种模型,包括逻辑回归(LR)、支持向量机(SVMs)、决策树(DTs)、随机森林(RFs)、朴素贝叶斯分类器(NB)、K近邻(KNNs)、人工神经网络(ANNs)以及极端梯度提升(XGB)等。研究发现,MLMs作为孤立模型运行,在本文中被归类为单基模型(SBMs)和堆叠集成模型(SEMs)。研究发现,在SBMs和SEMs实施下,多个领域中用于早期检测的MLMs分别实现了大于80%和90%的准确率。在欺诈检测中,作者报告的准确率大于90%。本文得出结论,包括欺诈在内的多种应用中用于早期检测的MLMs提供了一种可行的方法,能够以高度的准确性和精确性稳健地识别和分类异常。欺诈检测中用于早期检测的MLMs很有用,因为它们可以快速处理大量数据,以检测和分类可疑交易或活动,有助于防止财务损失。