基于监测、流行病学和最终结果数据库评估用于乳腺癌预后的机器学习算法。

Evaluation of machine learning algorithms for the prognosis of breast cancer from the Surveillance, Epidemiology, and End Results database.

机构信息

Department of Breast and Thyroid Surgery, Sichuan Provincial Hospital for Women and Children (Affiliated Women and Children's Hospital of Chengdu Medical College), Chengdu, China.

出版信息

PLoS One. 2023 Jan 26;18(1):e0280340. doi: 10.1371/journal.pone.0280340. eCollection 2023.

DOI:10.1371/journal.pone.0280340

PMID:36701415

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9879508/

Abstract

INTRODUCTION

Many researchers used machine learning (ML) to predict the prognosis of breast cancer (BC) patients and noticed that the ML model had good individualized prediction performance.

OBJECTIVE

The cohort study was intended to establish a reliable data analysis model by comparing the performance of 10 common ML algorithms and the the traditional American Joint Committee on Cancer (AJCC) stage, and used this model in Web application development to provide a good individualized prediction for others.

METHODS

This study included 63145 BC patients from the Surveillance, Epidemiology, and End Results database.

RESULTS

Through the performance of the 10 ML algorithms and 7th AJCC stage in the optimal test set, we found that in terms of 5-year overall survival, multivariate adaptive regression splines (MARS) had the highest area under the curve (AUC) value (0.831) and F1-score (0.608), and both sensitivity (0.737) and specificity (0.772) were relatively high. Besides, MARS showed a highest AUC value (0.831, 95%confidence interval: 0.820-0.842) in comparison to the other ML algorithms and 7th AJCC stage (all P < 0.05). MARS, the best performing model, was selected for web application development (https://w12251393.shinyapps.io/app2/).

CONCLUSIONS

The comparative study of multiple forecasting models utilizing a large data noted that MARS based model achieved a much better performance compared to other ML algorithms and 7th AJCC stage in individualized estimation of survival of BC patients, which was very likely to be the next step towards precision medicine.

摘要

简介

许多研究人员使用机器学习（ML）来预测乳腺癌（BC）患者的预后，并注意到 ML 模型具有良好的个体化预测性能。

目的

本队列研究旨在通过比较 10 种常见 ML 算法和传统的美国癌症联合委员会（AJCC）分期的性能，建立一个可靠的数据分析模型，并将该模型用于 Web 应用程序开发，为他人提供良好的个体化预测。

方法

本研究纳入了来自监测、流行病学和最终结果（SEER）数据库的 63145 例 BC 患者。

结果

通过在最优测试集中对 10 种 ML 算法和 7 版 AJCC 分期的性能进行评估，我们发现，在 5 年总生存率方面，多元自适应回归样条（MARS）的曲线下面积（AUC）值最高（0.831），F1 评分（0.608）最高，且灵敏度（0.737）和特异性（0.772）均较高。此外，MARS 与其他 ML 算法和 7 版 AJCC 分期相比，AUC 值最高（0.831，95%置信区间：0.820-0.842，均 P < 0.05）。选择性能最佳的 MARS 模型进行 Web 应用程序开发（https://w12251393.shinyapps.io/app2/）。

结论

利用大数据对多个预测模型进行比较研究表明，与其他 ML 算法和 7 版 AJCC 分期相比，基于 MARS 的模型在 BC 患者生存个体化估计方面具有更好的性能，这很可能是迈向精准医学的下一步。

相似文献

Evaluation of machine learning algorithms for the prognosis of breast cancer from the Surveillance, Epidemiology, and End Results database.

PLoS One. 2023 Jan 26;18(1):e0280340. doi: 10.1371/journal.pone.0280340. eCollection 2023.

An Online Calculator for the Prediction of Survival in Glioblastoma Patients Using Classical Statistics and Machine Learning.

Neurosurgery. 2020 Feb 1;86(2):E184-E192. doi: 10.1093/neuros/nyz403.

Stage-Specific Survival in Breast Cancer in Chinese and White Women: Comparative Data Analysis.

JMIR Public Health Surveill. 2022 Nov 15;8(11):e40386. doi: 10.2196/40386.

Predicting Survival of Patients With Rectal Neuroendocrine Tumors Using Machine Learning: A SEER-Based Population Study.

Front Surg. 2021 Nov 3;8:745220. doi: 10.3389/fsurg.2021.745220. eCollection 2021.

Development and validation of a machine learning model to predict the risk of lymph node metastasis in renal carcinoma.

Front Endocrinol (Lausanne). 2022 Nov 18;13:1054358. doi: 10.3389/fendo.2022.1054358. eCollection 2022.

Surgical Methods and Social Factors Are Associated With Long-Term Survival in Follicular Thyroid Carcinoma: Construction and Validation of a Prognostic Model Based on Machine Learning Algorithms.

Front Oncol. 2022 Jun 21;12:816427. doi: 10.3389/fonc.2022.816427. eCollection 2022.

Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database.

J Cancer. 2018 Oct 10;9(21):3971-3978. doi: 10.7150/jca.26649. eCollection 2018.

Development and Internal Validation of Machine Learning Algorithms for Preoperative Survival Prediction of Extremity Metastatic Disease.

Clin Orthop Relat Res. 2020 Feb;478(2):322-333. doi: 10.1097/CORR.0000000000000997.

The Development and Validation of Simplified Machine Learning Algorithms to Predict Prognosis of Hospitalized Patients With COVID-19: Multicenter, Retrospective Study.

J Med Internet Res. 2022 Jan 21;24(1):e31549. doi: 10.2196/31549.

Development of machine learning model algorithm for prediction of 5-year soft tissue myxoid liposarcoma survival.

J Surg Oncol. 2021 Jun;123(7):1610-1617. doi: 10.1002/jso.26398. Epub 2021 Mar 8.

引用本文的文献

Leveraging Digital Twins for Stratification of Patients with Breast Cancer and Treatment Optimization in Geriatric Oncology: Multivariate Clustering Analysis.

JMIR Cancer. 2025 May 23;11:e64000. doi: 10.2196/64000.

Classification and Diagnostic Prediction of Colorectal Cancer Mortality Based on Machine Learning Algorithms: A Multicenter National Study.

Asian Pac J Cancer Prev. 2024 Jan 1;25(1):333-342. doi: 10.31557/APJCP.2024.25.1.333.

本文引用的文献

Deep Learning and Machine Learning with Grid Search to Predict Later Occurrence of Breast Cancer Metastasis Using Clinical Data.

J Clin Med. 2022 Sep 29;11(19):5772. doi: 10.3390/jcm11195772.

Breast cancer detection using deep learning: Datasets, methods, and challenges ahead.

Comput Biol Med. 2022 Oct;149:106073. doi: 10.1016/j.compbiomed.2022.106073. Epub 2022 Aug 31.

Predicting Breast Cancer Leveraging Supervised Machine Learning Techniques.

Comput Math Methods Med. 2022 Aug 16;2022:5869529. doi: 10.1155/2022/5869529. eCollection 2022.

Multimodal Prediction of Five-Year Breast Cancer Recurrence in Women Who Receive Neoadjuvant Chemotherapy.

Cancers (Basel). 2022 Aug 9;14(16):3848. doi: 10.3390/cancers14163848.

Deep learning for survival analysis in breast cancer with whole slide image data.

Bioinformatics. 2022 Jul 11;38(14):3629-3637. doi: 10.1093/bioinformatics/btac381.

The impact of chemotherapy and survival prediction by machine learning in early Elderly Triple Negative Breast Cancer (eTNBC): a population based study from the SEER database.

BMC Geriatr. 2022 Apr 1;22(1):268. doi: 10.1186/s12877-022-02936-5.

The Application and Comparison of Machine Learning Models for the Prediction of Breast Cancer Prognosis: Retrospective Cohort Study.

JMIR Med Inform. 2022 Feb 18;10(2):e33440. doi: 10.2196/33440.

Breast Cancer Surgery 10-Year Survival Prediction by Machine Learning: A Large Prospective Cohort Study.

Biology (Basel). 2021 Dec 29;11(1):47. doi: 10.3390/biology11010047.

Hyperparameter Tuning and Pipeline Optimization via Grid Search Method and Tree-Based AutoML in Breast Cancer Prediction.

J Pers Med. 2021 Sep 29;11(10):978. doi: 10.3390/jpm11100978.

Machine-learning algorithms predict breast cancer patient survival from UK Biobank whole-exome sequencing data.

Biomark Med. 2021 Nov;15(16):1529-1539. doi: 10.2217/bmm-2021-0280. Epub 2021 Oct 15.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于监测、流行病学和最终结果数据库评估用于乳腺癌预后的机器学习算法。

Evaluation of machine learning algorithms for the prognosis of breast cancer from the Surveillance, Epidemiology, and End Results database.

机构信息

出版信息

INTRODUCTION

OBJECTIVE

METHODS

RESULTS

CONCLUSIONS

简介

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献