机器学习在医疗保险索赔中预测全膝关节置换术后 30 天内非计划性再入院的个体风险效果不佳，但却揭示了与年度手术量有关的有趣的人群水平关联。

Machine Learning on Medicare Claims Poorly Predicts the Individual Risk of 30-Day Unplanned Readmission After Total Joint Arthroplasty, Yet Uncovers Interesting Population-level Associations With Annual Procedure Volumes.

机构信息

Department of Orthopaedic Surgery, Hospital for Special Surgery, New York, NY, USA.

Center for Analytics, Modeling, and Performance, Hospital for Special Surgery, New York, NY, USA.

出版信息

Clin Orthop Relat Res. 2023 Sep 1;481(9):1745-1759. doi: 10.1097/CORR.0000000000002705. Epub 2023 May 31.

DOI:10.1097/CORR.0000000000002705

PMID:37256278

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10427054/

Abstract

BACKGROUND

Unplanned hospital readmissions after total joint arthroplasty (TJA) represent potentially serious adverse events and remain a critical measure of hospital quality. Predicting the risk of readmission after TJA may provide patients and clinicians with valuable information for preoperative decision-making.

QUESTIONS/PURPOSES: (1) Can nonlinear machine-learning models integrating preoperatively available patient, surgeon, hospital, and county-level information predict 30-day unplanned hospital readmissions in a large cohort of nationwide Medicare beneficiaries undergoing TJA? (2) Which predictors are the most important in predicting 30-day unplanned hospital readmissions? (3) What specific information regarding population-level associations can we obtain from interpreting partial dependency plots (plots describing, given our modeling choice, the potentially nonlinear shape of associations between predictors and readmissions) of the most important predictors of 30-day readmission?

METHODS

National Medicare claims data (chosen because this database represents a large proportion of patients undergoing TJA annually) were analyzed for patients undergoing inpatient TJA between October 2016 and September 2018. A total of 679,041 TJAs (239,391 THAs [61.3% women, 91.9% White, 52.6% between 70 and 79 years old] and 439,650 TKAs [63.3% women, 90% White, 55.2% between 70 and 79 years old]) were included. Model features included demographics, county-level social determinants of health, prior-year (365-day) hospital and surgeon TJA procedure volumes, and clinical classification software-refined diagnosis and procedure categories summarizing each patient's Medicare claims 365 days before TJA. Machine-learning models, namely generalized additive models with pairwise interactions (prediction models consisting of both univariate predictions and pairwise interaction terms that allow for nonlinear effects), were trained and evaluated for predictive performance using area under the receiver operating characteristic (AUROC; 1.0 = perfect discrimination, 0.5 = no better than random chance) and precision-recall curves (AUPRC; equivalent to the average positive predictive value, which does not give credit for guessing "no readmission" when this is true most of the time, interpretable relative to the base rate of readmissions) on two holdout samples. All admissions (except the last 2 months' worth) were collected and split randomly 80%/20%. The training cohort was formed with the random 80% sample, which was downsampled (so it included all readmissions and a random, equal number of nonreadmissions). The random 20% sample served as the first test cohort ("random holdout"). The last 2 months of admissions (originally held aside) served as the second test cohort ("2-month holdout"). Finally, feature importances (the degree to which each variable contributed to the predictions) and partial dependency plots were investigated to answer the second and third research questions.

RESULTS

For the random holdout sample, model performance values in terms of AUROC and AUPRC were 0.65 and 0.087, respectively, for THA and 0.66 and 0.077, respectively, for TKA. For the 2-month holdout sample, these numbers were 0.66 and 0.087 and 0.65 and 0.075. Thus, our nonlinear models incorporating a wide variety of preoperative features from Medicare claims data could not well-predict the individual likelihood of readmissions (that is, the models performed poorly and are not appropriate for clinical use). The most predictive features (in terms of mean absolute scores) and their partial dependency graphs still confer information about population-level associations with increased risk of readmission, namely with older patient age, low prior 365-day surgeon and hospital TJA procedure volumes, being a man, patient history of cardiac diagnoses and lack of oncologic diagnoses, and higher county-level rates of hospitalizations for ambulatory-care sensitive conditions. Further inspection of partial dependency plots revealed nonlinear population-level associations specifically for surgeon and hospital procedure volumes. The readmission risk for THA and TKA decreased as surgeons performed more procedures in the prior 365 days, up to approximately 75 TJAs (odds ratio [OR] = 1.2 for TKA and 1.3 for THA), but no further risk reduction was observed for higher annual surgeon procedure volumes. For THA, the readmission risk decreased as hospitals performed more procedures, up to approximately 600 TJAs (OR = 1.2), but no further risk reduction was observed for higher annual hospital procedure volumes.

CONCLUSION

A large dataset of Medicare claims and machine learning were inadequate to provide a clinically useful individual prediction model for 30-day unplanned readmissions after TKA or THA, suggesting that other factors that are not routinely collected in claims databases are needed for predicting readmissions. Nonlinear population-level associations between low surgeon and hospital procedure volumes and increased readmission risk were identified, including specific volume thresholds above which the readmission risk no longer decreases, which may still be indirectly clinically useful in guiding policy as well as patient decision-making when selecting a hospital or surgeon for treatment.

LEVEL OF EVIDENCE

Level III, therapeutic study.

摘要

背景

全膝关节置换术（TJA）后计划外的住院再入院代表潜在的严重不良事件，仍然是医院质量的关键衡量标准。预测 TJA 后再入院的风险可能为患者和临床医生提供术前决策的有价值信息。

问题/目的：（1）能否通过整合术前可获得的患者、外科医生、医院和县级信息的非线性机器学习模型来预测在接受 TJA 的全国性医疗保险受益人群中的大型队列中 30 天内计划外的医院再入院？（2）哪些预测因素对预测 30 天内医院再入院最重要？（3）通过解释 30 天再入院最重要预测因素的部分依赖关系图（描述我们的建模选择与再入院之间的潜在非线性关系的图），我们可以获得有关人群水平关联的哪些具体信息？

方法

分析了 2016 年 10 月至 2018 年 9 月期间接受住院 TJA 的全国性医疗保险索赔数据（选择该数据库是因为它代表了每年接受 TJA 的患者的很大一部分）。共纳入 679041 例 TJA（239391 例全髋关节置换术[61.3%女性，91.9%白人，52.6%年龄在 70 至 79 岁之间]和 439650 例膝关节置换术[63.3%女性，90%白人，55.2%年龄在 70 至 79 岁之间]）。模型特征包括人口统计学特征、县级社会健康决定因素、前一年（365 天）医院和外科医生 TJA 手术量以及临床分类软件细化的诊断和手术类别，这些类别总结了每位患者在 TJA 前 365 天的医疗保险索赔。机器学习模型，即具有成对交互作用的广义加性模型（由单变量预测和成对交互项组成的预测模型，允许存在非线性效应），使用接收器操作特征曲线下面积（AUROC；1.0=完美区分，0.5=不如随机机会）和精度-召回曲线（AUPRC；等效于平均阳性预测值，当大多数时间实际上没有“再入院”时，不会因猜测“无再入院”而给予信贷，相对于再入院的基础率可解释）在两个保留样本上进行了训练和评估。收集并随机分为 80%/20%的所有入院（最后两个月的除外）。随机 80%样本形成训练队列，对其进行了下采样（以便包括所有再入院和随机数量相同的非再入院）。随机 20%的样本作为第一个测试队列（“随机保留队列”）。最初保留的最后两个月的入院情况作为第二个测试队列（“2 个月保留队列”）。最后，研究了特征重要性（每个变量对预测的贡献程度）和部分依赖关系图，以回答第二个和第三个研究问题。

结果

对于随机保留样本，AUROC 和 AUPRC 指标的模型性能值分别为 THA 的 0.65 和 0.087，TKA 的 0.66 和 0.077。对于 2 个月保留样本，这些数字分别为 0.66 和 0.087，TKA 的 0.65 和 0.075。因此，我们纳入了来自医疗保险索赔数据的广泛术前特征的非线性模型无法很好地预测个人再入院的可能性（即，模型表现不佳，不适合临床使用）。最重要的预测特征（以平均绝对分数表示）及其部分依赖图仍然提供了有关再入院风险增加的人群水平关联的信息，即患者年龄较大、前一年外科医生和医院 TJA 手术量较低、为男性、有心脏病诊断和缺乏肿瘤诊断、以及县级医院因门诊保健敏感条件而住院的比例较高。对部分依赖图的进一步检查显示了特定的人群水平关联，具体来说是外科医生和医院手术量的非线性关联。THA 和 TKA 的再入院风险随着外科医生在前一年进行的手术数量增加而降低，直到大约 75 例 TJA（TKA 的 OR=1.2，THA 的 OR=1.3），但对于更高的年度外科医生手术量，再入院风险没有进一步降低。对于 THA，随着医院进行的手术数量增加，再入院风险降低，直到大约 600 例 TJA（OR=1.2），但对于更高的年度医院手术量，再入院风险没有进一步降低。

结论

大量的医疗保险索赔和机器学习数据不足以提供 30 天内 TKA 或 THA 后计划外再入院的临床有用的个体预测模型，这表明需要常规收集索赔数据库中未收集的其他因素来预测再入院。确定了低外科医生和医院手术量与增加的再入院风险之间的非线性人群水平关联，包括再入院风险不再降低的特定体积阈值，这在指导政策以及患者在选择治疗医院或外科医生时的决策时可能仍然具有间接的临床意义。

证据水平

III 级，治疗性研究。

相似文献

Machine Learning on Medicare Claims Poorly Predicts the Individual Risk of 30-Day Unplanned Readmission After Total Joint Arthroplasty, Yet Uncovers Interesting Population-level Associations With Annual Procedure Volumes.

Clin Orthop Relat Res. 2023 Sep 1;481(9):1745-1759. doi: 10.1097/CORR.0000000000002705. Epub 2023 May 31.

Which Clinical and Patient Factors Influence the National Economic Burden of Hospital Readmissions After Total Joint Arthroplasty?

Clin Orthop Relat Res. 2017 Dec;475(12):2926-2937. doi: 10.1007/s11999-017-5244-6.

Preoperative Activities of Daily Living Dependency is Associated With Higher 30-Day Readmission Risk for Older Adults After Total Joint Arthroplasty.

Clin Orthop Relat Res. 2020 Feb;478(2):231-237. doi: 10.1097/CORR.0000000000001040.

High-volume Arthroplasty Centers Are Associated With Lower Hospital Costs When Performing Primary THA and TKA: A Database Study of 288,909 Medicare Claims for Procedures Performed in 2019.

Clin Orthop Relat Res. 2023 May 1;481(5):1025-1036. doi: 10.1097/CORR.0000000000002470. Epub 2022 Nov 7.

RAPT Scores Predict Inpatient Versus Outpatient Status and Readmission Rates After IPO Changes for Total Joint Arthroplasty: An Analysis of 12,348 Cases.

J Arthroplasty. 2022 Nov;37(11):2140-2148. doi: 10.1016/j.arth.2022.05.037. Epub 2022 May 19.

What Is the Association Between Hospital Volume and Complications After Revision Total Joint Arthroplasty: A Large-database Study.

Clin Orthop Relat Res. 2019 May;477(5):1221-1231. doi: 10.1097/CORR.0000000000000684.

Machine-learning Models Predict 30-Day Mortality, Cardiovascular Complications, and Respiratory Complications After Aseptic Revision Total Joint Arthroplasty.

Clin Orthop Relat Res. 2022 Nov 1;480(11):2137-2145. doi: 10.1097/CORR.0000000000002276. Epub 2022 Jun 20.

Will Medicare Readmission Penalties Motivate Hospitals to Reduce Arthroplasty Readmissions?

J Arthroplasty. 2017 Mar;32(3):709-713. doi: 10.1016/j.arth.2016.08.031. Epub 2016 Aug 31.

Increased rates of periprosthetic joint infection in patients with cirrhosis undergoing total joint arthroplasty.

Clin Orthop Relat Res. 2014 Aug;472(8):2483-91. doi: 10.1007/s11999-014-3593-y. Epub 2014 Apr 8.

Patterns and Costs of 90-Day Readmission for Surgical and Medical Complications Following Total Hip and Knee Arthroplasty.

J Arthroplasty. 2019 Oct;34(10):2304-2307. doi: 10.1016/j.arth.2019.05.046. Epub 2019 Jun 1.

引用本文的文献

Development of Explainable Machine Learning Models to Predict Outcomes After Platelet-Rich Plasma Injections for Knee Osteoarthritis.

Orthop J Sports Med. 2025 Aug 7;13(8):23259671251349743. doi: 10.1177/23259671251349743. eCollection 2025 Aug.

A Proposed Conceptual Framework and Narrative Systematic Review of the Social Determinants of Health in Hip and Knee Arthroplasty.

J Am Acad Orthop Surg. 2025 Jul 18. doi: 10.5435/JAAOS-D-25-00089.

Comparing Community-Level Social Determinants of Health With Patient Race in Total Hip Arthroplasty Outcomes.

Arthritis Care Res (Hoboken). 2025 Jul;77(7):892-899. doi: 10.1002/acr.25511. Epub 2025 Mar 21.

Explainable artificial intelligence in orthopedic surgery.

J Exp Orthop. 2024 Jul 17;11(3):e12103. doi: 10.1002/jeo2.12103. eCollection 2024 Jul.

Postoperative Communication Volume Following Total Joint Arthroplasty Can Be a Precursor for Emergency Department Visits.

Arthroplast Today. 2024 Apr 23;27:101352. doi: 10.1016/j.artd.2024.101352. eCollection 2024 Jun.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

机器学习在医疗保险索赔中预测全膝关节置换术后 30 天内非计划性再入院的个体风险效果不佳，但却揭示了与年度手术量有关的有趣的人群水平关联。

Machine Learning on Medicare Claims Poorly Predicts the Individual Risk of 30-Day Unplanned Readmission After Total Joint Arthroplasty, Yet Uncovers Interesting Population-level Associations With Annual Procedure Volumes.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

LEVEL OF EVIDENCE

背景

方法

结果

结论

证据水平

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献