Department of Orthopaedic Surgery, Hospital for Special Surgery, New York, NY, USA.
Center for Analytics, Modeling, and Performance, Hospital for Special Surgery, New York, NY, USA.
Clin Orthop Relat Res. 2023 Sep 1;481(9):1745-1759. doi: 10.1097/CORR.0000000000002705. Epub 2023 May 31.
Unplanned hospital readmissions after total joint arthroplasty (TJA) represent potentially serious adverse events and remain a critical measure of hospital quality. Predicting the risk of readmission after TJA may provide patients and clinicians with valuable information for preoperative decision-making.
QUESTIONS/PURPOSES: (1) Can nonlinear machine-learning models integrating preoperatively available patient, surgeon, hospital, and county-level information predict 30-day unplanned hospital readmissions in a large cohort of nationwide Medicare beneficiaries undergoing TJA? (2) Which predictors are the most important in predicting 30-day unplanned hospital readmissions? (3) What specific information regarding population-level associations can we obtain from interpreting partial dependency plots (plots describing, given our modeling choice, the potentially nonlinear shape of associations between predictors and readmissions) of the most important predictors of 30-day readmission?
National Medicare claims data (chosen because this database represents a large proportion of patients undergoing TJA annually) were analyzed for patients undergoing inpatient TJA between October 2016 and September 2018. A total of 679,041 TJAs (239,391 THAs [61.3% women, 91.9% White, 52.6% between 70 and 79 years old] and 439,650 TKAs [63.3% women, 90% White, 55.2% between 70 and 79 years old]) were included. Model features included demographics, county-level social determinants of health, prior-year (365-day) hospital and surgeon TJA procedure volumes, and clinical classification software-refined diagnosis and procedure categories summarizing each patient's Medicare claims 365 days before TJA. Machine-learning models, namely generalized additive models with pairwise interactions (prediction models consisting of both univariate predictions and pairwise interaction terms that allow for nonlinear effects), were trained and evaluated for predictive performance using area under the receiver operating characteristic (AUROC; 1.0 = perfect discrimination, 0.5 = no better than random chance) and precision-recall curves (AUPRC; equivalent to the average positive predictive value, which does not give credit for guessing "no readmission" when this is true most of the time, interpretable relative to the base rate of readmissions) on two holdout samples. All admissions (except the last 2 months' worth) were collected and split randomly 80%/20%. The training cohort was formed with the random 80% sample, which was downsampled (so it included all readmissions and a random, equal number of nonreadmissions). The random 20% sample served as the first test cohort ("random holdout"). The last 2 months of admissions (originally held aside) served as the second test cohort ("2-month holdout"). Finally, feature importances (the degree to which each variable contributed to the predictions) and partial dependency plots were investigated to answer the second and third research questions.
For the random holdout sample, model performance values in terms of AUROC and AUPRC were 0.65 and 0.087, respectively, for THA and 0.66 and 0.077, respectively, for TKA. For the 2-month holdout sample, these numbers were 0.66 and 0.087 and 0.65 and 0.075. Thus, our nonlinear models incorporating a wide variety of preoperative features from Medicare claims data could not well-predict the individual likelihood of readmissions (that is, the models performed poorly and are not appropriate for clinical use). The most predictive features (in terms of mean absolute scores) and their partial dependency graphs still confer information about population-level associations with increased risk of readmission, namely with older patient age, low prior 365-day surgeon and hospital TJA procedure volumes, being a man, patient history of cardiac diagnoses and lack of oncologic diagnoses, and higher county-level rates of hospitalizations for ambulatory-care sensitive conditions. Further inspection of partial dependency plots revealed nonlinear population-level associations specifically for surgeon and hospital procedure volumes. The readmission risk for THA and TKA decreased as surgeons performed more procedures in the prior 365 days, up to approximately 75 TJAs (odds ratio [OR] = 1.2 for TKA and 1.3 for THA), but no further risk reduction was observed for higher annual surgeon procedure volumes. For THA, the readmission risk decreased as hospitals performed more procedures, up to approximately 600 TJAs (OR = 1.2), but no further risk reduction was observed for higher annual hospital procedure volumes.
A large dataset of Medicare claims and machine learning were inadequate to provide a clinically useful individual prediction model for 30-day unplanned readmissions after TKA or THA, suggesting that other factors that are not routinely collected in claims databases are needed for predicting readmissions. Nonlinear population-level associations between low surgeon and hospital procedure volumes and increased readmission risk were identified, including specific volume thresholds above which the readmission risk no longer decreases, which may still be indirectly clinically useful in guiding policy as well as patient decision-making when selecting a hospital or surgeon for treatment.
Level III, therapeutic study.
全膝关节置换术(TJA)后计划外的住院再入院代表潜在的严重不良事件,仍然是医院质量的关键衡量标准。预测 TJA 后再入院的风险可能为患者和临床医生提供术前决策的有价值信息。
问题/目的:(1)能否通过整合术前可获得的患者、外科医生、医院和县级信息的非线性机器学习模型来预测在接受 TJA 的全国性医疗保险受益人群中的大型队列中 30 天内计划外的医院再入院?(2)哪些预测因素对预测 30 天内医院再入院最重要?(3)通过解释 30 天再入院最重要预测因素的部分依赖关系图(描述我们的建模选择与再入院之间的潜在非线性关系的图),我们可以获得有关人群水平关联的哪些具体信息?
分析了 2016 年 10 月至 2018 年 9 月期间接受住院 TJA 的全国性医疗保险索赔数据(选择该数据库是因为它代表了每年接受 TJA 的患者的很大一部分)。共纳入 679041 例 TJA(239391 例全髋关节置换术[61.3%女性,91.9%白人,52.6%年龄在 70 至 79 岁之间]和 439650 例膝关节置换术[63.3%女性,90%白人,55.2%年龄在 70 至 79 岁之间])。模型特征包括人口统计学特征、县级社会健康决定因素、前一年(365 天)医院和外科医生 TJA 手术量以及临床分类软件细化的诊断和手术类别,这些类别总结了每位患者在 TJA 前 365 天的医疗保险索赔。机器学习模型,即具有成对交互作用的广义加性模型(由单变量预测和成对交互项组成的预测模型,允许存在非线性效应),使用接收器操作特征曲线下面积(AUROC;1.0=完美区分,0.5=不如随机机会)和精度-召回曲线(AUPRC;等效于平均阳性预测值,当大多数时间实际上没有“再入院”时,不会因猜测“无再入院”而给予信贷,相对于再入院的基础率可解释)在两个保留样本上进行了训练和评估。收集并随机分为 80%/20%的所有入院(最后两个月的除外)。随机 80%样本形成训练队列,对其进行了下采样(以便包括所有再入院和随机数量相同的非再入院)。随机 20%的样本作为第一个测试队列(“随机保留队列”)。最初保留的最后两个月的入院情况作为第二个测试队列(“2 个月保留队列”)。最后,研究了特征重要性(每个变量对预测的贡献程度)和部分依赖关系图,以回答第二个和第三个研究问题。
对于随机保留样本,AUROC 和 AUPRC 指标的模型性能值分别为 THA 的 0.65 和 0.087,TKA 的 0.66 和 0.077。对于 2 个月保留样本,这些数字分别为 0.66 和 0.087,TKA 的 0.65 和 0.075。因此,我们纳入了来自医疗保险索赔数据的广泛术前特征的非线性模型无法很好地预测个人再入院的可能性(即,模型表现不佳,不适合临床使用)。最重要的预测特征(以平均绝对分数表示)及其部分依赖图仍然提供了有关再入院风险增加的人群水平关联的信息,即患者年龄较大、前一年外科医生和医院 TJA 手术量较低、为男性、有心脏病诊断和缺乏肿瘤诊断、以及县级医院因门诊保健敏感条件而住院的比例较高。对部分依赖图的进一步检查显示了特定的人群水平关联,具体来说是外科医生和医院手术量的非线性关联。THA 和 TKA 的再入院风险随着外科医生在前一年进行的手术数量增加而降低,直到大约 75 例 TJA(TKA 的 OR=1.2,THA 的 OR=1.3),但对于更高的年度外科医生手术量,再入院风险没有进一步降低。对于 THA,随着医院进行的手术数量增加,再入院风险降低,直到大约 600 例 TJA(OR=1.2),但对于更高的年度医院手术量,再入院风险没有进一步降低。
大量的医疗保险索赔和机器学习数据不足以提供 30 天内 TKA 或 THA 后计划外再入院的临床有用的个体预测模型,这表明需要常规收集索赔数据库中未收集的其他因素来预测再入院。确定了低外科医生和医院手术量与增加的再入院风险之间的非线性人群水平关联,包括再入院风险不再降低的特定体积阈值,这在指导政策以及患者在选择治疗医院或外科医生时的决策时可能仍然具有间接的临床意义。
III 级,治疗性研究。