Stefanakis Konstantinos, Mingrone Geltrude, George Jacob, Mantzoros Christos S
Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA.
Università Cattolica del Sacro Cuore, Rome, Italy.
Metabolism. 2025 Feb;163:156082. doi: 10.1016/j.metabol.2024.156082. Epub 2024 Nov 19.
There are no known non-invasive tests (NITs) designed for accurately detecting metabolic dysfunction-associated steatohepatitis (MASH) with liver fibrosis stages F2-F3, excluding cirrhosis-the FDA-defined range for prescribing Resmetirom and other drugs in clinical trials. We aimed to validate and re-optimize known NITs, and most importantly to develop new machine learning (ML)-based NITs to accurately detect MASH F2-F3.
Clinical and metabolomic data were collected from 443 patients across three countries and two clinic types (metabolic surgery, gastroenterology/hepatology) covering the entire spectrum of biopsy-proven MASH, including cirrhosis and healthy controls. Three novel types of ML models were developed using a categorical gradient boosting machine pipeline under a classic 4:1 split and a secondary independent validation analysis. These were compared with twenty-three biomarker, imaging, and algorithm-based NITs with both known and re-optimized cutoffs for MASH F2-F3.
The NAFLD (Non-Alcoholic Fatty Liver Disease) Fibrosis Score (NFS) at a - 1.455 cutoff attained an area under the receiver operating characteristic curve (AUC) of 0.59, the highest sensitivity (90.9 %), and a negative predictive value (NPV) of 87.2 %. FIB-4 risk stratification followed by elastography (8 kPa) had the best specificity (86.9 %) and positive predictive value (PPV) (63.3 %), with an AUC of 0.57. NFS followed by elastography improved the PPV to 65.3 % and AUC to 0.62. Re-optimized FibroScan-AST (FAST) at a 0.22 cutoff had the highest PPV (69.1 %). ML models using aminotransferases, metabolic syndrome components, BMI, and 3-ureidopropionate achieved an AUC of 0.89, which further increased to 0.91 following hyperparameter optimization and the addition of alpha-ketoglutarate. These new ML models outperformed all other NITs and displayed accuracy, sensitivity, specificity, PPV, and NPV up to 91.2 %, 85.3 %, 97.0 %, 92.4 %, and 90.7 % respectively. The models were reproduced and validated in a secondary sensitivity analysis, that used one of the cohorts as feature selection/training, and the rest as independent validation, likewise outperforming all other applicable NITs.
We report for the first time the diagnostic characteristics of non-invasive, metabolomics-based biomarker models to detect MASH with fibrosis F2-F3 required for Resmetirom treatment and inclusion in ongoing phase-III trials. These models may be used alone or in combination with other NITs to accurately determine treatment eligibility.
目前尚无专门设计用于准确检测伴有F2 - F3期肝纤维化的代谢功能障碍相关脂肪性肝炎(MASH)的非侵入性检测方法(NITs),不包括肝硬化——这是美国食品药品监督管理局(FDA)在临床试验中规定的使用Resmetirom和其他药物的范围。我们旨在验证和重新优化已知的NITs,最重要的是开发基于机器学习(ML)的新型NITs,以准确检测MASH F2 - F3。
收集了来自三个国家和两种临床类型(代谢手术、胃肠病学/肝病学)的443例患者的临床和代谢组学数据,涵盖了经活检证实的MASH的整个范围,包括肝硬化和健康对照。在经典的4:1分割和二次独立验证分析下,使用分类梯度提升机管道开发了三种新型ML模型。将这些模型与23种基于生物标志物、影像学和算法的NITs进行比较,这些NITs具有已知的和重新优化的MASH F2 - F3临界值。
NAFLD(非酒精性脂肪性肝病)纤维化评分(NFS)在临界值为 - 1.455时,受试者工作特征曲线(AUC)下面积为0.59,灵敏度最高(90.9%),阴性预测值(NPV)为87.2%。FIB - 4风险分层后进行弹性成像(8 kPa)具有最佳特异性(86.9%)和阳性预测值(PPV)(63.3%),AUC为0.57。NFS后进行弹性成像将PPV提高到65.3%,AUC提高到0.62。重新优化的FibroScan - AST(FAST)在临界值为0.22时具有最高PPV(69.1%)。使用转氨酶、代谢综合征成分、BMI和3 - 脲基丙酸的ML模型AUC为0.89,在进行超参数优化并添加α - 酮戊二酸后进一步提高到0.91。这些新的ML模型优于所有其他NITs,显示出的准确性、灵敏度、特异性、PPV和NPV分别高达91.2%、85.3%、97.0%、92.4%和90.7%。这些模型在二次敏感性分析中得到再现和验证,该分析将其中一个队列用作特征选择/训练,其余用作独立验证,同样优于所有其他适用的NITs。
我们首次报告了基于代谢组学的非侵入性生物标志物模型检测Resmetirom治疗所需的伴有F2 - F3期纤维化的MASH的诊断特征,并将其纳入正在进行的III期试验。这些模型可单独使用或与其他NITs联合使用,以准确确定治疗资格。