Department of Computer Science and Statistics, Poznan University of Medical Sciences, 60-806 Poznan, Poland.
Department of Preventive Medicine, Poznan University of Medical Sciences, 60-781 Poznan, Poland.
Int J Environ Res Public Health. 2022 Aug 17;19(16):10213. doi: 10.3390/ijerph191610213.
The need to search for new measures describing the classification of a logistic regression model stems from the difficulty in searching for previously unknown factors that predict the occurrence of a disease. A classification quality assessment can be performed by testing the change in the area under the receiver operating characteristic curve (AUC). Another approach is to use the Net Reclassification Improvement (NRI), which is based on a comparison between the predicted risk, determined on the basis of the basic model, and the predicted risk that comes from the model enriched with an additional factor. In this paper, we draw attention to Cohen's Kappa coefficient, which examines the actual agreement in the correction of a random agreement. We proposed to extend this coefficient so that it may be used to detect the quality of a logistic regression model reclassification. The results provided by Kappa's reclassification were compared with the results obtained using NRI. The random variables' distribution attached to the model on the classification change, measured by NRI, Kappa, and AUC, was presented. A simulation study was conducted on the basis of a cohort containing 3971 Poles obtained during the implementation of a lower limb atherosclerosis prevention program.
由于难以寻找预测疾病发生的先前未知因素,因此需要寻找新的方法来描述逻辑回归模型的分类。可以通过测试接收者操作特征曲线 (AUC) 下面积的变化来进行分类质量评估。另一种方法是使用净重新分类改善 (NRI),该方法基于基本模型确定的预测风险与通过附加因素丰富模型得出的预测风险之间的比较。在本文中,我们提请注意科恩氏 Kappa 系数,该系数检查了纠正随机一致性的实际一致性。我们建议扩展该系数,以便可以使用它来检测逻辑回归模型重新分类的质量。将 Kappa 重新分类的结果与使用 NRI 获得的结果进行了比较。还展示了通过 NRI、Kappa 和 AUC 测量的模型分类变化时随机变量的分布。基于下肢动脉粥样硬化预防计划实施过程中获得的包含 3971 名波兰人的队列进行了模拟研究。