Silva-Sousa Alice Corrêa, Dos Santos Cardoso Gustavo, Branco Antônio Castelo, Küchler Erika Calvano, Baratto-Filho Flares, Candemil Amanda Pelegrin, Sousa-Neto Manoel Damião, de Araujo Cristiano Miranda
Department of Restorative Dentistry, School of Dentistry of Ribeirão Preto, University of São Paulo (USP), Ribeirão Preto, SP, Brazil.
Department of Orthodontics, University of Bonn, Welschnonnenstr, Bonn, Germany.
Clin Oral Investig. 2025 Sep 17;29(10):461. doi: 10.1007/s00784-025-06559-z.
The aim of this study was to assess measurements of the maxillary canines using Cone Beam Computed Tomography (CBCT) and develop a machine learning model for sex estimation.
CBCT scans from 610 patients were screened. The maxillary canines were examined to measure total tooth length, average enamel thickness, and mesiodistal width. Various supervised machine learning algorithms were employed to construct predictive models, including Decision Tree, Gradient Boosting Classifier, K-Nearest Neighbors (KNN), Logistic Regression, Multi-Layer Perceptron (MLP), Random Forest Classifier, Support Vector Machine (SVM), XGBoost, LightGBM, and CatBoost. Validation of each model was performed using a 10-fold cross-validation approach. Metrics such as area under the curve (AUC), accuracy, recall, precision, and F1 Score were computed, with ROC curves generated for visualization.
The total length of the tooth proved to be the variable with the highest predictive power. The algorithms that demonstrated superior performance in terms of AUC were LightGBM and Logistic Regression, achieving AUC values of 0.77 [CI95% = 0.65-0.89] and 0.75 [CI95% = 0.62-0.86] for the test data, and 0.74 [CI95% = 0.70-0.80] and 0.75 [CI95% = 0.70-0.79] in cross-validation, respectively. Both models also showed high precision values.
The use of maxillary canine measurements, combined with supervised machine learning techniques, has proven to be viable for sex estimation.
The machine learning approach combined with is a low-cost option as it relies solely on a single anatomical structure.
本研究旨在使用锥形束计算机断层扫描(CBCT)评估上颌尖牙的测量数据,并开发一种用于性别估计的机器学习模型。
筛选了610例患者的CBCT扫描数据。对上颌尖牙进行检查,测量牙齿总长度、平均釉质厚度和近远中宽度。采用多种监督机器学习算法构建预测模型,包括决策树、梯度提升分类器、K近邻(KNN)、逻辑回归、多层感知器(MLP)、随机森林分类器、支持向量机(SVM)、XGBoost、LightGBM和CatBoost。使用10折交叉验证方法对每个模型进行验证。计算曲线下面积(AUC)、准确率、召回率、精确率和F1分数等指标,并生成ROC曲线进行可视化。
牙齿总长度被证明是预测能力最强的变量。在AUC方面表现优异的算法是LightGBM和逻辑回归,测试数据的AUC值分别为0.77 [CI95% = 0.65 - 0.89]和0.75 [CI95% = 0.62 - 0.86],交叉验证中的值分别为0.74 [CI95% = 0.70 - 0.80]和0.75 [CI95% = 0.70 - 0.79]。两个模型也都显示出较高的精确率值。
上颌尖牙测量数据与监督机器学习技术相结合,已被证明可用于性别估计。
机器学习方法结合是一种低成本选择,因为它仅依赖于单一解剖结构。