End-to-End vs. Human-Defined Feature Extraction: Comparing Deep Learning Approaches for Age Classification Using Mandibular Third Molars
Comparing Feature Extraction for Mandibular Molar Age Classification
DOI:
https://doi.org/10.5281/zenodo.17776415Abstract
Accurate age classification using mandibular third molar radiographs is crucial for legal and forensic applications. This study evaluated different methods for classifying age as under or over 18 in a Thai population. We compared three approaches: the traditional human-based method using Demirjian’s classification, an end-to-end deep learning model, and a human-defined feature extraction approach where a CNN predicts tooth development stages in order to determine age. The dataset included 3,407 images of individuals aged 14-23 years. The results indicated that the traditional human-based method achieved high specificity (0.99) and a strong Bayes’ post-test probability (0.99), but it exhibited low sensitivity (0.45). In comparison, the end-to-end deep learning models showed higher sensitivity (0.65 to 0.74) than the traditional method, along with specificity of 0.91 to 0.95 and Bayes’ post-test probability of 0.93 to 0.95. The human-defined feature extraction approach, which utilised developmental stages for age determination, achieved accuracy of 0.88 to 0.92 in developmental stage classification. For age classification, the models demonstrated higher specificity (0.95 to 0.97) and Bayes’ post-test probability (0.95 to 0.97) than the end-to-end deep learning method, along with sensitivity ranging from 0.51 to 0.56. Our results indicate that while traditional methods excel in specificity, the human-defined feature extraction approach provides a balanced solution with high specificity and interpretability, suggesting its potential value in clinical practice for age estimation.