End-to-End vs. Human-Defined Feature Extraction: Comparing Deep Learning Approaches for Age Classification Using Mandibular Third Molars: Comparing Feature Extraction for Mandibular Molar Age Classification

Witsarut Upalananda; Arnon Charuakkra; Sitthichok Chaichulee

doi:10.5281/zenodo.17776415

Authors

Witsarut Upalananda Department of Oral Diagnostic Sciences, Faculty of Dentistry, Prince of Songkla University, Songkhla, Thailand https://orcid.org/0000-0002-3972-9785
Arnon Charuakkra Department of Oral Biology and Oral Diagnostic Sciences, Faculty of Dentistry, Chiang Mai University, Chiang Mai, Thailand, https://orcid.org/0009-0002-4765-2447
Sitthichok Chaichulee Department of Biomedical Sciences and Biomedical Engineering, Faculty of Medicine, Prince of Songkla University, Songkhla, Thailand https://orcid.org/0000-0002-3159-7925

DOI:

https://doi.org/10.5281/zenodo.17776415

Abstract

Accurate age classification using mandibular third molar radiographs is crucial for legal and forensic applications. This study evaluated different methods for classifying age as under or over 18 in a Thai population. We compared three approaches: the traditional human-based method using Demirjian’s classification, an end-to-end deep learning model, and a human-defined feature extraction approach where a CNN predicts tooth development stages in order to determine age. The dataset included 3,407 images of individuals aged 14-23 years. The results indicated that the traditional human-based method achieved high specificity (0.99) and a strong Bayes’ post-test probability (0.99), but it exhibited low sensitivity (0.45). In comparison, the end-to-end deep learning models showed higher sensitivity (0.65 to 0.74) than the traditional method, along with specificity of 0.91 to 0.95 and Bayes’ post-test probability of 0.93 to 0.95. The human-defined feature extraction approach, which utilised developmental stages for age determination, achieved accuracy of 0.88 to 0.92 in developmental stage classification. For age classification, the models demonstrated higher specificity (0.95 to 0.97) and Bayes’ post-test probability (0.95 to 0.97) than the end-to-end deep learning method, along with sensitivity ranging from 0.51 to 0.56. Our results indicate that while traditional methods excel in specificity, the human-defined feature extraction approach provides a balanced solution with high specificity and interpretability, suggesting its potential value in clinical practice for age estimation.

End-to-End vs. Human-Defined Feature Extraction: Comparing Deep Learning Approaches for Age Classification Using Mandibular Third Molars

Comparing Feature Extraction for Mandibular Molar Age Classification

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Developed By