Neural Network Technologies for Age Estimation in Children from Orthopantomograms (a Pilot Study)
The aim of the study was to investigate the potential of using artificial intelligence technologies for age estimation in children from dental radiographs.
Materials and Methods. A retrospective study was conducted, analyzing orthopantomograms of 322 children (173 female, 149 male) aged 4–16 years. Fourteen permanent mandibular teeth were annotated on each radiograph. Neural network training was performed by splitting the data into training and test sets at a ratio of 80:20; 5-fold cross-validation was used. Age estimation was approached as a regression task. The neural network training and validation were conducted in Python using the PyTorch library. The accuracy of age prediction was assessed using the coefficient of determination (R2), mean squared error (MSE), and mean absolute error (MAE).
Results. The study showed that the developed machine learning model was highly accurate in age estimation in children. The mean absolute error across cross-validation was 0.92 years, which was significantly lower than the error associated with traditional manual methods.
Introduction
In the vast majority of cases, forensic medical expert practice relies on “manual” methods to measure and visually assess the dental status to estimate the age [1, 2]. The most popular among them are the following: the method of qualitative dentition estimation based on the stages developed by A. Demirjian; its modification proposed by G. Willems; and the quantitative method developed by R. Cameriere. The Demirjian’s method is based on determining the stages of tooth formation and development (eight stages from A to H) from a panoramic radiograph (orthopantomogram). The developmental stage of each tooth is converted into scores using a conversion table considering gender, and these scores are summed. The final maturity score is converted into dental age by using calculation tables or charts [3]. Willems subsequently slightly modified the method, simplifying the conversion of final scores into age, but the system of staging teeth according to the mineralization degree remained the method’s basis [3, 4]. Cameriere proposed a quantitative method for dental age estimation based on the correlation between age and the size of the open apices of tooth roots [5]. As practice has shown, these methods have a number of limitations. One of the main limitations is the requirement for the specialist to have at least a basic knowledge of dental development characteristics, understand the specifics of the method used, have enough time for the examination, and possess expertise in conducting this type of assessment. All these conditions affect the level of intra- and inter-expert consistency and can ultimately lead to inaccurate examination results [6–8].
Nowadays, the level of development of innovative technologies allows for the modernization of existing traditional methods. One option for increasing accuracy is the use of neural network technologies, namely machine learning methods. The application of digital innovations will make the age estimation faster, more accurate, and more objective [8]. Recently, the use of artificial intelligence has significantly expanded. The latest advances in deep learning in forensic medicine have shown promising results in age estimation using computed tomography scans of the knee joint [9], computed tomography scans of the neck [10], and other bone structures [11]. These studies demonstrate the great potential of using deep learning for age estimation. A number of studies in the field of artificial intelligence are devoted to the dental status assessment based on the analysis of third molars [6, 7], seven mandibular teeth [12], but the assessment of the entire mandibular dentition has not been carried out yet, which makes the scientific research up-to-date.
The aim of the study was to investigate the potential of using artificial intelligence technologies for age estimation in children from dental radiographs (orthopantomograms).
Materials and Methods
Sample characteristics. The study used panoramic dental radiographs obtained from the archival records of the Clinical Center of the Institute of Dentistry, I.M. Sechenov First Moscow State Medical University (Russia). The radiographs were taken for diagnostic and therapeutic purposes between 2017 and 2023. All images were obtained using a Pan eXam Plus digital panoramic X-ray machine (KaVo Dental, Germany) with the following exposure parameters: tube voltage — 66 kV, tube current — 2.5 mA, exposure time — 17 s. The files were saved in JPEG format with an original image resolution of 2800×1480 pixels.
A total of 322 orthopantomograms were included in the study. Exclusion criteria were the following: panoramic radiographs without information on patient sex, date of birth, or date of image acquisition; images from patients with a history of systemic diseases (including metabolic disorders and conditions affecting dental development); orthopantomograms showing visible bone tissue pathology; and poor-quality images. Patient age was calculated as the difference between the date of birth and the date of image acquisition. The age range was 4 to 16 years, with a mean age of 10.8 years. There were 46.3% males and 53.7% females. The distribution of patients by sex and age is shown in Figure 1.
|
Figure 1. Distribution of the study sample by sex and age for female (left) and male (right) patients |
The study was conducted in accordance with the Declaration of Helsinki (2024). The research protocol was approved by the Ethics Committee of I.M. Sechenov First Moscow State Medical University (protocol No.02-24, dated January 29, 2024). All data used in this study were anonymized to ensure confidentiality.
Data preparation. In the initial stage, key parameters (sex and age) were extracted, and the orthopantomograms were simultaneously anonymized to prevent this information from influencing the model’s training process. Data annotation for machine learning was performed using the LabelMe software. On each orthopantomogram, the permanent mandibular teeth were annotated and labeled according to their corresponding tooth numbers (31–37 and 41–47), excluding the third molars (38 and 48). The following parameters were outlined for each tooth: tooth contour perimeter (S), metric characteristics including tooth length (L), and the width of the open apex/apices (AB, Ab, Ad). Depending on the stage of tooth development and mineralization, between two and four parameters were annotated per tooth. For example, for tooth 31, two parameters were annotated (perimeter — 31S; tooth length — 31L), while for tooth 36, four parameters were annotated (perimeter — 36S; tooth length — 36L; width of the distal and medial roots — 36Ad and 36Am, respectively). Figure 2 shows a fully annotated orthopantomogram image in the LabelMe software, where each parameter of the 14 teeth is outlined in a different color to ensure subsequent neural network training. The annotated file was saved in JSON format in a dedicated folder. The dataset was subsequently used for model training. The methodological basis for annotation was the original Cameriere’s method based on measuring the metric characteristics of seven left mandibular teeth. According to this method, a linear regression equation is used to estimate the child’s age [3].
|
Figure 2. Example of annotation (labeling of tooth perimeter, tooth length, and root width) of permanent mandibular teeth using LabelMe software |
Neural networks. Machine learning methods, specifically neural networks, were used to further estimate the age. The age estimation task was approached as a regression in years using the EfficientNet-b0 neural network architecture [13–15] (Figure 3). This network was chosen due to its suitability with small datasets and positive results of its practical application. Neural network training and validation were conducted using the Python library PyTorch.
|
Figure 3. EfficientNet network architecture [15] |
The regression-based age prediction task using the neural network was performed sequentially and consisted of the following stages. First, to reduce fluctuations in the neural network evaluation metrics, a 5-fold cross-validation of the model was conducted. The study evaluated the performance of both a single neural network and an ensemble consisting of varying numbers of networks of the same architecture, each trained from different random initial states. The general approach to model training and evaluation was the following: each model was trained for 50 epochs using 5-fold cross-validation. Training was performed by splitting the dataset in a 4:1 ratio, with 80% of the data used for training and 20% for testing. The system testing was conducted using a data block swapping method, where different data blocks were alternately used for testing against the training data to achieve the most stable mean absolute error (MAE) metric (Figure 4). During training, standard image augmentations (rotation, inversion, and mirroring) were used. The mean absolute error on the test dataset was evaluated at each epoch, and the best-performing model was saved. Optimization was conducted using the backpropagation algorithm, specifically the Adam (Adaptive Moment Estimation) method.
|
Figure 4. Cross-validation scheme used in the study (F1–F5 — sample subsets, IT1–IT5 — data iterations) |
Statistical analysis of the data included evaluation of age prediction accuracy using the following metrics: coefficient of determination (R2), mean absolute error (MAE), and mean squared error (MSE).
Results
During the search for the optimal training parameters, the following values were identified and established: learning rate — 0.001, input image edge size — 224 pixels, and batch size for training — 16. To build the ensemble, independent sequential training of the models was performed using the Adam method. Table 1 shows the MAE values for different ensemble sizes obtained during testing and optimal model selection.
|
Table 1. Table of values of mean absolute error (MAE) and mean squared error (MSE) in different numbers of neural networks |
After that, the performance of the models within the ensemble was evaluated: predictions were obtained from each individual model on the test data and averaged with equal weights, which allowed to reduce prediction variance. Through cross-validation, the most optimal and acceptable training parameters were identified. Finally, an ensemble of five models was selected as the most accurate. When the ensemble size exceeded five models, the error margin did not further decrease, justifying the choice of this number. Thus, the mean absolute error from a single neural network model was 1.04 years, whereas in a case of the ensemble of five models it was 0.927 years (see Table 1). The error distribution for the five-model ensemble is demonstrated in Figure 5.
![]() |
Figure 5. Error distribution graph for test data using an ensemble of five models |
The quality of the regression result was additionally visualized using a scatter plot of the test data based on predictions from the five-model ensemble, demonstrating an MAE of 0.92 and an R2 of 0.86 (Figure 6).
![]() |
Figure 6. Joint distribution of chronological and predicted age using an ensemble of five neural network models on test data |
Analysis of the joint distribution of chronological age and predicted age showed that the majority of values fell within the desired value of the true age. Each point in the plot represents an individual sample from the test set, and its location (true age, predicted age) is close to the ideal position for accurate estimations. Therefore, the selected model successfully achieved an effective solution.
Discussion
The majority of contemporary methods for dental age estimation involve a complex process of “manual” determination of a wide range of morphometric features, which are then compared to reference values stratified by sex and ethnicity, or involve measuring metric dental parameters followed by age calculation using regression equations. In international practice, the Demirjian’s and Cameriere’s methods are most commonly used for age estimation [3–5]. These methods have gained widespread acceptance across different countries due to their ease of use and acceptable results. However, “manual” methods have several limitations. The main one is connected with estimation difficulties caused by the task monotony and expert’s eyestrain, as well as lack of experience in analyzing radiographic images (orthopantomograms) and unfamiliarity/non-compliance with the estimation methodology [6]. Inter-expert variability in evaluating tooth development stages has also been noted, due to the common similarities between the adjacent mineralization stages. It may lead to overestimation or underestimation of chronological age [12, 16]. Consequently, increasing attention is paid to the development of automated algorithms (programs, methods) which can predict the age without expert involvement, therefore, eliminating subjectivity and improving prediction accuracy.
Research findings have demonstrated that, compared to original “manual” methods, all types of machine learning have greater accuracy [6]; however, error values vary depending on the underlying method (Demirjian, Cameriere, or Willems) used as the basis for annotation and on the machine learning approach employed (linear regression, random forest, support vector machines, etc). For instance, in a study by Galibourg et al. [17], a machine learning algorithm based on the Demirjian’s classification of radiographic stages of permanent teeth resulted in an MAE of 0.811 years, whereas “manual” estimation performed for comparison in the same study produced an MAE of 1.107 years. In the study by Shen et al. [12], the MAE for the traditional European Cameriere’s formula was 0.846 years, while machine learning methods based on Cameriere’s tooth maturation stages proved to be more accurate in estimating dental age: linear regression achieved an MAE of 0.55 years, and the support vector machine method had an MAE of 0.49 years. In the work of Tao et al. [18], assessment using the Demirjian’s method showed an MAE of 1.307 years for the entire sample, whereas error calculations using the MLP (multilayer perceptron) method were 0.75 years for the entire sample. In a study by Abuabara et al. [19], the accuracy of eight artificial intelligence models was compared with that of the traditional method. The results demonstrated that gradient boosting and random forest models had the highest performance and the lowest mean error. Thus, all studies confirmed the hypothesis of more accurate age prediction using machine learning methods (Table 2), which is consistent with the findings of the present study.
It should be noted that automated approaches using deep learning methods encounter a number of challenges in their planning and implementation. They also have some limitations, such as shortage of digital data, the time-consuming data annotation, selection of the optimal training method, and difficulties in fine-tuning, which influence the development and practical implementation of new methods [6, 12, 19]. This requires researchers to carefully select material appropriate for the study, ensure its compliance with inclusion criteria, develop a precise research plan and methodology, use reliable training methods, and engage qualified specialists with expertise in artificial intelligence.
The present study focused only on a sample of orthopantomograms from children aged 4 to 16 years; future research and results will be improved by modifying the sample characteristics through increasing its size and expanding the age range. In the following research, we will include materials from different centers and cover other age groups to create a more comprehensive dataset with a relatively balanced distribution of demographic characteristics. The ultimate goal of this project is to develop a new software product for age estimation in Russia with its subsequent practical implementation. It should be mentioned that similar tools are available at medical institutions in other countries; however, this is an innovation for Russia. Conducting research in this area corresponds to the healthcare digitalization, which is reflected in the development and implementation of modern information technologies leading to practical results being software creation.
Conclusion
A machine learning algorithm for age estimation based on dental radiographs in children aged 4 to 16 years was presented and tested. By analyzing 14 permanent mandibular teeth and using a new set of data annotation features, with the help of linear regression and neural network methods, it was possible to successfully predict children’s age from orthopantomograms with a mean error of 0.92 years. These findings confirm the potential and real effectiveness of practical implementation of machine learning algorithms alongside with or instead of standard “manual” methods using assessment tables. This pilot project (experiment) has confirmed that an artificial intelligence-based algorithm is more accurate than widely used traditional methods for dental age prediction. The positive obtained results open up new opportunities for further integration of this algorithm into the practical work of both forensic medicine specialists and doctors in other clinical fields. The development of this scientific direction implies a quantitative increase and diversity of the sample, the use of various types of neural network algorithms, and their combinations to successfully achieve the ultimate goal being the software creation to improve the accuracy of age prediction based on dental status.
Study funding and conflict of interest. The study received no funding from any sources, and there are no conflicts of interest associated with this research.
References
- Zolotenkova G.V., Poletaeva M.P. Review of age assessment methods of children and adolescents by teeth X-rays. Sudebno-meditsinskaia ekspertiza 2024; 67(2): 47–52, https://doi.org/10.17116/sudmed20246702147.
- Cummaudo M., De Angelis D., Magli F., Minà G., Merelli V., Cattaneo C. Age estimation in the living: a scoping review of population data for skeletal and dental methods. Forensic Sci Int 2021; 320: 110689, https://doi.org/10.1016/j.forsciint.2021.110689.
- Demirjian A., Goldstein H., Tanner J.M. A new system of dental age assessment. Hum Biol 1973; 45(2): 211–227.
- Chaillet N., Willems G., Demirjian A. Dental maturity in Belgian children using Demirjian's method and polynomial functions: new standard curves for forensic and clinical use. J Forensic Odontostomatol 2004; 22(2): 18–27.
- Cameriere R., Ferrante L., Cingolani M. Age estimation in children by measurement of open apices in teeth. Int J Legal Med 2006; 120(1): 49–52, https://doi.org/10.1007/s00414-005-0047-9.
- Vila-Blanco N., Varas-Quintana P., Tomás I., Carreira M.J. A systematic overview of dental methods for age assessment in living individuals: from traditional to artificial intelligence-based approaches. Int J Legal Med 2023; 137(4): 1117–1146, https://doi.org/10.1007/s00414-023-02960-z.
- Shen S., Zhou Z., Wang J., Fan L., Han J., Tao J. Using machine learning to determine age over 16 based on development of third molar and periodontal ligament of second molar. BMC Oral Health 2023; 23(1): 680, https://doi.org/10.1186/s12903-023-03284-5.
- Vila-Blanco N., Carreira M.J., Varas-Quintana P., Balsa-Castro C., Tomas I. Deep neural networks for chronological age estimation from OPG images. IEEE Trans Med Imaging 2020; 39(7): 2374–2384, https://doi.org/10.1109/TMI.2020.2968765.
- Zolotenkov D.D., Ogarev E.V., Valetov D.K., Nefedova S.M., Zolotenkova G.V., Pigolkin Y.I. Age assessment using CT of knee joint and neural network technologies. Sudebno-meditsinskaia ekspertiza 2023; 66(4): 34–40, https://doi.org/10.17116/sudmed20236604134.
- Zolotenkova G.V., Valetov D.K., Poletaeva M.P., Vassilevski Y.V. Experience of using neural networks to assess age-related changes in some structures of the skull and cervical vertebrae based on CT scans (pilot project). Sovremennye tehnologii v medicine 2024; 16(2): 29–38, https://doi.org/10.17691/stm2024.16.2.03.
- Pigolkin Y.I., Solodovnikov V.I., Zolotenkov D.D., Salem B.R., Poletaeva M.P., Zolotenkova G.V. Experience of Data Analysis application for proceeding research data in solving the problem of target age group establishment. Sudebno-meditsinskaia ekspertiza 2024; 67(4): 37–41, https://doi.org/10.17116/sudmed20246704137.
- Shen S., Liu Z., Wang J., Fan L., Ji F., Tao J. Machine learning assisted Cameriere method for dental age estimation. BMC Oral Health 2021; 21(1): 641, https://doi.org/10.1186/s12903-021-01996-0.
- Huang G., Liu Z., Van Der Maaten L., Weinberger K.Q. Densely connected convolutional networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). USA; 2017, p. 2261–2269, https://doi.org/10.1109/CVPR.2017.243.
- Tan M., Le Q.V. EfficientNet: rethinking model scaling for convolutional neural networks. In: Proceedings of the 36th International Conference on Machine Learning, ICML. USA; 2019; p. 6105–6114. URL: https://proceedings.mlr.press/v97/tan19a.html.
- Xu R., Lin H., Lu K., Cao L., Liu Y. A forest fire detection system based on ensemble learning. Forests 2021; 12: 217, https://doi.org/10.3390/f12020217.
- Willmann C., Fernandez De Grado G., Kolb C., Raul J.S., Musset A.M., Gros C.I., Offner D. Accuracy of age estimation using three dental age estimation methods in a young, large, and multiethnic patient sample. Dent J (Basel) 2023; 11(12): 288, https://doi.org/10.3390/dj11120288.
- Galibourg A., Cussat-Blanc S., Dumoncel J., Telmon N., Monsarrat P., Maret D. Comparison of different machine learning approaches to predict dental age using Demirjian's staging approach. Int J Legal Med 2021; 135(2): 665–675, https://doi.org/10.1007/s00414-020-02489-5.
- Tao J., Wang J., Wang A., Xie Z., Wang Z., Wu S., Hassanien A.E., Xiao K. Dental age estimation: a machine learning perspective. In: The International Conference on Advanced Machine Learning Technologies and Applications (AMLTA2019). Springer, Cham; 2019; p. 722–733, https://doi.org/10.1007/978-3-030-14118-9_71.
- Abuabara A., do Nascimento T.V.P.M., da Cruz K.R., Küchler E.C., Madalena I.R., de Oliveira M.B.C.R., Lepri C.P., de Menezes-Oliveira M.A.H., de Araujo C.M., Baratto-Filho F. Dental age estimation by comparing Demirjian's method and machine learning in Southeast Brazilian youth. Forensic Sci Med Pathol 2025, https://doi.org/10.1007/s12024-025-01042-3.

















