TY  - JOUR
AU  - Rassmann, Sebastian
AU  - Abashishvili, Luka
AU  - Melikidze, Elene
AU  - Sukhiashvili, Anastasia
AU  - Lartsuliani, Megi
AU  - Chkhaidze, Ivane
AU  - Tskhvediani, Nino
AU  - Gordeziani, Tinatin
AU  - Kvaratskhelia, Ekaterine
AU  - Kheladze, Nino
AU  - Rekhviashvili, Maia
AU  - Rodonaia, Salome
AU  - Sukhitashvili, Natia
AU  - Urushadze, Nata
AU  - Krawitz, Peter
AU  - Tkemaladze, Tinatin
AU  - Javanmardi, Behnam
TI  - Population-specific calibration and validation of an open-source bone age AI.
JO  - Scientific reports
VL  - 15
IS  - 1
SN  - 2045-2322
CY  - [London]
PB  - Springer Nature
M1  - DZNE-2025-01119
SP  - 32673
PY  - 2025
AB  - Assessing skeletal maturity through bone age (BA) evaluation is crucial for monitoring children's growth and guiding treatments, such as hormonal therapy and orthopedic interventions. In recent years, artificial intelligence (AI) methods have been developed to automate BA assessment. However, bone growth patterns may vary by ancestry, and many AI models are trained on limited population datasets, raising concerns about their applicability to populations not included in the training process. To address this shortcoming for the case of the Georgian population, we retrospectively collected 381 pediatric hand X-rays and established a manual BA reference rating from seven local pediatric radiologists and endocrinologists. We then used a subset of 121 images to perform a sex-specific linear calibration of the open-source AI, Deeplasia, creating Deeplasia-GE. On the held-out test set (n = 260), the default version of Deeplasia achieved a mean absolute difference (MAD) of 6.57 months, which improved to 5.69 months after calibration. We observed that the default Deeplasia overestimates the BA in the Georgian cohort with a signed mean difference (SMD) of + 2.85 and + 5.35 months for females and males respectively, which after calibration is significantly reduced to -0.03 and + 0.58 months for females and males, respectively. We find that Deeplasia-GE has a smaller error than all the raters and, by design, Deeplasia-GE inherits the high test-retest reliability from Deeplasia. These findings suggest that Deeplasia-GE is a reliable AI-based BA assessment method for Georgian children.
KW  - Humans
KW  - Male
KW  - Female
KW  - Age Determination by Skeleton: methods
KW  - Child
KW  - Artificial Intelligence
KW  - Calibration
KW  - Retrospective Studies
KW  - Child, Preschool
KW  - Adolescent
KW  - Infant
KW  - Reproducibility of Results
KW  - Artificial intelligence (Other)
KW  - Global health equity (Other)
KW  - Hand x-rays (Other)
KW  - Model calibration (Other)
KW  - Open-Source (Other)
KW  - Pediatric bone age (Other)
LB  - PUB:(DE-HGF)16
C6  - pmid:40987805
DO  - DOI:10.1038/s41598-025-20148-w
UR  - https://pub.dzne.de/record/281434
ER  -