Enhancement of Non-Linear Deep Learning Model by Adjusting Confounding Variables for Bone Age Estimation in Pediatric Hand X-rays - PubMed Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Oct;36(5):2003-2014.
doi: 10.1007/s10278-023-00849-2. Epub 2023 Jun 2.

Enhancement of Non-Linear Deep Learning Model by Adjusting Confounding Variables for Bone Age Estimation in Pediatric Hand X-rays

Affiliations

Enhancement of Non-Linear Deep Learning Model by Adjusting Confounding Variables for Bone Age Estimation in Pediatric Hand X-rays

Ki Duk Kim et al. J Digit Imaging. 2023 Oct.

Abstract

In medicine, confounding variables in a generalized linear model are often adjusted; however, these variables have not yet been exploited in a non-linear deep learning model. Sex plays important role in bone age estimation, and non-linear deep learning model reported their performances comparable to human experts. Therefore, we investigate the properties of using confounding variables in a non-linear deep learning model for bone age estimation in pediatric hand X-rays. The RSNA Pediatric Bone Age Challenge (2017) dataset is used to train deep learning models. The RSNA test dataset is used for internal validation, and 227 pediatric hand X-ray images with bone age, chronological age, and sex information from Asan Medical Center (AMC) for external validation. U-Net based autoencoder, U-Net multi-task learning (MTL), and auxiliary-accelerated MTL (AA-MTL) models are chosen. Bone age estimations adjusted by input, output prediction, and without adjusting the confounding variables are compared. Additionally, ablation studies for model size, auxiliary task hierarchy, and multiple tasks are conducted. Correlation and Bland-Altman plots between ground truth and model-predicted bone ages are evaluated. Averaged saliency maps based on image registration are superimposed on representative images according to puberty stage. In the RSNA test dataset, adjusting by input shows the best performances regardless of model size, with mean average errors (MAEs) of 5.740, 5.478, and 5.434 months for the U-Net backbone, U-Net MTL, and AA-MTL models, respectively. However, in the AMC dataset, the AA-MTL model that adjusts the confounding variable by prediction shows the best performance with an MAE of 8.190 months, whereas the other models show the best performances by adjusting the confounding variables by input. Ablation studies of task hierarchy reveal no significant differences in the results of the RSNA dataset. However, predicting the confounding variable in the second encoder layer and estimating bone age in the bottleneck layer shows the best performance in the AMC dataset. Ablations studies of multiple tasks reveal that leveraging confounding variables plays an important role regardless of multiple tasks. To estimate bone age in pediatric X-rays, the clinical setting and balance between model size, task hierarchy, and confounding adjustment method play important roles in performance and generalizability; therefore, proper adjusting methods of confounding variables to train deep learning-based models are required for improved models.

Keywords: Bone age estimation; Confounding variable; Deep learning; Model enhancement; Multi-task learning, Pediatric X-ray.

PubMed Disclaimer

Conflict of interest statement

The authors report no conflicts of interest.

Figures

Fig. 1
Fig. 1
Age histograms of each age label. RSNA, Radiological Society of North America; AMC, Asan Medical Center
Fig. 2
Fig. 2
Hand mask generation and hand extraction using U-Net
Fig. 3
Fig. 3
U-Net based multi-task learning (MTL) architecture, which simultaneously conducts image reconstruction and bone age estimation. Figure on the left depicts U-Net MTL architecture and the right depicts the auxiliary-accelerated (AA) MTL architecture. Additional encoders are added in the AA-MTL model to accelerate the target auxiliary regression task. In this figure, sex information is handled as an input vector, which is concatenated before the fully connected layer for bone age estimation
Fig. 4
Fig. 4
Correlation plot and Bland–Altman plot between ground truth bone age label of datasets and multi-task model-predicted bone age. The upper panel of the figure shows the results of the RSNA dataset, and the lower panel shows the results of the AMC dataset. Single task model with sex information, double task model with sex information, and triple task models are shown from the left to right
Fig. 5
Fig. 5
Average Grad-CAM shown in each puberty group. All three models are based on AA-MTL architecture. Pre-puberty is defined as 0–6 years, puberty as 6–14 years for boys and 6–12 years for girls, and post-puberty as 14–18 years for boys and 12–18 years for girls
Fig. 6
Fig. 6
Example anomaly case with superimposed Grad-CAM. The ground truth chronological age is 138 months and the model predicted 227.97 months. Luno-triquetral coalition, a type of carpal coalition, exists in this case, which can be associated with genetic syndromes

Similar articles

References

    1. Goodfellow, I., Y. Bengio, and A. Courville, Deep learning. 2016: MIT press.
    1. Szegedy, C., et al. Going deeper with convolutions. in Proceedings of the IEEE conference on computer vision and pattern recognition. 2015.
    1. He, K., et al. Deep residual learning for image recognition. in Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.
    1. Raghu, V.K., et al., Deep learning to estimate biological age from chest radiographs. JACC: Cardiovascular Imaging, 2021. 14(11): p. 2226–2236. - PMC - PubMed
    1. Korot E, et al. Predicting sex from retinal fundus photographs using automated deep learning. Scientific reports. 2021;11(1):1–8. doi: 10.1038/s41598-021-89743-x. - DOI - PMC - PubMed

Publication types

LinkOut - more resources