Added Value of Deep Learning-based Detection System for Multiple Major Findings on Chest Radiographs: A Randomized Crossover Study
- PMID: 33754828
- DOI: 10.1148/radiol.2021202818
Added Value of Deep Learning-based Detection System for Multiple Major Findings on Chest Radiographs: A Randomized Crossover Study
Abstract
Background Previous studies assessing the effects of computer-aided detection on observer performance in the reading of chest radiographs used a sequential reading design that may have biased the results because of reading order or recall bias. Purpose To compare observer performance in detecting and localizing major abnormal findings including nodules, consolidation, interstitial opacity, pleural effusion, and pneumothorax on chest radiographs without versus with deep learning-based detection (DLD) system assistance in a randomized crossover design. Materials and Methods This study included retrospectively collected normal and abnormal chest radiographs between January 2016 and December 2017 (https://cris.nih.go.kr/; registration no. KCT0004147). The radiographs were randomized into two groups, and six observers, including thoracic radiologists, interpreted each radiograph without and with use of a commercially available DLD system by using a crossover design with a washout period. Jackknife alternative free-response receiver operating characteristic (JAFROC) figure of merit (FOM), area under the receiver operating characteristic curve (AUC), sensitivity, specificity, false-positive findings per image, and reading times of observers with and without the DLD system were compared by using McNemar and paired t tests. Results A total of 114 normal (mean patient age ± standard deviation, 51 years ± 11; 58 men) and 114 abnormal (mean patient age, 60 years ± 15; 75 men) chest radiographs were evaluated. The radiographs were randomized to two groups: group A (n = 114) and group B (n = 114). Use of the DLD system improved the observers' JAFROC FOM (from 0.90 to 0.95, P = .002), AUC (from 0.93 to 0.98, P = .002), per-lesion sensitivity (from 83% [822 of 990 lesions] to 89.1% [882 of 990 lesions], P = .009), per-image sensitivity (from 80% [548 of 684 radiographs] to 89% [608 of 684 radiographs], P = .009), and specificity (from 89.3% [611 of 684 radiographs] to 96.6% [661 of 684 radiographs], P = .01) and reduced the reading time (from 10-65 seconds to 6-27 seconds, P < .001). The DLD system alone outperformed the pooled observers (JAFROC FOM: 0.96 vs 0.90, respectively, P = .007; AUC: 0.98 vs 0.93, P = .003). Conclusion Observers including thoracic radiologists showed improved performance in the detection and localization of major abnormal findings on chest radiographs and reduced reading time with use of a deep learning-based detection system. © RSNA, 2021 Online supplemental material is available for this article.
Similar articles
-
Deep learning-based detection system for multiclass lesions on chest radiographs: comparison with observer readings.Eur Radiol. 2020 Mar;30(3):1359-1368. doi: 10.1007/s00330-019-06532-x. Epub 2019 Nov 20. Eur Radiol. 2020. PMID: 31748854
-
Development and Validation of Deep Learning-based Automatic Detection Algorithm for Malignant Pulmonary Nodules on Chest Radiographs.Radiology. 2019 Jan;290(1):218-228. doi: 10.1148/radiol.2018180237. Epub 2018 Sep 25. Radiology. 2019. PMID: 30251934
-
Evaluation of a deep learning-based computer-aided detection algorithm on chest radiographs: Case-control study.Medicine (Baltimore). 2021 Apr 23;100(16):e25663. doi: 10.1097/MD.0000000000025663. Medicine (Baltimore). 2021. PMID: 33879750 Free PMC article.
-
The Added Effect of Artificial Intelligence on Physicians' Performance in Detecting Thoracic Pathologies on CT and Chest X-ray: A Systematic Review.Diagnostics (Basel). 2021 Nov 26;11(12):2206. doi: 10.3390/diagnostics11122206. Diagnostics (Basel). 2021. PMID: 34943442 Free PMC article. Review.
-
Introduction to the interpretation of chest radiographs during donor care.Prog Transplant. 2005 Sep;15(3):240-8. doi: 10.1177/152692480501500307. Prog Transplant. 2005. PMID: 16252630 Review.
Cited by
-
Clinical outcomes and actual consequence of lung nodules incidentally detected on chest radiographs by artificial intelligence.Sci Rep. 2023 Nov 13;13(1):19732. doi: 10.1038/s41598-023-47194-6. Sci Rep. 2023. PMID: 37957283 Free PMC article.
-
Developing and Evaluating an AI-Based Computer-Aided Diagnosis System for Retinal Disease: Diagnostic Study for Central Serous Chorioretinopathy.J Med Internet Res. 2023 Nov 29;25:e48142. doi: 10.2196/48142. J Med Internet Res. 2023. PMID: 38019564 Free PMC article.
-
Accurate auto-labeling of chest X-ray images based on quantitative similarity to an explainable AI model.Nat Commun. 2022 Apr 6;13(1):1867. doi: 10.1038/s41467-022-29437-8. Nat Commun. 2022. PMID: 35388010 Free PMC article.
-
Effects of Expert-Determined Reference Standards in Evaluating the Diagnostic Performance of a Deep Learning Model: A Malignant Lung Nodule Detection Task on Chest Radiographs.Korean J Radiol. 2023 Feb;24(2):155-165. doi: 10.3348/kjr.2022.0548. Korean J Radiol. 2023. PMID: 36725356 Free PMC article.
-
Learning from the machine: AI assistance is not an effective learning tool for resident education in chest x-ray interpretation.Eur Radiol. 2023 Nov;33(11):8241-8250. doi: 10.1007/s00330-023-10043-1. Epub 2023 Aug 12. Eur Radiol. 2023. PMID: 37572190
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Miscellaneous