Impact of a Categorical AI System for Digital Breast Tomosynthesis on Breast Cancer Interpretation by Both General Radiologists and Breast Imaging Specialists

Purpose To evaluate performance improvements of general radiologists and breast imaging specialists when interpreting a set of diverse digital breast tomosynthesis (DBT) examinations with the aid of a custom-built categorical artificial intelligence (AI) system. Materials and Methods A fully balance...

Full description

Saved in:
Bibliographic Details
Published in:Radiology. Artificial intelligence Vol. 6; no. 2; p. e230137
Main Authors: Kim, Jiye G, Haslam, Bryan, Diab, Abdul Rahman, Sakhare, Ashwin, Grisot, Giorgia, Lee, Hyunkwang, Holt, Jacqueline, Lee, Christoph I, Lotter, William, Sorensen, A Gregory
Format: Journal Article
Language:English
Published: United States Radiological Society of North America 01-03-2024
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Purpose To evaluate performance improvements of general radiologists and breast imaging specialists when interpreting a set of diverse digital breast tomosynthesis (DBT) examinations with the aid of a custom-built categorical artificial intelligence (AI) system. Materials and Methods A fully balanced multireader, multicase reader study was conducted to compare the performance of 18 radiologists (nine general radiologists and nine breast imaging specialists) reading 240 retrospectively collected screening DBT mammograms (mean patient age, 59.8 years ± 11.3 [SD]; 100% women), acquired between August 2016 and March 2019, with and without the aid of a custom-built categorical AI system. The area under the receiver operating characteristic curve (AUC), sensitivity, and specificity across general radiologists and breast imaging specialists reading with versus without AI were assessed. Reader performance was also analyzed as a function of breast cancer characteristics and patient subgroups. Results Every radiologist demonstrated improved interpretation performance when reading with versus without AI, with an average AUC of 0.93 versus 0.87, demonstrating a difference in AUC of 0.06 (95% CI: 0.04, 0.08; < .001). Improvement in AUC was observed for both general radiologists (difference of 0.08; < .001) and breast imaging specialists (difference of 0.04; < .001) and across all cancer characteristics (lesion type, lesion size, and pathology) and patient subgroups (race and ethnicity, age, and breast density) examined. Conclusion A categorical AI system helped improve overall radiologist interpretation performance of DBT screening mammograms for both general radiologists and breast imaging specialists and across various patient subgroups and breast cancer characteristics. Computer-aided Diagnosis, Screening Mammography, Digital Breast Tomosynthesis, Breast Cancer, Screening, Convolutional Neural Network (CNN), Artificial Intelligence . © RSNA, 2024.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Author contributions: Guarantors of integrity of entire study, J.G.K., J.H., A.G.S.; study concepts/study design or data acquisition or data analysis/interpretation, all authors; manuscript drafting or manuscript revision for important intellectual content, all authors; approval of final version of submitted manuscript, all authors; agrees to ensure any questions related to the work are appropriately resolved, all authors; literature research, J.G.K., B.H., J.H., W.L., A.G.S.; clinical studies, J.G.K., G.G., J.H., A.G.S.; experimental studies, J.G.K., B.H., A.R.D., G.G., H.L., W.L., A.G.S.; statistical analysis, J.G.K., B.H., A.R.D., A.S., H.L., J.H., W.L.; and manuscript editing, J.G.K., B.H., A.R.D., G.G., J.H., C.I.L., W.L., A.G.S.
ISSN:2638-6100
2638-6100
DOI:10.1148/ryai.230137