Development and Validation of an AI-driven Mammographic Breast Density Classification Tool Based on Radiologist Consensus

Mammographic breast density (BD) is commonly visually assessed using the Breast Imaging Reporting and Data System (BI-RADS) four-category scale. To overcome inter- and intraobserver variability of visual assessment, the authors retrospectively developed and externally validated a software for BD cla...

Full description

Saved in:
Bibliographic Details
Published in:Radiology. Artificial intelligence Vol. 4; no. 2; p. e210199
Main Authors: Magni, Veronica, Interlenghi, Matteo, Cozzi, Andrea, Alì, Marco, Salvatore, Christian, Azzena, Alcide A, Capra, Davide, Carriero, Serena, Della Pepa, Gianmarco, Fazzini, Deborah, Granata, Giuseppe, Monti, Caterina B, Muscogiuri, Giulia, Pellegrino, Giuseppe, Schiaffino, Simone, Castiglioni, Isabella, Papa, Sergio, Sardanelli, Francesco
Format: Journal Article
Language:English
Published: United States Radiological Society of North America 01-03-2022
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Mammographic breast density (BD) is commonly visually assessed using the Breast Imaging Reporting and Data System (BI-RADS) four-category scale. To overcome inter- and intraobserver variability of visual assessment, the authors retrospectively developed and externally validated a software for BD classification based on convolutional neural networks from mammograms obtained between 2017 and 2020. The tool was trained using the majority BD category determined by seven board-certified radiologists who independently visually assessed 760 mediolateral oblique (MLO) images in 380 women (mean age, 57 years ± 6 [SD]) from center 1; this process mimicked training from a consensus of several human readers. External validation of the model was performed by the three radiologists whose BD assessment was closest to the majority (consensus) of the initial seven on a dataset of 384 MLO images in 197 women (mean age, 56 years ± 13) obtained from center 2. The model achieved an accuracy of 89.3% in distinguishing BI-RADS a or b (nondense breasts) versus c or d (dense breasts) categories, with an agreement of 90.4% (178 of 197 mammograms) and a reliability of 0.807 (Cohen κ) compared with the mode of the three readers. This study demonstrates accuracy and reliability of a fully automated software for BD classification. Mammography, Breast, Convolutional Neural Network (CNN), Deep Learning Algorithms, Machine Learning Algorithms © RSNA, 2022.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Author contributions: Guarantors of integrity of entire study, V.M., M.I., C.S., M.A., I.C., S.P., F.S.; study concepts/study design or data acquisition or data analysis/interpretation, all authors; manuscript drafting or manuscript revision for important intellectual content, all authors; approval of final version of submitted manuscript, all authors; agrees to ensure any questions related to the work are appropriately resolved, all authors; literature research, V.M., M.I., A.C., M.A., C.S., A.A.A., S.C., G.D.P., D.F., I.C., S.P., F.S.; clinical studies, V.M., A.C., S.C., G.D.P., D.F., G.G., G.P., S.S., M.A., S.P., F.S.; statistical analysis, V.M., M.I., A.C., M.A., C.S., S.C., I.C.; and manuscript editing, V.M., M.I., A.C., M.A., C.S., S.C., I.C., S.P., F.S.
ISSN:2638-6100
2638-6100
DOI:10.1148/ryai.210199