Variability of Breast Density Classification Between US and UK Radiologists

To assess whether subjective breast density categorization remains the most useful way to categorize mammographic breast density and whether variations exist across geographic regions with differing national legislation. Breast radiologists from two countries (UK, USA) were voluntarily recruited to...

Full description

Saved in:

Bibliographic Details
Published in:	Journal of medical imaging and radiation sciences Vol. 50; no. 1; pp. 53 - 61
Main Authors:	Alomaim, Wijdan, O'Leary, Desiree, Ryan, John, Rainford, Louise, Evanoff, Michael, Foley, Shane
Format:	Journal Article
Language:	English
Published:	United States Elsevier Inc 01-03-2019
Subjects:	BI-RADS Breast density Breast Density - physiology Female Humans intrarater variability mammography Mammography - classification Mammography - standards Mammography - statistics & numerical data Observer Variation Radiologists - standards Radiologists - statistics & numerical data United Kingdom United Kingdom BI-RADS Breast density intrarater variability mammography
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	To assess whether subjective breast density categorization remains the most useful way to categorize mammographic breast density and whether variations exist across geographic regions with differing national legislation. Breast radiologists from two countries (UK, USA) were voluntarily recruited to review sets of anonymized mammographic images (n = 180) and additional repeated images (n = 70), totaling 250 images, to subjectively rate breast density according to the Breast Imaging Reporting and Data system (BI-RADS) categorization. Images were reviewed using standardized viewing conditions and Ziltron software. Inter-rater reliability was analyzed using the Kappa test. The US radiologists (n = 25) judged fewer images as being “mostly fatty” than UK radiologists (n = 24), leading a greater number of images classified in the higher BI-RADS categories, particularly in BI-RADS 3. Overall agreement for all data sets was k = 0.654 indicating substantial agreement between the two cohorts. When the data were split into BI-RADS categories, the level of agreement varied from fair to substantial. Variations in how radiologists from the USA and UK classify breast density was established, especially when the data were divided into breast density categories. This variation supports the need for a reliable breast density assessment method to enhance the individualized supplemental screening pathways for dense breasts. The use of two-scale categorization method demonstrated improved agreement. Larger sample of radiologists from different breast density jurisdictions confirms international subjective variability in density categorization and improved agreement with the two-scale (low, high) categorization. With this variability, a standardized and automated breast density assessment shows to be timely. Évaluer si la catégorisation subjective de la densité mammaire reste la façon la plus utile de catégoriser la densité mammaire aux fins de la mammographie et déterminer s'il existe des variations entre les régions géographiques présentant des lois nationales différentes. Des radiologistes en mammographie de deux pays (Royaume-Uni et États-Unis) ont été recrutés sur une base volontaire pour réviser des séries d'images mammographiques anonymisées (n=180) et des images répétées additionnelles (n=70) afin d’évaluer de façon subjective la densité mammaire selon la catégorisation BI-RADS. Les images ont été examinées dans des conditions de visionnement normalisées avec le logiciel Ziltron. La fiabilité entre les examinateurs a été analysée à l'aide du test de Kappa. Les radiologistes des États-Unis (n=25) ont jugé moins d'images comme « principalement adipeux » que les radiologistes du Royaume-Uni (n=24), de sorte qu'un plus grand nombre d'images ont été classées dans les catégories BI-RADS supérieures, notamment en BI-RADS 3. L'accord global pour tous les ensembles de données était de k=0,654 indiquant un accord substantiel entre les deux cohortes. Lorsque les données étaient réparties par catégories BI-RADS, le niveau d'accord variait de bon à substantiel. Des variations dans la façon dont les radiologistes des États-Unis et du Royaume-Uni classifient la densité mammaire ont été établies, notamment lorsque les données étaient divisées en catégories de densité mammaire. Cette variation supporte la nécessité d'une méthode fiable d’évaluation de la densité mammaire pour bonifier les voies individualisées de dépistage supplémentaire pour les seins denses. L'utilisation d'une méthode de caractérisation à deux échelles a démontré une amélioration de l'accord. Progression des connaissances: un échantillon plus important de radiologistes de différentes juridictions de densité mammaire confirme la variabilité subjective internationale dans la catégorisation de la densité et l'amélioration de l'accord avec une catégorisation à deux niveaux (élevée, faible). Cette variabilité démontre qu'il est temps d'arriver à une évaluation normalisée et automatisée de la densité mammaire.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1939-8654 1876-7982
DOI:	10.1016/j.jmir.2018.11.002