Automated classification of cytogenetic abnormalities in hematolymphoid neoplasms

Abstract Motivation Algorithms for classifying chromosomes, like convolutional deep neural networks (CNNs), show promise to augment cytogeneticists’ workflows; however, a critical limitation is their inability to accurately classify various structural chromosomal abnormalities. In hematopathology, r...

Full description

Saved in:
Bibliographic Details
Published in:Bioinformatics (Oxford, England) Vol. 38; no. 5; pp. 1420 - 1426
Main Authors: Cox, Andrew, Park, Chanhee, Koduru, Prasad, Wilson, Kathleen, Weinberg, Olga, Chen, Weina, García, Rolando, Kim, Daehwan
Format: Journal Article
Language:English
Published: England Oxford University Press 07-02-2022
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Motivation Algorithms for classifying chromosomes, like convolutional deep neural networks (CNNs), show promise to augment cytogeneticists’ workflows; however, a critical limitation is their inability to accurately classify various structural chromosomal abnormalities. In hematopathology, recurrent structural cytogenetic abnormalities herald diagnostic, prognostic and therapeutic implications, but are laborious for expert cytogeneticists to identify. Non-recurrent cytogenetic abnormalities also occur frequently cancerous cells. Here, we demonstrate the feasibility of using CNNs to accurately classify many recurrent cytogenetic abnormalities while being able to reliably detect non-recurrent, spurious abnormal chromosomes, as well as provide insights into dataset assembly, model selection and training methodology that improve overall generalizability and performance for chromosome classification. Results Our top-performing model achieved a mean weighted F1 score of 96.86% on the validation set and 94.03% on the test set. Gradient class activation maps indicated that our model learned biologically meaningful feature maps, reinforcing the clinical utility of our proposed approach. Altogether, this work: proposes a new dataset framework for training chromosome classifiers for use in a clinical environment, reveals that residual CNNs and cyclical learning rates confer superior performance, and demonstrates the feasibility of using this approach to automatically screen for many recurrent cytogenetic abnormalities while adeptly classifying non-recurrent abnormal chromosomes. Availability and implementation Software is freely available at https://github.com/DaehwanKimLab/Chromosome-ReAd. The data underlying this article cannot be shared publicly due to it being protected patient information. Supplementary information Supplementary data are available at Bioinformatics online.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1367-4803
1367-4811
DOI:10.1093/bioinformatics/btab822