Automated classification of cytogenetic abnormalities in hematolymphoid neoplasms
Abstract Motivation Algorithms for classifying chromosomes, like convolutional deep neural networks (CNNs), show promise to augment cytogeneticists’ workflows; however, a critical limitation is their inability to accurately classify various structural chromosomal abnormalities. In hematopathology, r...
Saved in:
Published in: | Bioinformatics (Oxford, England) Vol. 38; no. 5; pp. 1420 - 1426 |
---|---|
Main Authors: | , , , , , , , |
Format: | Journal Article |
Language: | English |
Published: |
England
Oxford University Press
07-02-2022
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Abstract
Motivation
Algorithms for classifying chromosomes, like convolutional deep neural networks (CNNs), show promise to augment cytogeneticists’ workflows; however, a critical limitation is their inability to accurately classify various structural chromosomal abnormalities. In hematopathology, recurrent structural cytogenetic abnormalities herald diagnostic, prognostic and therapeutic implications, but are laborious for expert cytogeneticists to identify. Non-recurrent cytogenetic abnormalities also occur frequently cancerous cells. Here, we demonstrate the feasibility of using CNNs to accurately classify many recurrent cytogenetic abnormalities while being able to reliably detect non-recurrent, spurious abnormal chromosomes, as well as provide insights into dataset assembly, model selection and training methodology that improve overall generalizability and performance for chromosome classification.
Results
Our top-performing model achieved a mean weighted F1 score of 96.86% on the validation set and 94.03% on the test set. Gradient class activation maps indicated that our model learned biologically meaningful feature maps, reinforcing the clinical utility of our proposed approach. Altogether, this work: proposes a new dataset framework for training chromosome classifiers for use in a clinical environment, reveals that residual CNNs and cyclical learning rates confer superior performance, and demonstrates the feasibility of using this approach to automatically screen for many recurrent cytogenetic abnormalities while adeptly classifying non-recurrent abnormal chromosomes.
Availability and implementation
Software is freely available at https://github.com/DaehwanKimLab/Chromosome-ReAd. The data underlying this article cannot be shared publicly due to it being protected patient information.
Supplementary information
Supplementary data are available at Bioinformatics online. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 1367-4803 1367-4811 |
DOI: | 10.1093/bioinformatics/btab822 |