18FFDG PET radiomics to predict disease-free survival in cervical cancer: a multi-scanner/center study with external validation

To test the performances of native and tumour to liver ratio (TLR) radiomic features extracted from pre-treatment 2-[18F] fluoro-2-deoxy-D-glucose ([18F]FDG) PET/CT and combined with machine learning (ML) for predicting cancer recurrence in patients with locally advanced cervical cancer (LACC).PURPO...

Full description

Saved in:
Bibliographic Details
Published in:European journal of nuclear medicine and molecular imaging Vol. 48; no. 11; p. 3432
Main Authors: Ferreira, Marta, Lovinfosse, Pierre, Hermesse, Johanne, Decuypere, Marjolein, Rousseau, Caroline, Lucia, François, Schick, Ulrike, Reinhold, Caroline, Robin, Philippe, Hatt, Mathieu, Visvikis, Dimitris, Bernard, Claire, Leijenaar, Ralph T H, Kridelka, Frédéric, Lambin, Philippe, Meyer, Patrick E, Hustinx, Roland
Format: Journal Article
Language:English
Published: 01-10-2021
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:To test the performances of native and tumour to liver ratio (TLR) radiomic features extracted from pre-treatment 2-[18F] fluoro-2-deoxy-D-glucose ([18F]FDG) PET/CT and combined with machine learning (ML) for predicting cancer recurrence in patients with locally advanced cervical cancer (LACC).PURPOSETo test the performances of native and tumour to liver ratio (TLR) radiomic features extracted from pre-treatment 2-[18F] fluoro-2-deoxy-D-glucose ([18F]FDG) PET/CT and combined with machine learning (ML) for predicting cancer recurrence in patients with locally advanced cervical cancer (LACC).One hundred fifty-eight patients with LACC from multiple centers were retrospectively included in the study. Tumours were segmented using the Fuzzy Local Adaptive Bayesian (FLAB) algorithm. Radiomic features were extracted from the tumours and from regions drawn over the normal liver. Cox proportional hazard model was used to test statistical significance of clinical and radiomic features. Fivefold cross validation was used to tune the number of features. Seven different feature selection methods and four classifiers were tested. The models with the selected features were trained using bootstrapping and tested in data from each scanner independently. Reproducibility of radiomics features, clinical data added value and effect of ComBat-based harmonisation were evaluated across scanners.METHODSOne hundred fifty-eight patients with LACC from multiple centers were retrospectively included in the study. Tumours were segmented using the Fuzzy Local Adaptive Bayesian (FLAB) algorithm. Radiomic features were extracted from the tumours and from regions drawn over the normal liver. Cox proportional hazard model was used to test statistical significance of clinical and radiomic features. Fivefold cross validation was used to tune the number of features. Seven different feature selection methods and four classifiers were tested. The models with the selected features were trained using bootstrapping and tested in data from each scanner independently. Reproducibility of radiomics features, clinical data added value and effect of ComBat-based harmonisation were evaluated across scanners.After a median follow-up of 23 months, 29% of the patients recurred. No individual radiomic or clinical features were significantly associated with cancer recurrence. The best model was obtained using 10 TLR features combined with clinical information. The area under the curve (AUC), F1-score, precision and recall were respectively 0.78 (0.67-0.88), 0.49 (0.25-0.67), 0.42 (0.25-0.60) and 0.63 (0.20-0.80). ComBat did not improve the predictive performance of the best models. Both the TLR and the native models performance varied across scanners used in the test set.RESULTSAfter a median follow-up of 23 months, 29% of the patients recurred. No individual radiomic or clinical features were significantly associated with cancer recurrence. The best model was obtained using 10 TLR features combined with clinical information. The area under the curve (AUC), F1-score, precision and recall were respectively 0.78 (0.67-0.88), 0.49 (0.25-0.67), 0.42 (0.25-0.60) and 0.63 (0.20-0.80). ComBat did not improve the predictive performance of the best models. Both the TLR and the native models performance varied across scanners used in the test set.[18F]FDG PET radiomic features combined with ML add relevant information to the standard clinical parameters in terms of LACC patient's outcome but remain subject to variability across PET/CT devices.CONCLUSION[18F]FDG PET radiomic features combined with ML add relevant information to the standard clinical parameters in terms of LACC patient's outcome but remain subject to variability across PET/CT devices.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
content type line 23
ObjectType-Feature-2
ISSN:1619-7089
1619-7089
DOI:10.1007/s00259-021-05303-5