A Deep Learning Method with Cross Dropout Focal Loss Function for Imbalanced Semantic Segmentation

Deep learning methods have proven their potential in semantic segmentation. However, they depend on the data quality and training process. Often, the data corresponding to the objects to be segmented are of different sizes and this creates difficulties for the segmentation method. Objects are segmen...

Full description

Saved in:
Bibliographic Details
Published in:2022 Sensor Data Fusion: Trends, Solutions, Applications (SDF) pp. 1 - 6
Main Authors: Su, Jingxuan, Anderson, Sean, Mihaylova, Lyudmila S.
Format: Conference Proceeding
Language:English
Published: IEEE 12-10-2022
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Deep learning methods have proven their potential in semantic segmentation. However, they depend on the data quality and training process. Often, the data corresponding to the objects to be segmented are of different sizes and this creates difficulties for the segmentation method. Objects are segmented and associated with categories during the training process. Data imbalance is a challenging problem, which often results in unsatisfactory segmentation performance. This paper proposes a solution to this task based on a novel cross dropout focal loss (CDFL) function, which represents well the change between the cross-entropy and other state-of-the-art loss functions providing a balance between the precision and accuracy of segmentation. The performance of the considered fully convolutional network (FCN) with different loss functions is considered and carefully evaluated. The proposed loss function improves efficiently the semantic segmentation performance over other well-known loss functions. It is demonstrated on Cityscapes and PASCAL VOC 2010 publicly available datasets. The implementation is over relatively large data sets. The achieved mean accuracy of the proposed CDFL network on Cityscapes dataset is 76.41% and on PASCAL VOC 2010 dataset is 79.63% which is with approximately 2.5% improvement compared with the same network implemented with the cross-entropy loss function.
DOI:10.1109/SDF55338.2022.9931700