Combined Convolutional and Recurrent Neural Networks for Hierarchical Classification of Images

Deep learning models based on CNNs are predominantly used in image classification tasks. Such approaches, assuming independence of object categories, normally use a CNN as a feature learner and apply a flat classifier on top of it. Object classes in many settings have known hierarchical relations, a...

Full description

Saved in:
Bibliographic Details
Published in:2020 IEEE International Conference on Big Data (Big Data) pp. 1354 - 1361
Main Authors: Koo, Jaehoon, Klabjan, Diego, Utke, Jean
Format: Conference Proceeding
Language:English
Published: IEEE 10-12-2020
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Deep learning models based on CNNs are predominantly used in image classification tasks. Such approaches, assuming independence of object categories, normally use a CNN as a feature learner and apply a flat classifier on top of it. Object classes in many settings have known hierarchical relations, and classifiers exploiting these relations should perform better. We propose hierarchical classification models combining a CNN to extract hierarchical representations of images, and an RNN or sequence-to-sequence model to capture a hierarchical tree of classes. In addition, we apply residual learning to the RNN part in order to facilitate training our compound model and improve generalization of the model. Experimental results on a public and a real world proprietary dataset of images show that our hierarchical networks perform better than state-of-the-art CNNs.
DOI:10.1109/BigData50022.2020.9378237