Multiclass recognition and part localization with humans in the loop

We propose a visual recognition system that is designed for fine-grained visual categorization. The system is composed of a machine and a human user. The user, who is unable to carry out the recognition task by himself, is interactively asked to provide two heterogeneous forms of information: clicki...

Full description

Saved in:
Bibliographic Details
Published in:2011 International Conference on Computer Vision pp. 2524 - 2531
Main Authors: Wah, C., Branson, S., Perona, P., Belongie, S.
Format: Conference Proceeding
Language:English
Published: IEEE 01-11-2011
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We propose a visual recognition system that is designed for fine-grained visual categorization. The system is composed of a machine and a human user. The user, who is unable to carry out the recognition task by himself, is interactively asked to provide two heterogeneous forms of information: clicking on object parts and answering binary questions. The machine intelligently selects the most informative question to pose to the user in order to identify the object's class as quickly as possible. By leveraging computer vision and analyzing the user responses, the overall amount of human effort required, measured in seconds, is minimized. We demonstrate promising results on a challenging dataset of uncropped images, achieving a significant average reduction in human effort over previous methods.
ISBN:9781457711015
145771101X
ISSN:1550-5499
2380-7504
DOI:10.1109/ICCV.2011.6126539