Search Results - "Jawahar, C.V."
-
1
Improved Road Connectivity by Joint Learning of Orientation and Segmentation
Published in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01-06-2019)“…Road network extraction from satellite images often produce fragmented road segments leading to road maps unfit for real applications. Pixel-wise…”
Get full text
Conference Proceeding -
2
Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
Published in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01-01-2020)“…Humans involuntarily tend to infer parts of the conversation from lip movements when the speech is absent or corrupted by external noise. In this work, we…”
Get full text
Conference Proceeding -
3
Scene Text Visual Question Answering
Published in 2019 IEEE/CVF International Conference on Computer Vision (ICCV) (01-10-2019)“…Current visual question answering datasets do not consider the rich semantic information conveyed by text within an image. In this work, we present a new…”
Get full text
Conference Proceeding -
4
Dissimilarity Coefficient Based Weakly Supervised Object Detection
Published in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01-06-2019)“…We consider the problem of weakly supervised object detection, where the training samples are annotated using only image-level labels that indicate the…”
Get full text
Conference Proceeding -
5
Universal Semi-Supervised Semantic Segmentation
Published in 2019 IEEE/CVF International Conference on Computer Vision (ICCV) (01-10-2019)“…In recent years, the need for semantic segmentation has arisen across several different applications and environments. However, the expense and redundancy of…”
Get full text
Conference Proceeding -
6
Efficient Optimization for Rank-Based Loss Functions
Published in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (01-06-2018)“…The accuracy of information retrieval systems is often measured using complex loss functions such as the average precision (AP) or the normalized discounted…”
Get full text
Conference Proceeding -
7
Multi-Domain Incremental Learning for Semantic Segmentation
Published in 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (01-01-2022)“…Recent efforts in multi-domain learning for semantic segmentation attempt to learn multiple geographical datasets in a universal, joint model. A simple…”
Get full text
Conference Proceeding -
8
DGAZE: Driver Gaze Mapping on Road
Published in 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (24-10-2020)“…Driver gaze mapping is crucial to estimate driver attention and determine which objects the driver is focusing on while driving. We introduce DGAZE, the first…”
Get full text
Conference Proceeding -
9
A Multi-Space Approach to Zero-Shot Object Detection
Published in 2020 IEEE Winter Conference on Applications of Computer Vision (WACV) (01-03-2020)“…Object detection has been at the forefront for higher level vision tasks such as scene understanding and contextual reasoning. Therefore, solving object…”
Get full text
Conference Proceeding -
10
Improving Word Recognition using Multiple Hypotheses and Deep Embeddings
Published in 2020 25th International Conference on Pattern Recognition (ICPR) (10-01-2021)“…We propose a novel scheme for improving the word recognition accuracy using word image embeddings. We use a trained text recognizer, which can predict multiple…”
Get full text
Conference Proceeding -
11
Visual Speech Enhancement Without A Real Visual Stream
Published in 2021 IEEE Winter Conference on Applications of Computer Vision (WACV) (01-01-2021)“…In this work, we re-think the task of speech enhancement in unconstrained real-world environments. Current state- of-the-art methods use only the audio stream…”
Get full text
Conference Proceeding -
12
New Objects on the Road? No Problem, We'll Learn Them Too
Published in 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (23-10-2022)“…Object detection plays an essential role in providing localization, path planning, and decision making capabilities in autonomous navigation systems. However,…”
Get full text
Conference Proceeding -
13
A Deep Learning Approach for Robust Corridor Following
Published in 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (01-11-2019)“…For an autonomous corridor following task where the environment is continuously changing, several forms of environmental noise prevent an automated feature…”
Get full text
Conference Proceeding -
14
ICDAR 2019 Competition on Scene Text Visual Question Answering
Published in 2019 International Conference on Document Analysis and Recognition (ICDAR) (01-09-2019)“…This paper presents final results of ICDAR 2019 Scene Text Visual Question Answering competition (ST-VQA). ST-VQA introduces an important aspect that is not…”
Get full text
Conference Proceeding -
15
RoadText-1K: Text Detection & Recognition Dataset for Driving Videos
Published in 2020 IEEE International Conference on Robotics and Automation (ICRA) (01-05-2020)“…Perceiving text is crucial to understand semantics of outdoor scenes and hence is a critical requirement to build intelligent systems for driver assistance and…”
Get full text
Conference Proceeding -
16
Detecting, Tracking and Counting Motorcycle Rider Traffic Violations on Unconstrained Roads
Published in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01-06-2022)“…In many Asian countries with unconstrained road traffic conditions, driving violations such as not wearing helmets and triple-riding are a significant source…”
Get full text
Conference Proceeding -
17
Bringing semantics into word image representation
Published in Pattern recognition (01-12-2020)“…•We propose a normalized word representation which is invariant to word form inflections.•We introduce a novel semantic representation for word images which…”
Get full text
Journal Article -
18
Trajectory aligned features for first person action recognition
Published in Pattern recognition (01-02-2017)“…Egocentric videos are characterized by their ability to have the first person view. With the popularity of Google Glass and GoPro, use of egocentric videos is…”
Get full text
Journal Article -
19
Dataset agnostic document object detection
Published in Pattern recognition (01-10-2023)“…•Present an end-to-end trainable DOLNet to detect document objects more accurately.•DOLNet consists of Cascade Mask R-CNN, composite backbones with deformable…”
Get full text
Journal Article -
20
Looking Farther in Parametric Scene Parsing with Ground and Aerial Imagery
Published in 2021 IEEE International Conference on Robotics and Automation (ICRA) (30-05-2021)“…Parametric models that represent layout in terms of scene attributes are an attractive avenue for road scene understanding in autonomous navigation. Prior…”
Get full text
Conference Proceeding