Search Results - "Mathew, Minesh"
1
Asking questions on handwritten document collections
Published in International journal on document analysis and recognition (01-09-2021)“…This work addresses the problem of Question Answering (QA) on handwritten document collections. Unlike typical QA and Visual Question Answering (VQA)…”
Journal Article
2
RoadText-1K: Text Detection & Recognition Dataset for Driving Videos
Published in 2020 IEEE International Conference on Robotics and Automation (ICRA) (01-05-2020)“…Perceiving text is crucial to understand semantics of outdoor scenes and hence is a critical requirement to build intelligent systems for driver assistance and…”
Conference Proceeding
3
DocVQA: A Dataset for VQA on Document Images
Published in 2021 IEEE Winter Conference on Applications of Computer Vision (WACV) (01-01-2021)“…We present a new dataset for Visual Question Answering (VQA) on document images called DocVQA. The dataset consists of 50,000 questions defined on 12,000+…”
Conference Proceeding
4
MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
Published in 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI) (13-04-2021)“…Images in the medical domain are fundamentally different from the general domain images. Consequently, it is infeasible to directly employ general domain…”
Conference Proceeding
5
Watching the News: Towards VideoQA Models that can Read
Published in 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (01-01-2023)“…Video Question Answering methods focus on common-sense reasoning and visual cognition of objects or persons and their interactions over time. Current VideoQA…”
Conference Proceeding
6
InfographicVQA
Published in 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (01-01-2022)“…Infographics communicate information using a combination of textual, graphical and visual elements. This work explores the automatic understanding of…”
Conference Proceeding
7
Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam
Published in 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) (01-11-2017)“…Inspired by the success of Deep Learning based approaches to English scene text recognition, we pose and benchmark scene text recognition for three Indic…”
Conference Proceeding
8
ICDAR 2019 Competition on Scene Text Visual Question Answering
Published in 2019 International Conference on Document Analysis and Recognition (ICDAR) (01-09-2019)“…This paper presents final results of ICDAR 2019 Scene Text Visual Question Answering competition (ST-VQA). ST-VQA introduces an important aspect that is not…”
Conference Proceeding
9
An empirical study of CTC based models for OCR of Indian languages
Published 13-05-2022“…Recognition of text on word or line images, without the need for sub-word segmentation has become the mainstream of research and development of text…”
Journal Article
10
Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Published 04-09-2023“…Researchers have extensively studied the field of vision and language, discovering that both visual and textual content is crucial for understanding scenes…”
Journal Article
11
Watching the News: Towards VideoQA Models that can Read
Published 10-11-2022“…Video Question Answering methods focus on commonsense reasoning and visual cognition of objects or persons and their interactions over time. Current VideoQA…”
Journal Article
12
DocVQA: A Dataset for VQA on Document Images
Published 01-07-2020“…We present a new dataset for Visual Question Answering (VQA) on document images called DocVQA. The dataset consists of 50,000 questions defined on 12,000+…”
Journal Article
13
Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Published in 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) (02-10-2023)“…Researchers have extensively studied the field of vision and language, discovering that both visual and textual content is crucial for understanding scenes…”
Conference Proceeding
14
Reading Between the Lanes: Text VideoQA on the Road
Published 08-07-2023“…Text and signs around roads provide crucial information for drivers, vital for safe navigation and situational awareness. Scene text recognition in motion is a…”
Journal Article
15
Improving CNN-RNN Hybrid Networks for Handwriting Recognition
Published in 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR) (01-08-2018)“…The success of deep learning based models have centered around recent architectures and the availability of large scale annotated data. In this work, we…”
Conference Proceeding
16
Unconstrained OCR for Urdu Using Deep CNN-RNN Hybrid Networks
Published in 2017 4th IAPR Asian Conference on Pattern Recognition (ACPR) (01-11-2017)“…Building robust text recognition systems for languages with cursive scripts like Urdu has always been challenging. Intricacies of the script and the absence of…”
Conference Proceeding
17
Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam
Published 09-04-2021“…Inspired by the success of Deep Learning based approaches to English scene text recognition, we pose and benchmark scene text recognition for three Indic…”
Journal Article
18
Towards Spotting and Recognition of Handwritten Words in Indic Scripts
Published in 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR) (01-08-2018)“…Handwriting recognition (HWR) in Indic scripts is a challenging problem due to the inherent subtleties in the scripts, cursive nature of the handwriting and…”
Conference Proceeding
19
Offline Handwriting Recognition on Devanagari Using a New Benchmark Dataset
Published in 2018 13th IAPR International Workshop on Document Analysis Systems (DAS) (01-04-2018)“…Handwriting recognition (HWR) in Indic scripts, like Devanagari is very challenging due to the subtleties in the scripts, variations in rendering and the…”
Conference Proceeding
20
ICDAR 2021 Competition on Document Visual Question Answering
Published 10-11-2021“…In this report we present results of the ICDAR 2021 edition of the Document Visual Question Challenges. This edition complements the previous tasks on Single…”
Journal Article