Search Results - "Mathew, Minesh"

1
Asking questions on handwritten document collections by Mathew, Minesh, Gomez, Lluis, Karatzas, Dimosthenis, Jawahar, C. V.

Published in International journal on document analysis and recognition (01-09-2021)
“…This work addresses the problem of Question Answering (QA) on handwritten document collections. Unlike typical QA and Visual Question Answering (VQA)…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
RoadText-1K: Text Detection & Recognition Dataset for Driving Videos by Reddy, Sangeeth, Mathew, Minesh, Gomez, Lluis, Rusinol, Marcal, Karatzas, Dimosthenis, Jawahar, C.V.

Published in 2020 IEEE International Conference on Robotics and Automation (ICRA) (01-05-2020)
“…Perceiving text is crucial to understand semantics of outdoor scenes and hence is a critical requirement to build intelligent systems for driver assistance and…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
3
DocVQA: A Dataset for VQA on Document Images by Mathew, Minesh, Karatzas, Dimosthenis, Jawahar, C. V.

Published in 2021 IEEE Winter Conference on Applications of Computer Vision (WACV) (01-01-2021)
“…We present a new dataset for Visual Question Answering (VQA) on document images called DocVQA. The dataset consists of 50,000 questions defined on 12,000+…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
4
MMBERT: Multimodal BERT Pretraining for Improved Medical VQA by Khare, Yash, Bagal, Viraj, Mathew, Minesh, Devi, Adithi, Priyakumar, U Deva, Jawahar, CV

Published in 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI) (13-04-2021)
“…Images in the medical domain are fundamentally different from the general domain images. Consequently, it is infeasible to directly employ general domain…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
5
Watching the News: Towards VideoQA Models that can Read by Jahagirdar, Soumya, Mathew, Minesh, Karatzas, Dimosthenis, Jawahar, C. V.

Published in 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (01-01-2023)
“…Video Question Answering methods focus on common-sense reasoning and visual cognition of objects or persons and their interactions over time. Current VideoQA…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
6
InfographicVQA by Mathew, Minesh, Bagal, Viraj, Tito, Ruben, Karatzas, Dimosthenis, Valveny, Ernest, Jawahar, C. V.

Published in 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (01-01-2022)
“…Infographics communicate information using a combination of textual, graphical and visual elements. This work explores the automatic understanding of…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
7
Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam by Mathew, Minesh, Jain, Mohit, Jawahar, C. V.

Published in 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) (01-11-2017)
“…Inspired by the success of Deep Learning based approaches to English scene text recognition, we pose and bench-mark scene text recognition for three Indic…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
8
ICDAR 2019 Competition on Scene Text Visual Question Answering by Furkan Biten, Ali, Tito, Ruben, Mafla, Andres, Gomez, Lluis, Rusinol, Marcal, Mathew, Minesh, Jawahar, C.V., Valveny, Ernest, Karatzas, Dimosthenis

Published in 2019 International Conference on Document Analysis and Recognition (ICDAR) (01-09-2019)
“…This paper presents final results of ICDAR 2019 Scene Text Visual Question Answering competition (ST-VQA). ST-VQA introduces an important aspect that is not…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
9
An empirical study of CTC based models for OCR of Indian languages by Mathew, Minesh, Jawahar, CV

Published 13-05-2022
“…Recognition of text on word or line images, without the need for sub-word segmentation has become the mainstream of research and development of text…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
10
Understanding Video Scenes through Text: Insights from Text-based Video Question Answering by Jahagirdar, Soumya, Mathew, Minesh, Karatzas, Dimosthenis, Jawahar, C. V

Published 04-09-2023
“…Researchers have extensively studied the field of vision and language, discovering that both visual and textual content is crucial for understanding scenes…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
11
Watching the News: Towards VideoQA Models that can Read by Jahagirdar, Soumya, Mathew, Minesh, Karatzas, Dimosthenis, Jawahar, C. V

Published 10-11-2022
“…Video Question Answering methods focus on commonsense reasoning and visual cognition of objects or persons and their interactions over time. Current VideoQA…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
12
DocVQA: A Dataset for VQA on Document Images by Mathew, Minesh, Karatzas, Dimosthenis, Jawahar, C. V

Published 01-07-2020
“…We present a new dataset for Visual Question Answering (VQA) on document images called DocVQA. The dataset consists of 50,000 questions defined on 12,000+…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
13
Understanding Video Scenes through Text: Insights from Text-based Video Question Answering by Jahagirdar, Soumya, Mathew, Minesh, Karatzas, Dimosthenis, Jawahar, C. V.

Published in 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) (02-10-2023)
“…Researchers have extensively studied the field of vision and language, discovering that both visual and textual content is crucial for understanding scenes…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
14
Reading Between the Lanes: Text VideoQA on the Road by Tom, George, Mathew, Minesh, Garcia, Sergi, Karatzas, Dimosthenis, Jawahar, C. V

Published 08-07-2023
“…Text and signs around roads provide crucial information for drivers, vital for safe navigation and situational awareness. Scene text recognition in motion is a…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
15
Improving CNN-RNN Hybrid Networks for Handwriting Recognition by Dutta, Kartik, Krishnan, Praveen, Mathew, Minesh, Jawahar, C.V.

Published in 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR) (01-08-2018)
“…The success of deep learning based models have centered around recent architectures and the availability of large scale annotated data. In this work, we…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
16
Unconstrained OCR for Urdu Using Deep CNN-RNN Hybrid Networks by Jain, Mohit, Mathew, Minesh, Jawahar, C.V.

Published in 2017 4th IAPR Asian Conference on Pattern Recognition (ACPR) (01-11-2017)
“…Building robust text recognition systems for languages with cursive scripts like Urdu has always been challenging. Intricacies of the script and the absence of…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
17
Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam by Mathew, Minesh, Jain, Mohit, Jawahar, CV

Published 09-04-2021
“…Inspired by the success of Deep Learning based approaches to English scene text recognition, we pose and benchmark scene text recognition for three Indic…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
18
Towards Spotting and Recognition of Handwritten Words in Indic Scripts by Dutta, Kartik, Krishnan, Praveen, Mathew, Minesh, Jawahar, C.V.

Published in 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR) (01-08-2018)
“…Handwriting recognition (HWR) in Indic scripts is a challenging problem due to the inherent subtleties in the scripts, cursive nature of the handwriting and…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
19
Offline Handwriting Recognition on Devanagari Using a New Benchmark Dataset by Dutta, Kartik, Krishnan, Praveen, Mathew, Minesh, Jawahar, C.V.

Published in 2018 13th IAPR International Workshop on Document Analysis Systems (DAS) (01-04-2018)
“…Handwriting recognition (HWR) in Indic scripts, like Devanagari is very challenging due to the subtleties in the scripts, variations in rendering and the…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
20
ICDAR 2021 Competition on Document VisualQuestion Answering by Tito, Rubèn, Mathew, Minesh, Jawahar, C. V, Valveny, Ernest, Karatzas, Dimosthenis

Published 10-11-2021
“…In this report we present results of the ICDAR 2021 edition of the Document Visual Question Challenges. This edition complements the previous tasks on Single…”

Get full text

Journal Article
QR Code
Save to List

Saved in:

Search Results - "Mathew, Minesh"

Asking questions on handwritten document collections by Mathew, Minesh, Gomez, Lluis, Karatzas, Dimosthenis, Jawahar, C. V.

RoadText-1K: Text Detection & Recognition Dataset for Driving Videos by Reddy, Sangeeth, Mathew, Minesh, Gomez, Lluis, Rusinol, Marcal, Karatzas, Dimosthenis, Jawahar, C.V.

DocVQA: A Dataset for VQA on Document Images by Mathew, Minesh, Karatzas, Dimosthenis, Jawahar, C. V.

MMBERT: Multimodal BERT Pretraining for Improved Medical VQA by Khare, Yash, Bagal, Viraj, Mathew, Minesh, Devi, Adithi, Priyakumar, U Deva, Jawahar, CV

Watching the News: Towards VideoQA Models that can Read by Jahagirdar, Soumya, Mathew, Minesh, Karatzas, Dimosthenis, Jawahar, C. V.

InfographicVQA by Mathew, Minesh, Bagal, Viraj, Tito, Ruben, Karatzas, Dimosthenis, Valveny, Ernest, Jawahar, C. V.

Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam by Mathew, Minesh, Jain, Mohit, Jawahar, C. V.

ICDAR 2019 Competition on Scene Text Visual Question Answering by Furkan Biten, Ali, Tito, Ruben, Mafla, Andres, Gomez, Lluis, Rusinol, Marcal, Mathew, Minesh, Jawahar, C.V., Valveny, Ernest, Karatzas, Dimosthenis

An empirical study of CTC based models for OCR of Indian languages by Mathew, Minesh, Jawahar, CV

Understanding Video Scenes through Text: Insights from Text-based Video Question Answering by Jahagirdar, Soumya, Mathew, Minesh, Karatzas, Dimosthenis, Jawahar, C. V

Watching the News: Towards VideoQA Models that can Read by Jahagirdar, Soumya, Mathew, Minesh, Karatzas, Dimosthenis, Jawahar, C. V

DocVQA: A Dataset for VQA on Document Images by Mathew, Minesh, Karatzas, Dimosthenis, Jawahar, C. V

Understanding Video Scenes through Text: Insights from Text-based Video Question Answering by Jahagirdar, Soumya, Mathew, Minesh, Karatzas, Dimosthenis, Jawahar, C. V.

Reading Between the Lanes: Text VideoQA on the Road by Tom, George, Mathew, Minesh, Garcia, Sergi, Karatzas, Dimosthenis, Jawahar, C. V

Improving CNN-RNN Hybrid Networks for Handwriting Recognition by Dutta, Kartik, Krishnan, Praveen, Mathew, Minesh, Jawahar, C.V.

Unconstrained OCR for Urdu Using Deep CNN-RNN Hybrid Networks by Jain, Mohit, Mathew, Minesh, Jawahar, C.V.

Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam by Mathew, Minesh, Jain, Mohit, Jawahar, CV

Towards Spotting and Recognition of Handwritten Words in Indic Scripts by Dutta, Kartik, Krishnan, Praveen, Mathew, Minesh, Jawahar, C.V.

Offline Handwriting Recognition on Devanagari Using a New Benchmark Dataset by Dutta, Kartik, Krishnan, Praveen, Mathew, Minesh, Jawahar, C.V.

ICDAR 2021 Competition on Document VisualQuestion Answering by Tito, Rubèn, Mathew, Minesh, Jawahar, C. V, Valveny, Ernest, Karatzas, Dimosthenis

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication