Search Results - "Gandhe, Ankur"
1
Audio-Attention Discriminative Language Model for ASR Rescoring
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…End-to-end approaches for automatic speech recognition (ASR) benefit from directly modeling the probability of the word sequence given the input audio stream…”
Conference Proceeding -
2
USTED: Improving ASR with a Unified Speech and Text Encoder-Decoder
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…Improving end-to-end speech recognition by incorporating external text data has been a longstanding research topic. There has been a recent focus on training…”
Conference Proceeding -
3
RescoreBERT: Discriminative Speech Recognition Rescoring With BERT
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…Second-pass rescoring is an important component in automatic speech recognition (ASR) systems that is used to improve the outputs from a first-pass decoder by…”
Conference Proceeding -
4
On-the-Fly Text Retrieval for end-to-end ASR Adaptation
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)“…End-to-end speech recognition models are improved by incorporating external text sources, typically by fusion with an external language model. Such language…”
Conference Proceeding -
5
A Likelihood Ratio Based Domain Adaptation Method for E2E Models
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…End-to-end (E2E) automatic speech recognition models like Recurrent Neural Networks Transducer (RNN-T) are becoming a popular choice for streaming ASR…”
Conference Proceeding -
6
Personalization Strategies for End-to-End Speech Recognition Systems
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06-06-2021)“…The recognition of personalized content, such as contact names, remains a challenging problem for end-to-end speech recognition systems. In this work, we…”
Conference Proceeding -
7
Domain-Aware Neural Language Models for Speech Recognition
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06-06-2021)“…As voice assistants become more ubiquitous, they are increasingly expected to support and perform well on a wide variety of use-cases across different domains…”
Conference Proceeding -
8
Lattention: Lattice-Attention in ASR Rescoring
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…Lattices form a compact representation of multiple hypotheses generated from an automatic speech recognition system and have been shown to improve performance…”
Conference Proceeding -
9
Robust Acoustic And Semantic Contextual Biasing In Neural Transducers For Speech Recognition
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)“…Attention-based contextual biasing approaches have shown significant improvements in the recognition of generic and/or personal rare-words in End-to-End…”
Conference Proceeding -
10
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)“…Large Language Models (LLMs) have demonstrated superior abilities in tasks such as chatting, reasoning, and question-answering. However, standard LLMs may…”
Conference Proceeding -
11
Procter: Pronunciation-Aware Contextual Adapter For Personalized Speech Recognition In Neural Transducers
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)“…End-to-End (E2E) automatic speech recognition (ASR) systems used in voice assistants often have difficulties recognizing infrequent words personalized to the…”
Conference Proceeding -
12
Towards ASR Robust Spoken Language Understanding Through in-Context Learning with Word Confusion Networks
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)“…In the realm of spoken language understanding (SLU), numerous natural language understanding (NLU) methodologies have been adapted by supplying large language…”
Conference Proceeding -
13
Scalable Language Model Adaptation for Spoken Dialogue Systems
Published in 2018 IEEE Spoken Language Technology Workshop (SLT) (01-12-2018)“…Language models (LM) for interactive speech recognition systems are trained on large amounts of data and the model parameters are optimized on past user data…”
Conference Proceeding -
14
Optimization of Neural Network Language Models for keyword search
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2014)“…Recent works have shown Neural Network based Language Models (NNLMs) to be an effective modeling technique for Automatic Speech Recognition. Prior works have…”
Conference Proceeding -
15
Audio-attention discriminative language model for ASR rescoring
Published 06-12-2019“…End-to-end approaches for automatic speech recognition (ASR) benefit from directly modeling the probability of the word sequence given the input audio stream…”
Journal Article -
16
USTED: Improving ASR with a Unified Speech and Text Encoder-Decoder
Published 12-02-2022“…Improving end-to-end speech recognition by incorporating external text data has been a longstanding research topic. There has been a recent focus on training…”
Journal Article -
17
Streaming Speech-to-Confusion Network Speech Recognition
Published 02-06-2023“…Proc. Interspeech, Aug. 2023, pp. 4099-4103. In interactive automatic speech recognition (ASR) systems, low-latency requirements limit the amount of search…”
Journal Article -
18
On-the-fly Text Retrieval for End-to-End ASR Adaptation
Published 20-03-2023“…End-to-end speech recognition models are improved by incorporating external text sources, typically by fusion with an external language model. Such language…”
Journal Article -
19
Speech Recognition Rescoring with Large Speech-Text Foundation Models
Published 25-09-2024“…Large language models (LLM) have demonstrated the ability to understand human language by leveraging large amount of text data. Automatic speech recognition…”
Journal Article -
20
Multi-Task Language Modeling for Improving Speech Recognition of Rare Words
Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13-12-2021)“…End-to-end automatic speech recognition (ASR) systems are increasingly popular due to their relative architectural simplicity and competitive performance…”
Conference Proceeding