Search Results - "Bulyko, Ivan"
-
1
RescoreBERT: Discriminative Speech Recognition Rescoring With Bert
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…Second-pass rescoring is an important component in automatic speech recognition (ASR) systems that is used to improve the outputs from a first-pass decoder by…”
Get full text
Conference Proceeding -
2
On-the-Fly Text Retrieval for end-to-end ASR Adaptation
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)“…End-to-end speech recognition models are improved by incorporating external text sources, typically by fusion with an external language model. Such language…”
Get full text
Conference Proceeding -
3
A Likelihood Ratio Based Domain Adaptation Method for E2E Models
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…End-to-end (E2E) automatic speech recognition models like Recurrent Neural Networks Transducer (RNN-T) are becoming a popular choice for streaming ASR…”
Get full text
Conference Proceeding -
4
Personalization Strategies for End-to-End Speech Recognition Systems
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06-06-2021)“…The recognition of personalized content, such as contact names, remains a challenging problem for end-to-end speech recognition systems. In this work, we…”
Get full text
Conference Proceeding -
5
Domain-Aware Neural Language Models for Speech Recognition
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06-06-2021)“…As voice assistants become more ubiquitous, they are increasingly expected to support and perform well on a wide variety of use-cases across different domains…”
Get full text
Conference Proceeding -
6
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)“…Large Language Models (LLMs) have demonstrated superior abilities in tasks such as chatting, reasoning, and question-answering. However, standard LLMs may…”
Get full text
Conference Proceeding -
7
Procter: Pronunciation-Aware Contextual Adapter For Personalized Speech Recognition In Neural Transducers
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)“…End-to-End (E2E) automatic speech recognition (ASR) systems used in voice assistants often have difficulties recognizing infrequent words personalized to the…”
Get full text
Conference Proceeding -
8
Towards ASR Robust Spoken Language Understanding Through in-Context Learning with Word Confusion Networks
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)“…In the realm of spoken language understanding (SLU). numerous natural language understanding (NLU) methodologies have been adapted by supplying large language…”
Get full text
Conference Proceeding -
9
Mitigating Closed-Model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…In this work, we aim to enhance the system robustness of end-to-end automatic speech recognition (ASR) against adversarially-noisy speech examples. We focus on…”
Get full text
Conference Proceeding -
10
Towards Continual Entity Learning in Language Models for Conversational Agents
Published 30-07-2021“…Neural language models (LM) trained on diverse corpora are known to work well on previously seen entities, however, updating these models with dynamically…”
Get full text
Journal Article -
11
On-the-fly Text Retrieval for End-to-End ASR Adaptation
Published 20-03-2023“…End-to-end speech recognition models are improved by incorporating external text sources, typically by fusion with an external language model. Such language…”
Get full text
Journal Article -
12
Speech Recognition Rescoring with Large Speech-Text Foundation Models
Published 25-09-2024“…Large language models (LLM) have demonstrated the ability to understand human language by leveraging large amount of text data. Automatic speech recognition…”
Get full text
Journal Article -
13
Multi-Task Language Modeling for Improving Speech Recognition of Rare Words
Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13-12-2021)“…End-to-end automatic speech recognition (ASR) systems are increasingly popular due to their relative architectural simplicity and competitive performance…”
Get full text
Conference Proceeding -
14
Normalization of phonetic keyword search scores
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2014)“…As shown in [1, 2], score normalization is of crucial importance for improving the Average Term-Weighted Value (ATWV) measure that is commonly used for…”
Get full text
Conference Proceeding -
15
Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback
Published 04-11-2024“…While textless Spoken Language Models (SLMs) have shown potential in end-to-end speech-to-speech modeling, they still lag behind text-based Large Language…”
Get full text
Journal Article -
16
Discriminative Speech Recognition Rescoring with Pre-trained Language Models
Published 09-10-2023“…Second pass rescoring is a critical component of competitive automatic speech recognition (ASR) systems. Large language models have demonstrated their ability…”
Get full text
Journal Article -
17
A Likelihood Ratio based Domain Adaptation Method for E2E Models
Published 10-01-2022“…End-to-end (E2E) automatic speech recognition models like Recurrent Neural Networks Transducer (RNN-T) are becoming a popular choice for streaming ASR…”
Get full text
Journal Article -
18
Multi-Modal Retrieval For Large Language Model Based Speech Recognition
Published 13-06-2024“…Retrieval is a widely adopted approach for improving language models leveraging external information. As the field moves towards multi-modal large language…”
Get full text
Journal Article -
19
Personalization for BERT-based Discriminative Speech Recognition Rescoring
Published 13-07-2023“…Recognition of personalized content remains a challenge in end-to-end speech recognition. We explore three novel approaches that use personalized content in a…”
Get full text
Journal Article -
20
Scaling Laws for Discriminative Speech Recognition Rescoring Models
Published 27-06-2023“…Recent studies have found that model performance has a smooth power-law relationship, or scaling laws, with training data and model size, for a wide range of…”
Get full text
Journal Article