Search Results - "Bulyko, Ivan"

1
RescoreBERT: Discriminative Speech Recognition Rescoring with BERT by Xu, Liyan, Gu, Yile, Kolehmainen, Jari, Khan, Haidar, Gandhe, Ankur, Rastrow, Ariya, Stolcke, Andreas, Bulyko, Ivan

Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)
“…Second-pass rescoring is an important component in automatic speech recognition (ASR) systems that is used to improve the outputs from a first-pass decoder by…”

Conference Proceeding

2
On-the-Fly Text Retrieval for End-to-End ASR Adaptation by Yusuf, Bolaji, Gourav, Aditya, Gandhe, Ankur, Bulyko, Ivan

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…End-to-end speech recognition models are improved by incorporating external text sources, typically by fusion with an external language model. Such language…”

Conference Proceeding

3
A Likelihood Ratio Based Domain Adaptation Method for E2E Models by Choudhury, Chhavi, Gandhe, Ankur, Ding, Xiaohan, Bulyko, Ivan

Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)
“…End-to-end (E2E) automatic speech recognition models like Recurrent Neural Networks Transducer (RNN-T) are becoming a popular choice for streaming ASR…”

Conference Proceeding

4
Personalization Strategies for End-to-End Speech Recognition Systems by Gourav, Aditya, Liu, Linda, Gandhe, Ankur, Gu, Yile, Lan, Guitang, Huang, Xiangyang, Kalmane, Shashank, Tiwari, Gautam, Filimonov, Denis, Rastrow, Ariya, Stolcke, Andreas, Bulyko, Ivan

Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06-06-2021)
“…The recognition of personalized content, such as contact names, remains a challenging problem for end-to-end speech recognition systems. In this work, we…”

Conference Proceeding

5
Domain-Aware Neural Language Models for Speech Recognition by Liu, Linda, Gu, Yile, Gourav, Aditya, Gandhe, Ankur, Kalmane, Shashank, Filimonov, Denis, Rastrow, Ariya, Bulyko, Ivan

Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06-06-2021)
“…As voice assistants become more ubiquitous, they are increasingly expected to support and perform well on a wide variety of use-cases across different domains…”

Conference Proceeding

6
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue by Lin, Guan-Ting, Shivakumar, Prashanth Gurunath, Gandhe, Ankur, Yang, Chao-Han Huck, Gu, Yile, Ghosh, Shalini, Stolcke, Andreas, Lee, Hung-Yi, Bulyko, Ivan

Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)
“…Large Language Models (LLMs) have demonstrated superior abilities in tasks such as chatting, reasoning, and question-answering. However, standard LLMs may…”

Conference Proceeding

7
Procter: Pronunciation-Aware Contextual Adapter For Personalized Speech Recognition In Neural Transducers by Pandey, Rahul, Ren, Roger, Luo, Qi, Liu, Jing, Rastrow, Ariya, Gandhe, Ankur, Filimonov, Denis, Strimel, Grant, Stolcke, Andreas, Bulyko, Ivan

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…End-to-End (E2E) automatic speech recognition (ASR) systems used in voice assistants often have difficulties recognizing infrequent words personalized to the…”

Conference Proceeding

8
Towards ASR Robust Spoken Language Understanding Through in-Context Learning with Word Confusion Networks by Everson, Kevin, Gu, Yile, Yang, Huck, Shivakumar, Prashanth Gurunath, Lin, Guan-Ting, Kolehmainen, Jari, Bulyko, Ivan, Gandhe, Ankur, Ghosh, Shalini, Hamza, Wael, Lee, Hung-Yi, Rastrow, Ariya, Stolcke, Andreas

Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)
“…In the realm of spoken language understanding (SLU), numerous natural language understanding (NLU) methodologies have been adapted by supplying large language…”

Conference Proceeding

9
Mitigating Closed-Model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition by Yang, Chao-Han Huck, Ahmed, Zeeshan, Gu, Yile, Szurley, Joseph, Ren, Roger, Liu, Linda, Stolcke, Andreas, Bulyko, Ivan

Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)
“…In this work, we aim to enhance the system robustness of end-to-end automatic speech recognition (ASR) against adversarially-noisy speech examples. We focus on…”

Conference Proceeding

10
Towards Continual Entity Learning in Language Models for Conversational Agents by Gadde, Ravi Teja, Bulyko, Ivan

Published 30-07-2021
“…Neural language models (LM) trained on diverse corpora are known to work well on previously seen entities, however, updating these models with dynamically…”

Journal Article

11
On-the-fly Text Retrieval for End-to-End ASR Adaptation by Yusuf, Bolaji, Gourav, Aditya, Gandhe, Ankur, Bulyko, Ivan

Published 20-03-2023
“…End-to-end speech recognition models are improved by incorporating external text sources, typically by fusion with an external language model. Such language…”

Journal Article

12
Speech Recognition Rescoring with Large Speech-Text Foundation Models by Shivakumar, Prashanth Gurunath, Kolehmainen, Jari, Gourav, Aditya, Gu, Yi, Gandhe, Ankur, Rastrow, Ariya, Bulyko, Ivan

Published 25-09-2024
“…Large language models (LLM) have demonstrated the ability to understand human language by leveraging large amount of text data. Automatic speech recognition…”

Journal Article

13
Multi-Task Language Modeling for Improving Speech Recognition of Rare Words by Yang, Chao-Han Huck, Liu, Linda, Gandhe, Ankur, Gu, Yile, Raju, Anirudh, Filimonov, Denis, Bulyko, Ivan

Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13-12-2021)
“…End-to-end automatic speech recognition (ASR) systems are increasingly popular due to their relative architectural simplicity and competitive performance…”

Conference Proceeding

14
Normalization of phonetic keyword search scores by Karakos, Damianos, Bulyko, Ivan, Schwartz, Richard, Tsakalidis, Stavros, Nguyen, Long, Makhoul, John

Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2014)
“…As shown in [1, 2], score normalization is of crucial importance for improving the Average Term-Weighted Value (ATWV) measure that is commonly used for…”

Conference Proceeding

15
Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback by Lin, Guan-Ting, Shivakumar, Prashanth Gurunath, Gourav, Aditya, Gu, Yile, Gandhe, Ankur, Lee, Hung-yi, Bulyko, Ivan

Published 04-11-2024
“…While textless Spoken Language Models (SLMs) have shown potential in end-to-end speech-to-speech modeling, they still lag behind text-based Large Language…”

Journal Article

16
Discriminative Speech Recognition Rescoring with Pre-trained Language Models by Shivakumar, Prashanth Gurunath, Kolehmainen, Jari, Gu, Yile, Gandhe, Ankur, Rastrow, Ariya, Bulyko, Ivan

Published 09-10-2023
“…Second pass rescoring is a critical component of competitive automatic speech recognition (ASR) systems. Large language models have demonstrated their ability…”

Journal Article

17
A Likelihood Ratio based Domain Adaptation Method for E2E Models by Choudhury, Chhavi, Gandhe, Ankur, Ding, Xiaohan, Bulyko, Ivan

Published 10-01-2022
“…End-to-end (E2E) automatic speech recognition models like Recurrent Neural Networks Transducer (RNN-T) are becoming a popular choice for streaming ASR…”

Journal Article

18
Multi-Modal Retrieval For Large Language Model Based Speech Recognition by Kolehmainen, Jari, Gourav, Aditya, Shivakumar, Prashanth Gurunath, Gu, Yile, Gandhe, Ankur, Rastrow, Ariya, Strimel, Grant, Bulyko, Ivan

Published 13-06-2024
“…Retrieval is a widely adopted approach for improving language models leveraging external information. As the field moves towards multi-modal large language…”

Journal Article

19
Personalization for BERT-based Discriminative Speech Recognition Rescoring by Kolehmainen, Jari, Gu, Yile, Gourav, Aditya, Shivakumar, Prashanth Gurunath, Gandhe, Ankur, Rastrow, Ariya, Bulyko, Ivan

Published 13-07-2023
“…Recognition of personalized content remains a challenge in end-to-end speech recognition. We explore three novel approaches that use personalized content in a…”

Journal Article

20
Scaling Laws for Discriminative Speech Recognition Rescoring Models by Gu, Yile, Shivakumar, Prashanth Gurunath, Kolehmainen, Jari, Gandhe, Ankur, Rastrow, Ariya, Bulyko, Ivan

Published 27-06-2023
“…Recent studies have found that model performance has a smooth power-law relationship, or scaling laws, with training data and model size, for a wide range of…”

Journal Article
