Search Results - "Klejch, Ondrej"

1
Adaptation Algorithms for Neural Network-Based Speech Recognition: An Overview by Bell, Peter, Fainberg, Joachim, Klejch, Ondrej, Li, Jinyu, Renals, Steve, Swietojanski, Pawel

Published in IEEE open journal of signal processing (2021)
“…We present a structured overview of adaptation algorithms for neural network-based speech recognition, considering both hybrid hidden Markov model / neural…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
Sequence-to-sequence models for punctuated transcription combining lexical and acoustic features by Klejch, Ondrej, Bell, Peter, Renals, Steve

Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)
“…In this paper we present an extension of our previously described neural machine translation based system for punctuated transcription. This extension allows…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
3
Punctuated transcription of multi-genre broadcasts using acoustic and lexical approaches by Klejch, Ondrej, Bell, Peter, Renals, Steve

Published in 2016 IEEE Spoken Language Technology Workshop (SLT) (01-12-2016)
“…In this paper we investigate the punctuated transcription of multi-genre broadcast media. We examine four systems, three of which are based on lexical…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
4
Ava Active Speaker: An Audio-Visual Dataset for Active Speaker Detection by Roth, Joseph, Chaudhuri, Sourish, Klejch, Ondrej, Marvin, Radhika, Gallagher, Andrew, Kaver, Liat, Ramaswamy, Sharadh, Stopczynski, Arkadiusz, Schmid, Cordelia, Xi, Zhonghua, Pantofaru, Caroline

Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)
“…Active speaker detection is an important component in video analysis algorithms for applications such as speaker diarization, video re-targeting for meetings,…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
5
Towards Zero-Shot Code-Switched Speech Recognition by Yan, Brian, Wiesner, Matthew, Klejch, Ondrej, Jyothi, Preethi, Watanabe, Shinji

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…In this work, we seek to build effective code-switched (CS) automatic speech recognition systems (ASR) under the zero-shot set-ting where no transcribed CS…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
6
The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR by Sanabria, Ramon, Bogoychev, Nikolay, Markl, Nina, Carmantini, Andrea, Klejch, Ondrej, Bell, Peter

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…English is the most widely spoken language in the world, used daily by millions of people as a first or second language in many different contexts. As a…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
7
Efficient Intelligibility Evaluation Using Keyword Spotting: A Study on Audio-Visual Speech Enhancement by Valentini-Botinhao, Cassia, Aldana Blanco, Andrea Lorena, Klejch, Ondrej, Bell, Peter

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…We propose a new method for human speech intelligibility evaluation based on keyword spotting. In this method, participants play a stimulus and select the word…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
8
Speech Collage: Code-Switched Audio Generation by Collaging Monolingual Corpora by Hussein, Amir, Zeinali, Dorsa, Klejch, Ondrej, Wiesner, Matthew, Yan, Brian, Chowdhury, Shammur, Ali, Ahmed, Watanabe, Shinji, Khudanpur, Sanjeev

Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)
“…Designing effective automatic speech recognition (ASR) systems for Code-Switching (CS) often depends on the availability of the transcribed CS resources. To…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
9
Beyond Oversmoothing: Evaluating DDPM and MSE for Scalable Speech Synthesis in ASR by Minixhofer, Christoph, Klejch, Ondrej, Bell, Peter

Published 16-10-2024
“…Synthetically generated speech has rapidly approached human levels of naturalness. However, the paradox remains that ASR systems, when trained on TTS output…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
10
Exploring Dominant Paths in CTC-Like ASR Models: Unraveling the Effectiveness of Viterbi Decoding by Zhao, Zeyu, Bell, Peter, Klejch, Ondrej

Published in 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW) (14-04-2024)
“…Connectionist Temporal Classification (CTC) has emerged as a fundamental technique in Automatic Speech Recognition (ASR), renowned for its ability to…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
11
Speaker Adaptive Training Using Model Agnostic Meta-Learning by Klejch, Ondrej, Fainberg, Joachim, Bell, Peter, Renals, Steve

Published in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01-12-2019)
“…Speaker adaptive training (SAT) of neural network acoustic models learns models in a way that makes them more suitable for adaptation to test conditions…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
12
Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR by Klejch, Ondrej, Wallington, Electra, Bell, Peter

Published 12-11-2021
“…We present a method for cross-lingual training an ASR system using absolutely no transcribed training data from the target language, and with no phonetic…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
13
Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling by Sanabria, Ramon, Klejch, Ondrej, Tang, Hao, Goldwater, Sharon

Published 03-06-2023
“…Acoustic word embeddings are typically created by training a pooling function using pairs of word-like units. For unsupervised systems, these are mined using…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
14
ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition by Li, Yuanchao, Zhao, Zeyu, Klejch, Ondrej, Bell, Peter, Lai, Catherine

Published 25-05-2023
“…In Speech Emotion Recognition (SER), textual data is often used alongside audio signals to address their inherent variability. However, the reliance on human…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
15
Towards Zero-Shot Code-Switched Speech Recognition by Yan, Brian, Wiesner, Matthew, Klejch, Ondrej, Jyothi, Preethi, Watanabe, Shinji

Published 02-11-2022
“…In this work, we seek to build effective code-switched (CS) automatic speech recognition systems (ASR) under the zero-shot setting where no transcribed CS…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
16
Acoustic Model Adaptation from Raw Waveforms with Sincnet by Fainberg, Joachim, Klejch, Ondrej, Loweimi, Erfan, Bell, Peter, Renals, Steve

Published in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01-12-2019)
“…Raw waveform acoustic modelling has recently gained interest due to neural networks' ability to learn feature extraction, and the potential for finding better…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
17
AVSE Challenge: Audio-Visual Speech Enhancement Challenge by Blanco, Andrea Lorena Aldana, Valentini-Botinhao, Cassia, Klejch, Ondrej, Gogate, Mandar, Dashtipour, Kia, Hussain, Amir, Bell, Peter

Published in 2022 IEEE Spoken Language Technology Workshop (SLT) (09-01-2023)
“…Audio-visual speech enhancement is the task of improving the quality of a speech signal when video of the speaker is available. It opens-up the opportunity of…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
18
The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR by Sanabria, Ramon, Bogoychev, Nikolay, Markl, Nina, Carmantini, Andrea, Klejch, Ondrej, Bell, Peter

Published 31-03-2023
“…English is the most widely spoken language in the world, used daily by millions of people as a first or second language in many different contexts. As a…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
19
Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features by Tsunoo, Emiru, Klejch, Ondrej, Bell, Peter, Renals, Steve

Published in 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01-12-2017)
“…A broadcast news stream consists of a number of stories and it is an important task to find the boundaries of stories automatically in news analysis. We…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
20
Lattice-Based Unsupervised Test-Time Adaptation of Neural Network Acoustic Models by Klejch, Ondrej, Fainberg, Joachim, Bell, Peter, Renals, Steve

Published 27-06-2019
“…Acoustic model adaptation to unseen test recordings aims to reduce the mismatch between training and testing conditions. Most adaptation schemes for neural…”

Get full text

Journal Article
QR Code
Save to List

Saved in:

Search Results - "Klejch, Ondrej"

Adaptation Algorithms for Neural Network-Based Speech Recognition: An Overview by Bell, Peter, Fainberg, Joachim, Klejch, Ondrej, Li, Jinyu, Renals, Steve, Swietojanski, Pawel

Sequence-to-sequence models for punctuated transcription combining lexical and acoustic features by Klejch, Ondrej, Bell, Peter, Renals, Steve

Punctuated transcription of multi-genre broadcasts using acoustic and lexical approaches by Klejch, Ondrej, Bell, Peter, Renals, Steve

Ava Active Speaker: An Audio-Visual Dataset for Active Speaker Detection by Roth, Joseph, Chaudhuri, Sourish, Klejch, Ondrej, Marvin, Radhika, Gallagher, Andrew, Kaver, Liat, Ramaswamy, Sharadh, Stopczynski, Arkadiusz, Schmid, Cordelia, Xi, Zhonghua, Pantofaru, Caroline

Towards Zero-Shot Code-Switched Speech Recognition by Yan, Brian, Wiesner, Matthew, Klejch, Ondrej, Jyothi, Preethi, Watanabe, Shinji

The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR by Sanabria, Ramon, Bogoychev, Nikolay, Markl, Nina, Carmantini, Andrea, Klejch, Ondrej, Bell, Peter

Efficient Intelligibility Evaluation Using Keyword Spotting: A Study on Audio-Visual Speech Enhancement by Valentini-Botinhao, Cassia, Aldana Blanco, Andrea Lorena, Klejch, Ondrej, Bell, Peter

Speech Collage: Code-Switched Audio Generation by Collaging Monolingual Corpora by Hussein, Amir, Zeinali, Dorsa, Klejch, Ondrej, Wiesner, Matthew, Yan, Brian, Chowdhury, Shammur, Ali, Ahmed, Watanabe, Shinji, Khudanpur, Sanjeev

Beyond Oversmoothing: Evaluating DDPM and MSE for Scalable Speech Synthesis in ASR by Minixhofer, Christoph, Klejch, Ondrej, Bell, Peter

Exploring Dominant Paths in CTC-Like ASR Models: Unraveling the Effectiveness of Viterbi Decoding by Zhao, Zeyu, Bell, Peter, Klejch, Ondrej

Speaker Adaptive Training Using Model Agnostic Meta-Learning by Klejch, Ondrej, Fainberg, Joachim, Bell, Peter, Renals, Steve

Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR by Klejch, Ondrej, Wallington, Electra, Bell, Peter

Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling by Sanabria, Ramon, Klejch, Ondrej, Tang, Hao, Goldwater, Sharon

ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition by Li, Yuanchao, Zhao, Zeyu, Klejch, Ondrej, Bell, Peter, Lai, Catherine

Towards Zero-Shot Code-Switched Speech Recognition by Yan, Brian, Wiesner, Matthew, Klejch, Ondrej, Jyothi, Preethi, Watanabe, Shinji

Acoustic Model Adaptation from Raw Waveforms with Sincnet by Fainberg, Joachim, Klejch, Ondrej, Loweimi, Erfan, Bell, Peter, Renals, Steve

AVSE Challenge: Audio-Visual Speech Enhancement Challenge by Blanco, Andrea Lorena Aldana, Valentini-Botinhao, Cassia, Klejch, Ondrej, Gogate, Mandar, Dashtipour, Kia, Hussain, Amir, Bell, Peter

The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR by Sanabria, Ramon, Bogoychev, Nikolay, Markl, Nina, Carmantini, Andrea, Klejch, Ondrej, Bell, Peter

Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features by Tsunoo, Emiru, Klejch, Ondrej, Bell, Peter, Renals, Steve

Lattice-Based Unsupervised Test-Time Adaptation of Neural Network Acoustic Models by Klejch, Ondrej, Fainberg, Joachim, Bell, Peter, Renals, Steve

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication