Search Results - "Schluter, Ralf"

1
From Feedforward to Recurrent LSTM Neural Networks for Language Modeling by Sundermeyer, Martin, Ney, Hermann, Schluter, Ralf

Published in IEEE/ACM transactions on audio, speech, and language processing (01-03-2015)
“…Language models have traditionally been estimated based on relative frequencies, using count statistics that can be extracted from huge amounts of text data…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
Investigations on an EM-Style Optimization Algorithm for Discriminative Training of HMMs by Heigold, Georg, Ney, Hermann, Schluter, Ralf

Published in IEEE transactions on audio, speech, and language processing (01-12-2013)
“…Today's speech recognition systems are based on hidden Markov models (HMMs) with Gaussian mixture models whose parameters are estimated using a discriminative…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
3
Mean-normalized stochastic gradient for large-scale deep learning by Wiesler, Simon, Richard, Alexander, Schluter, Ralf, Ney, Hermann

Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2014)
“…Deep neural networks are typically optimized with stochastic gradient descent (SGD). In this work, we propose a novel second-order stochastic optimization…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
4
A Comparison of Transformer and LSTM Encoder Decoder Models for ASR by Zeyer, Albert, Bahar, Parnia, Irie, Kazuki, Schluter, Ralf, Ney, Hermann

Published in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01-12-2019)
“…We present competitive results using a Transformer encoder-decoder-attention model for end-to-end speech recognition needing less training time compared to a…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
5
Does the Cost Function Matter in Bayes Decision Rule? by Schluter, Ralf, Nussbaum-Thom, Markus, Ney, Hermann

Published in IEEE transactions on pattern analysis and machine intelligence (01-02-2012)
“…In many tasks in pattern recognition, such as automatic speech recognition (ASR), optical character recognition (OCR), part-of-speech (POS) tagging, and other…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
6
WFST Enabled Solutions to ASR Problems: Beyond HMM Decoding by Hoffmeister, B., Heigold, G., Rybach, D., Schluter, R., Ney, H.

Published in IEEE transactions on audio, speech, and language processing (01-02-2012)
“…During the last decade, weighted finite-state transducers (WFSTs) have become popular in speech recognition. While their main field of application remains…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
7
End-to-End Speech Recognition: A Survey by Prabhavalkar, Rohit, Hori, Takaaki, Sainath, Tara N., Schluter, Ralf, Watanabe, Shinji

Published in IEEE/ACM transactions on audio, speech, and language processing (2024)
“…In the last decade of automatic speech recognition (ASR) research, the introduction of deep learning has brought considerable reductions in word error rate of…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
8
Tight Integrated End-to-End Training for Cascaded Speech Translation by Bahar, Parnia, Bieschke, Tobias, Schluter, Ralf, Ney, Hermann

Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19-01-2021)
“…A cascaded speech translation model relies on discrete and non-differentiable transcription, which provides a supervision signal from the source side and helps…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
9
Phoneme Based Neural Transducer for Large Vocabulary Speech Recognition by Zhou, Wei, Berger, Simon, Schluter, Ralf, Ney, Hermann

Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06-06-2021)
“…To join the advantages of classical and end-to-end approaches for speech recognition, we present a simple, novel and competitive approach for phoneme-based…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
10
Returnn: The RWTH extensible training framework for universal recurrent neural networks by Doetsch, Patrick, Zeyer, Albert, Voigtlaender, Paul, Kulikov, Ilia, Schluter, Ralf, Ney, Hermann

Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)
“…In this work we release our extensible and easily configurable neural network training software. It provides a rich set of functional layers with a particular…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
11
On Language Model Integration for RNN Transducer Based Speech Recognition by Zhou, Wei, Zheng, Zuoyun, Schluter, Ralf, Ney, Hermann

Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)
“…The mismatch between an external language model (LM) and the implicitly learned internal LM (ILM) of RNN-Transducer (RNN-T) can limit the performance of LM…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
12
Conformer-Based Hybrid ASR System For Switchboard Dataset by Zeineldeen, Mohammad, Xu, Jingjing, Luscher, Christoph, Michel, Wilfried, Gerstenberger, Alexander, Schluter, Ralf, Ney, Hermann

Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)
“…The recently proposed conformer architecture has been successfully used for end-to-end automatic speech recognition (ASR) architectures achieving…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
13
Efficient Sequence Training of Attention Models Using Approximative Recombination by Wynands, Nils-Philipp, Michel, Wilfried, Rosendahl, Jan, Schluter, Ralf, Ney, Hermann

Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)
“…Sequence discriminative training is a great tool to improve the performance of an automatic speech recognition system. It does, however, necessitate a sum over…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
14
Multilingual representations for low resource speech recognition and keyword search by Jia Cui, Kingsbury, Brian, Ramabhadran, Bhuvana, Sethy, Abhinav, Audhkhasi, Kartik, Xiaodong Cui, Kislal, Ellen, Mangu, Lidia, Nussbaum-Thom, Markus, Picheny, Michael, Tuske, Zoltan, Golik, Pavel, Schluter, Ralf, Ney, Hermann, Gales, Mark J. F., Knill, Kate M., Ragni, Anton, Haipeng Wang, Woodland, Phil

Published in 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (01-12-2015)
“…This paper examines the impact of multilingual (ML) acoustic representations on Automatic Speech Recognition (ASR) and keyword search (KWS) for low resource…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
15
Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems by Rossenbach, Nick, Zeyer, Albert, Schluter, Ralf, Ney, Hermann

Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)
“…Recent advances in text-to-speech (TTS) led to the development of flexible multi-speaker end-to-end TTS systems. We extend state-of-the-art attention-based…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
16
A comprehensive study of deep bidirectional LSTM RNNS for acoustic modeling in speech recognition by Zeyer, Albert, Doetsch, Patrick, Voigtlaender, Paul, Schluter, Ralf, Ney, Hermann

Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)
“…Recent experiments show that deep bidirectional long short-term memory (BLSTM) recurrent neural network acoustic models outperform feedforward neural networks…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
17
Exploring A Zero-Order Direct Hmm Based on Latent Attention for Automatic Speech Recognition by Bahar, Parnia, Makarov, Nikita, Zeyer, Albert, Schluter, Ralf, Ney, Hermann

Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)
“…In this paper, we study a simple yet elegant latent variable attention model for automatic speech recognition (ASR) which enables an integration of attention…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
18
Multilingual MRASTA features for low-resource keyword search and speech recognition systems by Tuske, Zoltan, Nolden, David, Schluter, Ralf, Ney, Hermann

Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2014)
“…This paper investigates the application of hierarchical MRASTA bottleneck (BN) features for under-resourced languages within the IARPA Babel project. Through…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
19
Faster sequence training by Zeyer, Albert, Kulikov, Ilia, Schluter, Ralf, Ney, Hermann

Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)
“…It has been shown that sequence-discriminative training can improve the performance for large vocabulary continuous speech recognition. Our main contribution…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
20
Upper and Lower Tight Error Bounds for Feature Omission with an Extension to Context Reduction by Schluter, Ralf, Beck, Eugen, Ney, Hermann

Published in IEEE transactions on pattern analysis and machine intelligence (01-02-2019)
“…In this work, fundamental analytic results in the form of error bounds are presented that quantify the effect of feature omission and selection for pattern…”

Get full text

Journal Article
QR Code
Save to List

Saved in:

Search Results - "Schluter, Ralf"

From Feedforward to Recurrent LSTM Neural Networks for Language Modeling by Sundermeyer, Martin, Ney, Hermann, Schluter, Ralf

Investigations on an EM-Style Optimization Algorithm for Discriminative Training of HMMs by Heigold, Georg, Ney, Hermann, Schluter, Ralf

Mean-normalized stochastic gradient for large-scale deep learning by Wiesler, Simon, Richard, Alexander, Schluter, Ralf, Ney, Hermann

A Comparison of Transformer and LSTM Encoder Decoder Models for ASR by Zeyer, Albert, Bahar, Parnia, Irie, Kazuki, Schluter, Ralf, Ney, Hermann

Does the Cost Function Matter in Bayes Decision Rule? by Schluter, Ralf, Nussbaum-Thom, Markus, Ney, Hermann

WFST Enabled Solutions to ASR Problems: Beyond HMM Decoding by Hoffmeister, B., Heigold, G., Rybach, D., Schluter, R., Ney, H.

End-to-End Speech Recognition: A Survey by Prabhavalkar, Rohit, Hori, Takaaki, Sainath, Tara N., Schluter, Ralf, Watanabe, Shinji

Tight Integrated End-to-End Training for Cascaded Speech Translation by Bahar, Parnia, Bieschke, Tobias, Schluter, Ralf, Ney, Hermann

Phoneme Based Neural Transducer for Large Vocabulary Speech Recognition by Zhou, Wei, Berger, Simon, Schluter, Ralf, Ney, Hermann

Returnn: The RWTH extensible training framework for universal recurrent neural networks by Doetsch, Patrick, Zeyer, Albert, Voigtlaender, Paul, Kulikov, Ilia, Schluter, Ralf, Ney, Hermann

On Language Model Integration for RNN Transducer Based Speech Recognition by Zhou, Wei, Zheng, Zuoyun, Schluter, Ralf, Ney, Hermann

Conformer-Based Hybrid ASR System For Switchboard Dataset by Zeineldeen, Mohammad, Xu, Jingjing, Luscher, Christoph, Michel, Wilfried, Gerstenberger, Alexander, Schluter, Ralf, Ney, Hermann

Efficient Sequence Training of Attention Models Using Approximative Recombination by Wynands, Nils-Philipp, Michel, Wilfried, Rosendahl, Jan, Schluter, Ralf, Ney, Hermann

Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems by Rossenbach, Nick, Zeyer, Albert, Schluter, Ralf, Ney, Hermann

A comprehensive study of deep bidirectional LSTM RNNS for acoustic modeling in speech recognition by Zeyer, Albert, Doetsch, Patrick, Voigtlaender, Paul, Schluter, Ralf, Ney, Hermann

Exploring A Zero-Order Direct Hmm Based on Latent Attention for Automatic Speech Recognition by Bahar, Parnia, Makarov, Nikita, Zeyer, Albert, Schluter, Ralf, Ney, Hermann

Multilingual MRASTA features for low-resource keyword search and speech recognition systems by Tuske, Zoltan, Nolden, David, Schluter, Ralf, Ney, Hermann

Faster sequence training by Zeyer, Albert, Kulikov, Ilia, Schluter, Ralf, Ney, Hermann

Upper and Lower Tight Error Bounds for Feature Omission with an Extension to Context Reduction by Schluter, Ralf, Beck, Eugen, Ney, Hermann

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication