Search Results - "Stuker, Sebastian"
-
1
Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2018)“…We summarize the accomplishments of a multi-disciplinary workshop exploring the computational and scientific issues surrounding the discovery of linguistic…”
Get full text
Conference Proceeding -
2
Multilingual shifting deep bottleneck features for low-resource ASR
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2014)“…In this work, we propose a deep bottleneck feature architecture that is able to leverage data from multiple languages. We also show that tonal features are…”
Get full text
Conference Proceeding -
3
Training time reduction and performance improvements from multilingual techniques on the BABEL ASR task
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2014)“…In the IARPA sponsored program BABEL we are faced with the challenge of training automatic speech recognition systems in sparse data conditions in very little…”
Get full text
Conference Proceeding -
4
Improving Sequence-To-Sequence Speech Recognition Training with On-The-Fly Data Augmentation
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…Sequence-to-Sequence (S2S) models recently started to show state-of-the-art performance for automatic speech recognition (ASR). With these large and deep…”
Get full text
Conference Proceeding -
5
Speech Technology for Unwritten Languages
Published in IEEE/ACM transactions on audio, speech, and language processing (01-01-2020)“…Speech technology plays an important role in our everyday life. Among others, speech is used for human-computer interaction, for instance for information…”
Get full text
Journal Article -
6
Neural Codes to Factor Language in Multilingual Speech Recognition
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)“…In the past, we adapted neural network based multilingual acoustic models using language codes. In this work, we study the extracted language codes and the…”
Get full text
Conference Proceeding -
7
Speech interaction strategies for a humanoid assistant
Published in MATEC Web of Conferences (01-01-2018)“…The goal of SecondHands, a H2020 project, is to design a robot that can offer help to a maintenance technician in a proactive manner. The robot is to act as a…”
Get full text
Journal Article Conference Proceeding -
8
Towards phoneme inventory discovery for documentation of unwritten languages
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)“…Documenting unwritten languages is a challenging task, even for trained specialists. To help linguists in better and faster documenting new languages is the…”
Get full text
Conference Proceeding -
9
Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition
Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13-12-2021)“…Neural sequence-to-sequence systems deliver state-of-the-art performance for automatic speech recognition (ASR). When using appropriate modeling units, e.g.,…”
Get full text
Conference Proceeding -
10
An automatic system for the simultaneous translation of lectures
Published in Journal of cheminformatics (11-03-2014)Get full text
Journal Article -
11
Semi-supervised training in low-resource ASR and KWS
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)“…In particular for "low resource" Keyword Search (KWS) and Speech-to-Text (STT) tasks, more untranscribed test data may be available than training data. Several…”
Get full text
Conference Proceeding -
12
Multi-stage Large Language Model Correction for Speech Recognition
Published 17-10-2023“…In this paper, we investigate the usage of large language models (LLMs) to improve the performance of competitive speech recognition systems. Different from…”
Get full text
Journal Article -
13
Modified polyphone decision tree specialization for porting multilingual Grapheme based ASR systems to new languages
Published in 2008 IEEE International Conference on Acoustics, Speech and Signal Processing (01-03-2008)“…Automatic speech recognition (ASR) systems have been developed only for a very limited number of the estimated 7,000 languages in the world. In order to avoid…”
Get full text
Conference Proceeding -
14
Toward Cross-Domain Speech Recognition with End-to-End Models
Published 09-03-2020“…In the area of multi-domain speech recognition, research in the past focused on hybrid acoustic models to build cross-domain and domain-invariant speech…”
Get full text
Journal Article -
15
Research Opportunities In Automatic Speech-To-Speech Translation
Published in IEEE potentials (01-05-2012)“…The field of speech-to-speech translation is a "well-established area of research that has a noteworthy tradition. It addresses a problem-the communication…”
Get full text
Journal Article -
16
Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition
Published 05-07-2021“…Neural sequence-to-sequence systems deliver state-of-the-art performance for automatic speech recognition (ASR). When using appropriate modeling units, e.g.,…”
Get full text
Journal Article -
17
A hybrid phonotactic language identification system with an SVM back-end for simultaneous lecture translation
Published in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2012)“…In this paper we describe our work in constructing a language identification system for use in our simultaneous lecture translation system. We first built PPR…”
Get full text
Conference Proceeding -
18
Neural Language Codes for Multilingual Acoustic Models
Published 05-07-2018“…Multilingual Speech Recognition is one of the most costly AI problems, because each language (7,000+) and even different accents require their own acoustic…”
Get full text
Journal Article -
19
Multilingual Adaptation of RNN Based ASR Systems
Published 13-11-2017“…In this work, we focus on multilingual systems based on recurrent neural networks (RNNs), trained using the Connectionist Temporal Classification (CTC) loss…”
Get full text
Journal Article -
20
Phonemic and Graphemic Multilingual CTC Based Speech Recognition
Published 13-11-2017“…Training automatic speech recognition (ASR) systems requires large amounts of data in the target language in order to achieve good performance. Whereas large…”
Get full text
Journal Article