Search Results - "Mei-Yuh Hwang"
-
1
Domain Adversarial Training for Accented Speech Recognition
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2018)“…In this paper, we propose a domain adversarial training (DAT) algorithm to alleviate the accented speech recognition problem. In order to reduce the mismatch…”
Get full text
Conference Proceeding -
2
Adversarial Regularization for Attention Based End-to-End Robust Speech Recognition
Published in IEEE/ACM transactions on audio, speech, and language processing (01-11-2019)“…End-to-end speech recognition, such as attention based approaches, is an emerging and attractive topic in recent years. It has achieved comparable performance…”
Get full text
Journal Article -
3
Contextual spoken language understanding using recurrent neural networks
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)“…We present a contextual spoken language understanding (contextual SLU) method using Recurrent Neural Networks (RNNs). Previous work has shown that context…”
Get full text
Conference Proceeding -
4
Region Proposal Network Based Small-Footprint Keyword Spotting
Published in IEEE signal processing letters (01-10-2019)“…We apply an anchor-based region proposal network (RPN) for end-to-end keyword spotting (KWS). RPNs have been widely used for object detection in image and…”
Get full text
Journal Article -
5
Recent innovations in speech-to-text transcription at SRI-ICSI-UW
Published in IEEE transactions on audio, speech, and language processing (01-09-2006)“…We summarize recent progress in automatic speech-to-text transcription at SRI, ICSI, and the University of Washington. The work encompasses all components of…”
Get full text
Journal Article -
6
Building A Highly Accurate Mandarin Speech Recognizer With Language-Independent Technologies and Language-Dependent Modules
Published in IEEE transactions on audio, speech, and language processing (01-09-2009)“…We describe a system for highly accurate large-vocabulary Mandarin speech recognition. The prevailing hidden Markov model based technologies are essentially…”
Get full text
Journal Article -
7
Mining Effective Negative Training Samples for Keyword Spotting
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…Max-pooling neural network architectures have been proven to be useful for keyword spotting (KWS), but standard training methods suffer from a class-imbalance…”
Get full text
Conference Proceeding -
8
A factorization network based method for multi-lingual domain classification
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)“…In many spoken language understanding systems (SLUS), domain classification is the most crucial component, as system responses based on wrong domains often…”
Get full text
Conference Proceeding -
9
Knowledge Distillation for Recurrent Neural Network Language Modeling with Trust Regularization
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)“…Recurrent Neural Networks (RNNs) have dominated language modeling because of their superior performance over traditional N-gram based models. In many…”
Get full text
Conference Proceeding -
10
End-to-end Speech Recognition Using a High Rank LSTM-CTC Based Model
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)“…Long Short Term Memory Connectionist Temporal Classification (LSTM-CTC) based end-to-end models are widely used in speech recognition due to its simplicity in…”
Get full text
Conference Proceeding -
11
New methods and evaluation experiments on translating TED talks in the IWSLT benchmark
Published in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2012)“…The IWSLT benchmark task is an annual evaluation campaign on spoken language translation held by the International Workshop on Spoken Language Processing…”
Get full text
Conference Proceeding -
12
Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition
Published in 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07 (01-04-2007)“…Recent developments in large vocabulary continuous speech recognition (LVCSR) have shown the effectiveness of discriminative training approaches, employing the…”
Get full text
Conference Proceeding -
13
Semi-supervised slot tagging in spoken language understanding using recurrent transductive support vector machines
Published in 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (01-12-2015)“…In this paper, we propose a recurrent transductive support vector machine (rtsvm) for semi-supervised slot tagging. Taking advantage of the superior sequence…”
Get full text
Conference Proceeding -
14
Multiple fixed beamformers with a spacial Wiener-form postfilter for far-field speech recognition
Published in 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) (01-11-2019)“…Far-field speech recognition is becoming a hot topic in research and industrial applications. In this paper, in order to improve far-field speech recognition…”
Get full text
Conference Proceeding -
15
Web-data augmented language models for Mandarin conversational speech recognition
Published in Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005 (2005)“…Lack of data is a problem in training language models for conversational speech recognition, particularly for languages other than English. Experiments in…”
Get full text
Conference Proceeding -
16
A Robust Nonlinear Microphone Array Postfilter for Noise Reduction
Published in 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC) (01-09-2018)“…We propose a robust nonlinear microphone array postfilter for noise reduction. This postfilter is formulated as a function of noise power ratio before and…”
Get full text
Conference Proceeding -
17
Shared-distribution hidden Markov models for speech recognition
Published in IEEE transactions on speech and audio processing (01-10-1993)“…A shared-distribution hidden Markov model (HMM) is presented for speaker-independent continuous speech recognition. The output distributions across different…”
Get full text
Journal Article -
18
Generating a task-adapted acoustic model from one or more different corpora
Published in The Journal of the Acoustical Society of America (2009)Get full text
Journal Article -
19
Predicting unseen triphones with senones
Published in IEEE transactions on speech and audio processing (01-11-1996)“…In large-vocabulary speech recognition, we often encounter triphones that are not covered in the training data. These unseen triphones are usually backed off…”
Get full text
Journal Article -
20