Search Results - "Mei-Yuh Hwang"

1
Domain Adversarial Training for Accented Speech Recognition by Sun, Sining, Yeh, Ching-Feng, Hwang, Mei-Yuh, Ostendorf, Mari, Xie, Lei

Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2018)
“…In this paper, we propose a domain adversarial training (DAT) algorithm to alleviate the accented speech recognition problem. In order to reduce the mismatch…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
2
Adversarial Regularization for Attention Based End-to-End Robust Speech Recognition by Sun, Sining, Guo, Pengcheng, Xie, Lei, Hwang, Mei-Yuh

Published in IEEE/ACM transactions on audio, speech, and language processing (01-11-2019)
“…End-to-end speech recognition, such as attention based approaches, is an emerging and attractive topic in recent years. It has achieved comparable performance…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
3
Contextual spoken language understanding using recurrent neural networks by Yangyang Shi, Kaisheng Yao, Hu Chen, Yi-Cheng Pan, Mei-Yuh Hwang, Baolin Peng

Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)
“…We present a contextual spoken language understanding (contextual SLU) method using Recurrent Neural Networks (RNNs). Previous work has shown that context…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
4
Region Proposal Network Based Small-Footprint Keyword Spotting by Hou, Jingyong, Shi, Yangyang, Ostendorf, Mari, Hwang, Mei-Yuh, Xie, Lei

Published in IEEE signal processing letters (01-10-2019)
“…We apply an anchor-based region proposal network (RPN) for end-to-end keyword spotting (KWS). RPNs have been widely used for object detection in image and…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
5
Recent innovations in speech-to-text transcription at SRI-ICSI-UW by Stolcke, A., Barry Chen, Franco, H., Venkata Ramana Rao Gadde, Graciarena, M., Mei-Yuh Hwang, Kirchhoff, K., Mandal, A., Morgan, N., Xin Lei, Ng, T., Ostendorf, M., Sonmez, K., Venkataraman, A., Vergyri, D., Wen Wang, Jing Zheng, Qifeng Zhu

Published in IEEE transactions on audio, speech, and language processing (01-09-2006)
“…We summarize recent progress in automatic speech-to-text transcription at SRI, ICSI, and the University of Washington. The work encompasses all components of…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
6
Building A Highly Accurate Mandarin Speech Recognizer With Language-Independent Technologies and Language-Dependent Modules by Mei-Yuh Hwang, Gang Peng, Ostendorf, M., Wen Wang, Faria, A., Heidel, A.

Published in IEEE transactions on audio, speech, and language processing (01-09-2009)
“…We describe a system for highly accurate large-vocabulary Mandarin speech recognition. The prevailing hidden Markov model based technologies are essentially…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
7
Mining Effective Negative Training Samples for Keyword Spotting by Hou, Jingyong, Shi, Yangyang, Ostendorf, Mari, Hwang, Mei-Yuh, Xie, Lei

Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)
“…Max-pooling neural network architectures have been proven to be useful for keyword spotting (KWS), but standard training methods suffer from a class-imbalance…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
8
A factorization network based method for multi-lingual domain classification by Yangyang Shi, Yi-Cheng Pan, Mei-Yuh Hwang, Kaisheng Yao, Hu Chen, Yuanhang Zou, Baolin Peng

Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)
“…In many spoken language understanding systems (SLUS), domain classification is the most crucial component, as system responses based on wrong domains often…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
9
Knowledge Distillation for Recurrent Neural Network Language Modeling with Trust Regularization by Shi, Yangyang, Hwang, Mei-Yuh, Lei, Xin, Sheng, Haoyu

Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)
“…Recurrent Neural Networks (RNNs) have dominated language modeling because of their superior performance over traditional N-gram based models. In many…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
10
End-to-end Speech Recognition Using a High Rank LSTM-CTC Based Model by Shi, Yangyang, Hwang, Mei-Yuh, Lei, Xin

Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)
“…Long Short Term Memory Connectionist Temporal Classification (LSTM-CTC) based end-to-end models are widely used in speech recognition due to its simplicity in…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
11
New methods and evaluation experiments on translating TED talks in the IWSLT benchmark by Axelrod, A., Xiaodong He, Li Deng, Acero, A., Mei-Yuh Hwang

Published in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2012)
“…The IWSLT benchmark task is an annual evaluation campaign on spoken language translation held by the International Workshop on Spoken Language Processing…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
12
Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition by Jing Zheng, Cetin, O., Mei-Yuh Hwang, Xin Lei, Stolcke, A., Morgan, N.

Published in 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07 (01-04-2007)
“…Recent developments in large vocabulary continuous speech recognition (LVCSR) have shown the effectiveness of discriminative training approaches, employing the…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
13
Semi-supervised slot tagging in spoken language understanding using recurrent transductive support vector machines by Yangyang Shi, Kaisheng Yao, Hu Chen, Yi-Cheng Pan, Mei-Yuh Hwang

Published in 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (01-12-2015)
“…In this paper, we propose a recurrent transductive support vector machine (rtsvm) for semi-supervised slot tagging. Taking advantage of the superior sequence…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
14
Multiple fixed beamformers with a spacial Wiener-form postfilter for far-field speech recognition by Sun, Sining, Zhou, Shuran, Hwang, Mei-Yuh, Xie, Lei, Li, Qin, Lei, Xin

Published in 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) (01-11-2019)
“…Far-field speech recognition is becoming a hot topic in research and industrial applications. In this paper, in order to improve far-field speech recognition…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
15
Web-data augmented language models for Mandarin conversational speech recognition by Ng, T., Ostendorf, M., Mei-Yuh Hwang, Manhung Siu, Bulyko, I., Xin Lei

Published in Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005 (2005)
“…Lack of data is a problem in training language models for conversational speech recognition, particularly for languages other than English. Experiments in…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
16
A Robust Nonlinear Microphone Array Postfilter for Noise Reduction by Bu, Suliang, Zhao, Yunxin, Hwang, Mei-Yuh, Sun, Sining

Published in 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC) (01-09-2018)
“…We propose a robust nonlinear microphone array postfilter for noise reduction. This postfilter is formulated as a function of noise power ratio before and…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
17
Shared-distribution hidden Markov models for speech recognition by Hwang, Mei-Yuh, Huang, Xuedong

Published in IEEE transactions on speech and audio processing (01-10-1993)
“…A shared-distribution hidden Markov model (HMM) is presented for speaker-independent continuous speech recognition. The output distributions across different…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
18
Generating a task-adapted acoustic model from one or more different corpora by Hwang, Mei Yuh

Published in The Journal of the Acoustical Society of America (2009)

Get full text

Journal Article
QR Code
Save to List

Saved in:
19
Predicting unseen triphones with senones by Mei-Yuh Hwang, Xuedong Huang, Alleva, F.A.

Published in IEEE transactions on speech and audio processing (01-11-1996)
“…In large-vocabulary speech recognition, we often encounter triphones that are not covered in the training data. These unseen triphones are usually backed off…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
20
Generating a task-adapted acoustic model from one or more supervised and/or unsupervised corpora by Hwang, Mei Yuh

Published in The Journal of the Acoustical Society of America (2007)

Get full text

Journal Article
QR Code
Save to List

Saved in:

Search Results - "Mei-Yuh Hwang"

Domain Adversarial Training for Accented Speech Recognition by Sun, Sining, Yeh, Ching-Feng, Hwang, Mei-Yuh, Ostendorf, Mari, Xie, Lei

Adversarial Regularization for Attention Based End-to-End Robust Speech Recognition by Sun, Sining, Guo, Pengcheng, Xie, Lei, Hwang, Mei-Yuh

Contextual spoken language understanding using recurrent neural networks by Yangyang Shi, Kaisheng Yao, Hu Chen, Yi-Cheng Pan, Mei-Yuh Hwang, Baolin Peng

Region Proposal Network Based Small-Footprint Keyword Spotting by Hou, Jingyong, Shi, Yangyang, Ostendorf, Mari, Hwang, Mei-Yuh, Xie, Lei

Building A Highly Accurate Mandarin Speech Recognizer With Language-Independent Technologies and Language-Dependent Modules by Mei-Yuh Hwang, Gang Peng, Ostendorf, M., Wen Wang, Faria, A., Heidel, A.

Mining Effective Negative Training Samples for Keyword Spotting by Hou, Jingyong, Shi, Yangyang, Ostendorf, Mari, Hwang, Mei-Yuh, Xie, Lei

A factorization network based method for multi-lingual domain classification by Yangyang Shi, Yi-Cheng Pan, Mei-Yuh Hwang, Kaisheng Yao, Hu Chen, Yuanhang Zou, Baolin Peng

Knowledge Distillation for Recurrent Neural Network Language Modeling with Trust Regularization by Shi, Yangyang, Hwang, Mei-Yuh, Lei, Xin, Sheng, Haoyu

End-to-end Speech Recognition Using a High Rank LSTM-CTC Based Model by Shi, Yangyang, Hwang, Mei-Yuh, Lei, Xin

New methods and evaluation experiments on translating TED talks in the IWSLT benchmark by Axelrod, A., Xiaodong He, Li Deng, Acero, A., Mei-Yuh Hwang

Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition by Jing Zheng, Cetin, O., Mei-Yuh Hwang, Xin Lei, Stolcke, A., Morgan, N.

Semi-supervised slot tagging in spoken language understanding using recurrent transductive support vector machines by Yangyang Shi, Kaisheng Yao, Hu Chen, Yi-Cheng Pan, Mei-Yuh Hwang

Multiple fixed beamformers with a spacial Wiener-form postfilter for far-field speech recognition by Sun, Sining, Zhou, Shuran, Hwang, Mei-Yuh, Xie, Lei, Li, Qin, Lei, Xin

Web-data augmented language models for Mandarin conversational speech recognition by Ng, T., Ostendorf, M., Mei-Yuh Hwang, Manhung Siu, Bulyko, I., Xin Lei

A Robust Nonlinear Microphone Array Postfilter for Noise Reduction by Bu, Suliang, Zhao, Yunxin, Hwang, Mei-Yuh, Sun, Sining

Shared-distribution hidden Markov models for speech recognition by Hwang, Mei-Yuh, Huang, Xuedong

Generating a task-adapted acoustic model from one or more different corpora by Hwang, Mei Yuh

Predicting unseen triphones with senones by Mei-Yuh Hwang, Xuedong Huang, Alleva, F.A.

Generating a task-adapted acoustic model from one or more supervised and/or unsupervised corpora by Hwang, Mei Yuh

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication