Search Results - "Chongjia Ni"

1
Preventing Early Endpointing for Online Automatic Speech Recognition by Zhao, Yingzhu, Ni, Chongjia, Leung, Cheung-Chi, Joty, Shafiq, Chng, Eng Siong, Ma, Bin

Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-01-2021)
“…With the recent development of end-to-end models in speech recognition, there have been more interests in adapting these models for online speech recognition…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
2
Modification on LSA speech enhancement for speech recognition by Chang Huai You, Bin Ma, Chongjia Ni

Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)
“…Speech recognition performance deteriorates in face of unknown noise. Speech enhancement offers a solution by reducing the noise in speech at runtime. However,…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
3
Efficient methods to train multilingual bottleneck feature extractors for low resource keyword search by Chongjia Ni, Cheung-Chi Leung, Lei Wang, Chen, Nancy F., Bin Ma

Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)
“…Training a bottleneck feature (BNF) extractor with multilingual data has been common in low resource keyword search. In a low resource application, the amount…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
4
Unsupervised data selection and word-morph mixed language model for tamil low-resource keyword search by Chongjia Ni, Cheung-Chi Leung, Lei Wang, Chen, Nancy F., Bin Ma

Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)
“…This paper considers an unsupervised data selection problem for the training data of an acoustic model and the vocabulary coverage of a keyword search system…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
5
From English pitch accent detection to Mandarin stress detection, where is the difference? by Ni, Chongjia, Liu, Wenju, Xu, Bo

Published in Computer speech & language (01-06-2012)
“…► The classifier combination method, which is the combination of boosting classification and regression tree and conditional random fields, is employed to…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
6
A keyword-aware grammar framework for LVCSR-based spoken keyword search by I-Fan Chen, Chongjia Ni, Boon Pang Lim, Chen, Nancy F., Chin-Hui Lee

Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)
“…In this paper, we proposed a method to realize the recently developed keyword-aware grammar for LVCSR-based keyword search using weight finite-state automata…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
7
Cross-lingual deep neural network based submodular unbiased data selection for low-resource keyword search by Chongjia Ni, Cheung-Chi Leung, Lei Wang, Haibo Liu, Feng Rao, Li Lu, Chen, Nancy F., Bin Ma, Haizhou Li

Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2016)
“…In this paper, we propose a cross-lingual deep neural network (DNN) based submodular unbiased data selection approach for low-resource keyword search (KWS). A…”

Get full text

Conference Proceeding Journal Article
QR Code
Save to List

Saved in:
8
Submodular data selection with acoustic and phonetic features for automatic speech recognition by Chongjia Ni, Lei Wang, Haibo Liu, Cheung-Chi Leung, Li Lu, Bin Ma

Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)
“…In this paper, we propose to use acoustic feature based submodular function optimization to select a subset of untranscribed data for manual transcription, and…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
9
Long short-term memory recurrent neural network based segment features for music genre classification by Jia Dai, Shan Liang, Wei Xue, Chongjia Ni, Wenju Liu

Published in 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) (01-10-2016)
“…In the conventional frame feature based music genre classification methods, the audio data is represented by independent frames and the sequential nature of…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
10
Investigate automatic speech recognition and keyword search for very low-resource language by Chongjia Ni, Bin Ma

Published in 2017 IEEE 2nd International Conference on Signal and Image Processing (ICSIP) (01-08-2017)
“…In this paper, pronunciation lexicon, multi-lingual bottleneck features, semi-supervised learning, and data selection are investigated to help to improve the…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
11
Investigation of using different Chinese word segmentation standards and algorithms for automatic speech recognition by Chongjia Ni, Cheung-Chi Leung

Published in The 9th International Symposium on Chinese Spoken Language Processing (01-09-2014)
“…Chinese word segmentation (CWS) is a necessary step in Mandarin Chinese automatic speech recognition (ASR), and it has an impact on the results of ASR…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
12
A novel codebook representation method and encoding strategy for bag-of-words based acoustic event classification by Jia Dai, Chongjia Ni, Wei Xue, Wenju Liu

Published in 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) (01-12-2015)
“…The bag-of-words (BoW) model has been widely used for acoustic event classification (AEC). The performance of the BoW based AEC model is much influenced by…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
13
A novel keyword+LVCSR-filler based grammar network representation for spoken keyword search by I-Fan Chen, Chongjia Ni, Boon Pang Lim, Chen, Nancy F., Chin-Hui Lee

Published in The 9th International Symposium on Chinese Spoken Language Processing (01-09-2014)
“…A novel spoken keyword search grammar representation framework is proposed to combine the advantages of conventional keyword-filler based keyword search (KWS)…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
14
Multiple time-span feature fusion for deep neural network modeling by Chongjia Ni, Chen, Nancy F., Bin Ma

Published in The 9th International Symposium on Chinese Spoken Language Processing (01-09-2014)
“…In this paper, we exploit long term information from multiple time-spans for automatic speech recognition. The multiple time-span information is encoded into…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
15
De'hubert: Disentangling Noise in a Self-Supervised Model for Robust Speech Recognition by Ng, Dianwen, Zhang, Ruixi, Yip, Jia Qi, Yang, Zhao, Ni, Jinjie, Zhang, Chong, Ma, Yukun, Ni, Chongjia, Chng, Eng Siong, Ma, Bin

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…Existing self-supervised pre-trained speech models have offered an effective way to leverage massive unannotated corpora to build good automatic speech…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
16
Contrastive Speech Mixup for Low-Resource Keyword Spotting by Ng, Dianwen, Zhang, Ruixi, Yip, Jia Qi, Zhang, Chong, Ma, Yukun, Nguyen, Trung Hieu, Ni, Chongjia, Chng, Eng Siong, Ma, Bin

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…Most of the existing neural-based models for keyword spotting (KWS) in smart devices require thousands of training samples to learn a decent audio…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
17
SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance by Yip, Jia Qi, Zhao, Shengkui, Ma, Yukun, Ni, Chongjia, Zhang, Chong, Wang, Hao, Nguyen, Trung Hieu, Zhou, Kun, Ng, Dianwen, Chng, Eng Siong, Ma, Bin

Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)
“…Dual-path is a popular architecture for speech separation models (e.g. Sepformer) which splits long sequences into overlapping chunks for its intra- and…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
18
Are Soft Prompts Good Zero-Shot Learners for Speech Recognition? by Ng, Dianwen, Zhang, Chong, Zhang, Ruixi, Ma, Yukun, Ritter-Gutierrez, Fabian, Nguyen, Trung Hieu, Ni, Chongjia, Zhao, Shengkui, Chng, Eng Siong, Ma, Bin

Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)
“…Large self-supervised pre-trained speech models require computationally expensive fine-tuning for downstream tasks. Soft prompt tuning offers a simple…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
19
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation by Zhao, Shengkui, Ma, Yukun, Ni, Chongjia, Zhang, Chong, Wang, Hao, Nguyen, Trung Hieu, Zhou, Kun, Yip, Jia Qi, Ng, Dianwen, Ma, Bin

Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)
“…Our previously proposed MossFormer has achieved promising performance in monaural speech separation. However, it predominantly adopts a self-attention-based…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
20
Independent Language Modeling Architecture for End-To-End ASR by Pham, Van Tung, Xu, Haihua, Khassanov, Yerbolat, Zeng, Zhiping, Chng, Eng Siong, Ni, Chongjia, Ma, Bin, Li, Haizhou

Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)
“…The attention-based end-to-end (E2E) automatic speech recognition (ASR) architecture allows for joint optimization of acoustic and language models within a…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:

Search Results - "Chongjia Ni"

Preventing Early Endpointing for Online Automatic Speech Recognition by Zhao, Yingzhu, Ni, Chongjia, Leung, Cheung-Chi, Joty, Shafiq, Chng, Eng Siong, Ma, Bin

Modification on LSA speech enhancement for speech recognition by Chang Huai You, Bin Ma, Chongjia Ni

Efficient methods to train multilingual bottleneck feature extractors for low resource keyword search by Chongjia Ni, Cheung-Chi Leung, Lei Wang, Chen, Nancy F., Bin Ma

Unsupervised data selection and word-morph mixed language model for tamil low-resource keyword search by Chongjia Ni, Cheung-Chi Leung, Lei Wang, Chen, Nancy F., Bin Ma

From English pitch accent detection to Mandarin stress detection, where is the difference? by Ni, Chongjia, Liu, Wenju, Xu, Bo

A keyword-aware grammar framework for LVCSR-based spoken keyword search by I-Fan Chen, Chongjia Ni, Boon Pang Lim, Chen, Nancy F., Chin-Hui Lee

Cross-lingual deep neural network based submodular unbiased data selection for low-resource keyword search by Chongjia Ni, Cheung-Chi Leung, Lei Wang, Haibo Liu, Feng Rao, Li Lu, Chen, Nancy F., Bin Ma, Haizhou Li

Submodular data selection with acoustic and phonetic features for automatic speech recognition by Chongjia Ni, Lei Wang, Haibo Liu, Cheung-Chi Leung, Li Lu, Bin Ma

Long short-term memory recurrent neural network based segment features for music genre classification by Jia Dai, Shan Liang, Wei Xue, Chongjia Ni, Wenju Liu

Investigate automatic speech recognition and keyword search for very low-resource language by Chongjia Ni, Bin Ma

Investigation of using different Chinese word segmentation standards and algorithms for automatic speech recognition by Chongjia Ni, Cheung-Chi Leung

A novel codebook representation method and encoding strategy for bag-of-words based acoustic event classification by Jia Dai, Chongjia Ni, Wei Xue, Wenju Liu

A novel keyword+LVCSR-filler based grammar network representation for spoken keyword search by I-Fan Chen, Chongjia Ni, Boon Pang Lim, Chen, Nancy F., Chin-Hui Lee

Multiple time-span feature fusion for deep neural network modeling by Chongjia Ni, Chen, Nancy F., Bin Ma

De'hubert: Disentangling Noise in a Self-Supervised Model for Robust Speech Recognition by Ng, Dianwen, Zhang, Ruixi, Yip, Jia Qi, Yang, Zhao, Ni, Jinjie, Zhang, Chong, Ma, Yukun, Ni, Chongjia, Chng, Eng Siong, Ma, Bin

Contrastive Speech Mixup for Low-Resource Keyword Spotting by Ng, Dianwen, Zhang, Ruixi, Yip, Jia Qi, Zhang, Chong, Ma, Yukun, Nguyen, Trung Hieu, Ni, Chongjia, Chng, Eng Siong, Ma, Bin

SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance by Yip, Jia Qi, Zhao, Shengkui, Ma, Yukun, Ni, Chongjia, Zhang, Chong, Wang, Hao, Nguyen, Trung Hieu, Zhou, Kun, Ng, Dianwen, Chng, Eng Siong, Ma, Bin

Are Soft Prompts Good Zero-Shot Learners for Speech Recognition? by Ng, Dianwen, Zhang, Chong, Zhang, Ruixi, Ma, Yukun, Ritter-Gutierrez, Fabian, Nguyen, Trung Hieu, Ni, Chongjia, Zhao, Shengkui, Chng, Eng Siong, Ma, Bin

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation by Zhao, Shengkui, Ma, Yukun, Ni, Chongjia, Zhang, Chong, Wang, Hao, Nguyen, Trung Hieu, Zhou, Kun, Yip, Jia Qi, Ng, Dianwen, Ma, Bin

Independent Language Modeling Architecture for End-To-End ASR by Pham, Van Tung, Xu, Haihua, Khassanov, Yerbolat, Zeng, Zhiping, Chng, Eng Siong, Ni, Chongjia, Ma, Bin, Li, Haizhou

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication