Search Results - "Chongjia Ni"
-
1
Preventing Early Endpointing for Online Automatic Speech Recognition
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-01-2021)“…With the recent development of end-to-end models in speech recognition, there have been more interests in adapting these models for online speech recognition…”
Get full text
Conference Proceeding -
2
Modification on LSA speech enhancement for speech recognition
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)“…Speech recognition performance deteriorates in face of unknown noise. Speech enhancement offers a solution by reducing the noise in speech at runtime. However,…”
Get full text
Conference Proceeding -
3
Efficient methods to train multilingual bottleneck feature extractors for low resource keyword search
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)“…Training a bottleneck feature (BNF) extractor with multilingual data has been common in low resource keyword search. In a low resource application, the amount…”
Get full text
Conference Proceeding -
4
Unsupervised data selection and word-morph mixed language model for tamil low-resource keyword search
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)“…This paper considers an unsupervised data selection problem for the training data of an acoustic model and the vocabulary coverage of a keyword search system…”
Get full text
Conference Proceeding -
5
From English pitch accent detection to Mandarin stress detection, where is the difference?
Published in Computer speech & language (01-06-2012)“…► The classifier combination method, which is the combination of boosting classification and regression tree and conditional random fields, is employed to…”
Get full text
Journal Article -
6
A keyword-aware grammar framework for LVCSR-based spoken keyword search
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)“…In this paper, we proposed a method to realize the recently developed keyword-aware grammar for LVCSR-based keyword search using weight finite-state automata…”
Get full text
Conference Proceeding -
7
Cross-lingual deep neural network based submodular unbiased data selection for low-resource keyword search
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2016)“…In this paper, we propose a cross-lingual deep neural network (DNN) based submodular unbiased data selection approach for low-resource keyword search (KWS). A…”
Get full text
Conference Proceeding Journal Article -
8
Submodular data selection with acoustic and phonetic features for automatic speech recognition
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)“…In this paper, we propose to use acoustic feature based submodular function optimization to select a subset of untranscribed data for manual transcription, and…”
Get full text
Conference Proceeding -
9
Long short-term memory recurrent neural network based segment features for music genre classification
Published in 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) (01-10-2016)“…In the conventional frame feature based music genre classification methods, the audio data is represented by independent frames and the sequential nature of…”
Get full text
Conference Proceeding -
10
Investigate automatic speech recognition and keyword search for very low-resource language
Published in 2017 IEEE 2nd International Conference on Signal and Image Processing (ICSIP) (01-08-2017)“…In this paper, pronunciation lexicon, multi-lingual bottleneck features, semi-supervised learning, and data selection are investigated to help to improve the…”
Get full text
Conference Proceeding -
11
Investigation of using different Chinese word segmentation standards and algorithms for automatic speech recognition
Published in The 9th International Symposium on Chinese Spoken Language Processing (01-09-2014)“…Chinese word segmentation (CWS) is a necessary step in Mandarin Chinese automatic speech recognition (ASR), and it has an impact on the results of ASR…”
Get full text
Conference Proceeding -
12
A novel codebook representation method and encoding strategy for bag-of-words based acoustic event classification
Published in 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) (01-12-2015)“…The bag-of-words (BoW) model has been widely used for acoustic event classification (AEC). The performance of the BoW based AEC model is much influenced by…”
Get full text
Conference Proceeding -
13
A novel keyword+LVCSR-filler based grammar network representation for spoken keyword search
Published in The 9th International Symposium on Chinese Spoken Language Processing (01-09-2014)“…A novel spoken keyword search grammar representation framework is proposed to combine the advantages of conventional keyword-filler based keyword search (KWS)…”
Get full text
Conference Proceeding -
14
Multiple time-span feature fusion for deep neural network modeling
Published in The 9th International Symposium on Chinese Spoken Language Processing (01-09-2014)“…In this paper, we exploit long term information from multiple time-spans for automatic speech recognition. The multiple time-span information is encoded into…”
Get full text
Conference Proceeding -
15
De'hubert: Disentangling Noise in a Self-Supervised Model for Robust Speech Recognition
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)“…Existing self-supervised pre-trained speech models have offered an effective way to leverage massive unannotated corpora to build good automatic speech…”
Get full text
Conference Proceeding -
16
Contrastive Speech Mixup for Low-Resource Keyword Spotting
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)“…Most of the existing neural-based models for keyword spotting (KWS) in smart devices require thousands of training samples to learn a decent audio…”
Get full text
Conference Proceeding -
17
SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)“…Dual-path is a popular architecture for speech separation models (e.g. Sepformer) which splits long sequences into overlapping chunks for its intra- and…”
Get full text
Conference Proceeding -
18
Are Soft Prompts Good Zero-Shot Learners for Speech Recognition?
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)“…Large self-supervised pre-trained speech models require computationally expensive fine-tuning for downstream tasks. Soft prompt tuning offers a simple…”
Get full text
Conference Proceeding -
19
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)“…Our previously proposed MossFormer has achieved promising performance in monaural speech separation. However, it predominantly adopts a self-attention-based…”
Get full text
Conference Proceeding -
20
Independent Language Modeling Architecture for End-To-End ASR
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…The attention-based end-to-end (E2E) automatic speech recognition (ASR) architecture allows for joint optimization of acoustic and language models within a…”
Get full text
Conference Proceeding