Search Results - "Wang, Deliang"
-
1
Ideal ratio mask estimation using deep neural networks for robust speech recognition
Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01-05-2013)“…We propose a feature enhancement algorithm to improve robust automatic speech recognition (ASR). The algorithm estimates a smoothed ideal ratio mask (IRM) in…”
Get full text
Conference Proceeding -
2
Towards Scaling Up Classification-Based Speech Separation
Published in IEEE transactions on audio, speech, and language processing (01-07-2013)“…Formulating speech separation as a binary classification problem has been shown to be effective. While good separation performance is achieved in matched test…”
Get full text
Journal Article -
3
On Training Targets for Supervised Speech Separation
Published in IEEE/ACM transactions on audio, speech, and language processing (01-12-2014)“…Formulation of speech separation as a supervised learning problem has shown considerable promise. In its simplest form, a supervised learning algorithm,…”
Get full text
Journal Article -
4
Investigation of Speech Separation as a Front-End for Noise Robust Speech Recognition
Published in IEEE/ACM transactions on audio, speech, and language processing (01-04-2014)“…Recently, supervised classification has been shown to work well for the task of speech separation. We perform an in-depth evaluation of such techniques as a…”
Get full text
Journal Article -
5
Achieving Color‐Tunable and Time‐Dependent Organic Long Persistent Luminescence via Phosphorescence Energy Transfer for Advanced Anti‐Counterfeiting
Published in Advanced functional materials (01-01-2023)“…Organic ultralong room‐temperature phosphorescence (RTP) materials have promising applications in anti‐counterfeiting. To improve the encryption level, the…”
Get full text
Journal Article -
6
Complex Ratio Masking for Monaural Speech Separation
Published in IEEE/ACM transactions on audio, speech, and language processing (01-03-2016)“…Speech separation systems usually operate on the short-time Fourier transform (STFT) of noisy speech, and enhance only the magnitude spectrum while leaving the…”
Get full text
Journal Article -
7
Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising
Published in IEEE/ACM transactions on audio, speech, and language processing (01-07-2017)“…In real-world situations, speech is masked by both background noise and reverberation, which negatively affect perceptual quality and intelligibility. In this…”
Get full text
Journal Article -
8
Deep Learning Based Binaural Speech Separation in Reverberant Environments
Published in IEEE/ACM transactions on audio, speech, and language processing (01-05-2017)“…Speech signal is usually degraded by room reverberation and additive noises in real environments. This paper focuses on separating target speech signal in…”
Get full text
Journal Article -
9
Features for Masking-Based Monaural Speech Separation in Reverberant Conditions
Published in IEEE/ACM transactions on audio, speech, and language processing (01-05-2017)“…Monaural speech separation is a fundamental problem in speech and signal processing. This problem can be approached from a supervised learning perspective by…”
Get full text
Journal Article -
10
Exploring Monaural Features for Classification-Based Speech Segregation
Published in IEEE transactions on audio, speech, and language processing (01-02-2013)“…Monaural speech segregation has been a very challenging problem for decades. By casting speech segregation as a binary classification problem, recent advances…”
Get full text
Journal Article -
11
Factorization-Based Texture Segmentation
Published in IEEE transactions on image processing (01-11-2015)“…This paper introduces a factorization-based approach that efficiently segments textured images. We use local spectral histograms as features, and construct an…”
Get full text
Journal Article -
12
Towards Model Compression for Deep Learning Based Speech Enhancement
Published in IEEE/ACM transactions on audio, speech, and language processing (2021)“…The use of deep neural networks (DNNs) has dramatically elevated the performance of speech enhancement over the last decade. However, to achieve strong…”
Get full text
Journal Article -
13
A classification based approach to speech segregation
Published in The Journal of the Acoustical Society of America (01-11-2012)“…A key problem in computational auditory scene analysis (CASA) is monaural speech segregation, which has proven to be very challenging. For monaural mixtures,…”
Get full text
Journal Article -
14
An Unsupervised Approach to Cochannel Speech Separation
Published in IEEE transactions on audio, speech, and language processing (01-01-2013)“…Cochannel (two-talker) speech separation is predominantly addressed using pretrained speaker dependent models. In this paper, we propose an unsupervised…”
Get full text
Journal Article -
15
Complex Spectral Mapping for Single- and Multi-Channel Speech Enhancement and Robust ASR
Published in IEEE/ACM transactions on audio, speech, and language processing (2020)“…This study proposes a complex spectral mapping approach for single- and multi-channel speech enhancement, where deep neural networks (DNNs) are used to predict…”
Get full text
Journal Article -
16
A Tandem Algorithm for Pitch Estimation and Voiced Speech Segregation
Published in IEEE transactions on audio, speech, and language processing (01-11-2010)“…A lot of effort has been made in computational auditory scene analysis (CASA) to segregate speech from monaural mixtures. The performance of current CASA…”
Get full text
Journal Article -
17
On Adversarial Training and Loss Functions for Speech Enhancement
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2018)“…Generative adversarial networks (GANs) are becoming increasingly popular for image processing tasks. Researchers have started using GAN s for speech…”
Get full text
Conference Proceeding -
18
Time-Frequency Masking for Speech Separation and Its Potential for Hearing Aid Design
Published in Trends in amplification (01-12-2008)“…A new approach to the separation of speech from speech-in-noise mixtures is the use of time-frequency (T-F) masking. Originated in the field of computational…”
Get full text
Journal Article -
19
Robust Speaker Identification in Noisy and Reverberant Conditions
Published in IEEE/ACM transactions on audio, speech, and language processing (01-04-2014)“…Robustness of speaker recognition systems is crucial for real-world applications, which typically contain both additive noise and room reverberation. However,…”
Get full text
Journal Article -
20
On Cross-Corpus Generalization of Deep Learning Based Speech Enhancement
Published in IEEE/ACM transactions on audio, speech, and language processing (2020)“…In recent years, supervised approaches using deep neural networks (DNNs) have become the mainstream for speech enhancement. It has been established that DNNs…”
Get full text
Journal Article