Speech Source Separation using Variational Autoencoder and Bandpass Filter

Speech source separation is essential for speech-related applications because this process enhances the input speech signal for the main processing model. Most of the current approaches for this task focus on separating the speech of commonly high-frequency noises or a particular background sound. T...

Full description

Saved in:

Bibliographic Details
Published in:	IEEE access Vol. 8; p. 1
Main Authors:	Do, Hao D., Tran, Son T., Chau, Duc T.
Format:	Journal Article
Language:	English
Published:	Piscataway IEEE 01-01-2020 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects:	Acoustics Background noise bandpass filter Bandpass filters Frequency ranges Generative model Separation Signal processing Speech Speech recognition speech separation Speech sounds variational autoencoder Verbal learning
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Speech source separation is essential for speech-related applications because this process enhances the input speech signal for the main processing model. Most of the current approaches for this task focus on separating the speech of commonly high-frequency noises or a particular background sound. They cannot clear the signals which intersect with the human speech in its frequency range. To deal with this problem, we propose a hybrid approach combining a variational autoencoder (VAE) and a bandpass filter (BPF). This method can extract and enhance the speech signal in the mixture of many elements such as speech signal, the high-frequency noises, and many kinds of different background sounds which interfere with the speech sound. Experimental results showed that our model can extract effectively the speech signal with 15.02 dB in Signal to Interference Ratio (SIR) and 12.99 dB in Signal to Distortion Ratio (SDR). On the other hand, we can adjust the passband to identify the range of frequency at the output signal to apply for a particular application like gender classification.
ISSN:	2169-3536 2169-3536
DOI:	10.1109/ACCESS.2020.3019495