Source Separation for Sound Event Detection in Domestic Environments using Jointly Trained Models

Bibliographic Details
Published in: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), pp. 1-5
Main Authors: de Benito-Gorron, Diego; Zmolikova, Katerina; Toledano, Doroteo T.
Format: Conference Proceeding
Language: English
Published: IEEE 05-09-2022
Description
Summary: Sound Event Detection and Source Separation are closely related tasks: whereas the former aims to find the time boundaries of acoustic events inside a recording, the goal of the latter is to isolate each of the acoustic sources into different signals. This paper presents a Sound Event Detection system formed by two independently pre-trained blocks for Source Separation and Sound Event Detection. We propose a joint-training scheme, where both blocks are trained at the same time, and a two-stage training scheme, where each block is trained while the other one is frozen. In addition, we compare the use of supervised and unsupervised pre-training for the Separation block, and two model selection strategies for Sound Event Detection. Our experiments show that the proposed methods are able to outperform the baseline systems of the DCASE 2021 Challenge Task 4.
DOI: 10.1109/IWAENC53105.2022.9914755