Search Results - "Zmolikova, Katerina"
-
1
SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures
Published in IEEE journal of selected topics in signal processing (01-08-2019)“…The processing of speech corrupted by interfering overlapping speakers is one of the challenging problems with regards to today's automatic speech recognition…”
Get full text
Journal Article -
2
Analysis and interpretation of joint source separation and sound event detection in domestic environments
Published in PloS one (05-07-2024)“…In recent years, the relation between Sound Event Detection (SED) and Source Separation (SSep) has received a growing interest, in particular, with the aim to…”
Get full text
Journal Article -
3
Masked Spectrogram Prediction for Unsupervised Domain Adaptation in Speech Enhancement
Published in IEEE open journal of signal processing (2024)“…Supervised learning-based speech enhancement methods often work remarkably well in acoustic situations represented in the training corpus but generalize poorly…”
Get full text
Journal Article -
4
Single Channel Target Speaker Extraction and Recognition with Speaker Beam
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2018)“…This paper addresses the problem of single channel speech recognition of a target speaker in a mixture of speech signals. We propose to exploit auxiliary…”
Get full text
Conference Proceeding -
5
Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…Target speech extraction, which extracts a single target source in a mixture given clues about the target speaker, has attracted increasing attention. We have…”
Get full text
Conference Proceeding -
6
Speaker Activity Driven Neural Speech Extraction
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06-06-2021)“…Target speech extraction, which extracts the speech of a target speaker in a mixture given auxiliary speaker clues, has recently received increased interest…”
Get full text
Conference Proceeding -
7
Compact Network for Speakerbeam Target Speaker Extraction
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)“…Speech separation that separates a mixture of speech signals into each of its sources has been an active research topic for a long time and has seen recent…”
Get full text
Conference Proceeding -
8
Sequence summarizing neural network for speaker adaptation
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2016)“…In this paper, we propose a DNN adaptation technique, where the i-vector extractor is replaced by a Sequence Summarizing Neural Network (SSNN). Similarly to…”
Get full text
Conference Proceeding Journal Article -
9
Jointly Trained Transformers Models for Spoken Language Translation
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06-06-2021)“…End-to-End and cascade (ASR-MT) spoken language translation (SLT) systems are reaching comparable performances, however, a large degradation is observed when…”
Get full text
Conference Proceeding -
10
But System for the Second Dihard Speech Diarization Challenge
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…This paper describes the winning systems developed by the BUT team for the four tracks of the Second DIHARD Speech Diarization Challenge. For tracks 1 and 2…”
Get full text
Conference Proceeding -
11
Optimization of Speaker-Aware Multichannel Speech Extraction with ASR Criterion
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2018)“…This paper addresses the problem of recognizing speech corrupted by overlapping speakers in a multichannel setting. To extract a target speaker from the…”
Get full text
Conference Proceeding -
12
Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens
Published 04-10-2024“…Cascaded speech-to-speech translation systems often suffer from the error accumulation problem and high latency, which is a result of cascaded modules whose…”
Get full text
Journal Article -
13
Neural Target Speech Extraction: An Overview
Published 31-01-2023“…Humans can listen to a target speaker even in challenging acoustic conditions that have noise, reverberation, and interfering speakers. This phenomenon is…”
Get full text
Journal Article -
14
Source Separation for Sound Event Detection in Domestic Environments using Jointly Trained Models
Published in 2022 International Workshop on Acoustic Signal Enhancement (IWAENC) (05-09-2022)“…Sound Event Detection and Source Separation are closely related tasks: whereas the first aims to find the time boundaries of acoustic events inside a…”
Get full text
Conference Proceeding -
15
Speaker activity driven neural speech extraction
Published 14-01-2021“…Target speech extraction, which extracts the speech of a target speaker in a mixture given auxiliary speaker clues, has recently received increased interest…”
Get full text
Journal Article -
16
Listen only to me! How well can target speech extraction handle false alarms?
Published 10-04-2022“…Target speech extraction (TSE) extracts the speech of a target speaker in a mixture given auxiliary clues characterizing the speaker, such as an enrollment…”
Get full text
Journal Article -
17
Analysis of Impact of Emotions on Target Speech Extraction and Speech Separation
Published in 2022 International Workshop on Acoustic Signal Enhancement (IWAENC) (05-09-2022)“…Recently, the performance of blind speech separation (BSS) and target speech extraction (TSE) has greatly progressed. Most works, however, focus on relatively…”
Get full text
Conference Proceeding -
18
Integration of Variational Autoencoder and Spatial Clustering for Adaptive Multi-Channel Neural Speech Separation
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19-01-2021)“…In this paper, we propose a method combining variational autoencoder model of speech with a spatial clustering approach for multi-channel speech separation…”
Get full text
Conference Proceeding -
19
Integration of variational autoencoder and spatial clustering for adaptive multi-channel neural speech separation
Published 24-11-2020“…In this paper, we propose a method combining variational autoencoder model of speech with a spatial clustering approach for multi-channel speech separation…”
Get full text
Journal Article -
20
Learning speaker representation for neural network based multichannel speaker extraction
Published in 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01-12-2017)“…Recently, schemes employing deep neural networks (DNNs) for extracting speech from noisy observation have demonstrated great potential for noise robust…”
Get full text
Conference Proceeding