Search Results - "Mosner, Ladislav"
-
1
Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)“…When recognizing emotions from speech, we encounter two common problems: how to optimally capture emotion-relevant information from the speech signal and how…”
Get full text
Conference Proceeding -
2
Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)“…For real-world speech recognition applications, noise robustness is still a challenge. In this work, we adopt the teacher-student (T/S) learning technique…”
Get full text
Conference Proceeding -
3
13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE
Published in Computer speech & language (01-09-2020)“…•We present a “longitudinal study” of all important milestone techniques used in speaker recognition by evaluating on multiple NIST SREs.•We provide aa…”
Get full text
Journal Article -
4
Multisv: Dataset for Far-Field Multi-Channel Speaker Verification
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…Motivated by unconsolidated data situation and the lack of a standard benchmark in the field, we complement our previous efforts and present a comprehensive…”
Get full text
Conference Proceeding -
5
Dereverberation and Beamforming in Far-Field Speaker Recognition
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2018)“…This paper deals with far-field speaker recognition. On a corpus of NIST SRE 2010 data retransmitted in a real room with multiple microphones, we first…”
Get full text
Conference Proceeding -
6
Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)“…Recently, the pre-trained Transformer models have received a rising interest in the field of speech processing thanks to their great success in various…”
Get full text
Conference Proceeding -
7
Building and evaluation of a real room impulse response dataset
Published in IEEE journal of selected topics in signal processing (01-08-2019)“…This paper presents BUT ReverbDB-a dataset of real room impulse responses (RIR), background noises, and retransmitted speech data. The retransmitted data…”
Get full text
Journal Article -
8
Multi-Channel Speaker Verification with Conv-Tasnet Based Beamformer
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…We focus on the problem of speaker recognition in far-field multichannel data. The main contribution is introducing an alternative way of predicting spatial…”
Get full text
Conference Proceeding -
9
But System for the Second Dihard Speech Diarization Challenge
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…This paper describes the winning systems developed by the BUT team for the four tracks of the Second DIHARD Speech Diarization Challenge. For tracks 1 and 2…”
Get full text
Conference Proceeding -
10
Speech-based emotion recognition with self-supervised models using attentive channel-wise correlations and label smoothing
Published 03-11-2022“…When recognizing emotions from speech, we encounter two common problems: how to optimally capture emotion-relevant information from the speech signal and how…”
Get full text
Journal Article -
11
Extracting Speaker and Emotion Information from Self-Supervised Speech Models via Channel-Wise Correlations
Published in 2022 IEEE Spoken Language Technology Workshop (SLT) (09-01-2023)“…Self-supervised learning of speech representations from large amounts of unlabeled data has enabled state-of-the-art results in several speech processing…”
Get full text
Conference Proceeding -
12
An Attention-Based Backend Allowing Efficient Fine-Tuning of Transformer Models for Speaker Verification
Published in 2022 IEEE Spoken Language Technology Workshop (SLT) (09-01-2023)“…In recent years, self-supervised learning paradigm has received extensive attention due to its great success in various down-stream tasks. However, the…”
Get full text
Conference Proceeding -
13
Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations
Published 15-10-2022“…Self-supervised learning of speech representations from large amounts of unlabeled data has enabled state-of-the-art results in several speech processing…”
Get full text
Journal Article -
14
An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification
Published 03-10-2022“…In recent years, self-supervised learning paradigm has received extensive attention due to its great success in various down-stream tasks. However, the…”
Get full text
Journal Article -
15
Speaker Verification with Application-Aware Beamforming
Published in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01-12-2019)“…Multichannel speech processing applications usually employ beamformers as means of speech enhancement through spatial filtering. Beamformers with learnable…”
Get full text
Conference Proceeding -
16
Analysis of Impact of Emotions on Target Speech Extraction and Speech Separation
Published in 2022 International Workshop on Acoustic Signal Enhancement (IWAENC) (05-09-2022)“…Recently, the performance of blind speech separation (BSS) and target speech extraction (TSE) has greatly progressed. Most works, however, focus on relatively…”
Get full text
Conference Proceeding -
17
Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch
Published 19-03-2022“…In this paper, we analyze the behavior and performance of speaker embeddings and the back-end scoring model under domain and language mismatch. We present our…”
Get full text
Journal Article -
18
Building and Evaluation of a Real Room Impulse Response Dataset
Published 30-05-2019“…This paper presents BUT ReverbDB - a dataset of real room impulse responses (RIR), background noises and re-transmitted speech data. The retransmitted data…”
Get full text
Journal Article -
19
State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data
Published 03-10-2024“…In this paper, we refine and validate our method for training speaker embedding extractors using weak annotations. More specifically, we use only the audio…”
Get full text
Journal Article -
20
CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification
Published 23-09-2024“…Self-supervised learning (SSL) models for speaker verification (SV) have gained significant attention in recent years. However, existing SSL-based SV systems…”
Get full text
Journal Article