Search Results - "Mošner, Ladislav"
1
Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023) “…When recognizing emotions from speech, we encounter two common problems: how to optimally capture emotion-relevant information from the speech signal and how…”
Conference Proceeding -
2
Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019) “…For real-world speech recognition applications, noise robustness is still a challenge. In this work, we adopt the teacher-student (T/S) learning technique…”
Conference Proceeding -
3
13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE
Published in Computer speech & language (01-09-2020) “…• We present a “longitudinal study” of all important milestone techniques used in speaker recognition by evaluating on multiple NIST SREs. • We provide aa…”
Journal Article -
4
MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022) “…Motivated by unconsolidated data situation and the lack of a standard benchmark in the field, we complement our previous efforts and present a comprehensive…”
Conference Proceeding -
5
Dereverberation and Beamforming in Far-Field Speaker Recognition
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2018) “…This paper deals with far-field speaker recognition. On a corpus of NIST SRE 2010 data retransmitted in a real room with multiple microphones, we first…”
Conference Proceeding -
6
Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023) “…Recently, the pre-trained Transformer models have received a rising interest in the field of speech processing thanks to their great success in various…”
Conference Proceeding -
7
Building and evaluation of a real room impulse response dataset
Published in IEEE journal of selected topics in signal processing (01-08-2019) “…This paper presents BUT ReverbDB - a dataset of real room impulse responses (RIR), background noises, and retransmitted speech data. The retransmitted data…”
Journal Article -
8
State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data
Published 03-10-2024 “…In this paper, we refine and validate our method for training speaker embedding extractors using weak annotations. More specifically, we use only the audio…”
Journal Article -
9
CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification
Published 23-09-2024 “…Self-supervised learning (SSL) models for speaker verification (SV) have gained significant attention in recent years. However, existing SSL-based SV systems…”
Journal Article -
10
BUT CHiME-7 system description
Published 18-10-2023 “…This paper describes the joint effort of Brno University of Technology (BUT), AGH University of Krakow and University of Buenos Aires on the development of…”
Journal Article -
11
MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification
Published 11-11-2021 “…Motivated by unconsolidated data situation and the lack of a standard benchmark in the field, we complement our previous efforts and present a comprehensive…”
Journal Article -
12
Improving Speaker Verification with Self-Pretrained Transformer Models
Published 17-05-2023 “…Recently, fine-tuning large pre-trained Transformer models using downstream datasets has received a rising interest. Despite their success, it is still…”
Journal Article -
13
Parameter-efficient transfer learning of pre-trained Transformer models for speaker verification using adapters
Published 28-10-2022 “…Recently, the pre-trained Transformer models have received a rising interest in the field of speech processing thanks to their great success in various…”
Journal Article -
14
Analysis of impact of emotions on target speech extraction and speech separation
Published 15-08-2022 “…Recently, the performance of blind speech separation (BSS) and target speech extraction (TSE) has greatly progressed. Most works, however, focus on relatively…”
Journal Article -
15
Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries
Published 29-03-2022 “…In this paper, we demonstrate a method for training speaker embedding extractors using weak annotation. More specifically, we are using the full VoxCeleb…”
Journal Article -
16
Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings
Published 28-03-2022 “…In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring backends are commonly used, namely cosine scoring…”
Journal Article -
17
Multi-Channel Speaker Verification with Conv-Tasnet Based Beamformer
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022) “…We focus on the problem of speaker recognition in far-field multichannel data. The main contribution is introducing an alternative way of predicting spatial…”
Conference Proceeding -
18
BUT System for the Second DIHARD Speech Diarization Challenge
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020) “…This paper describes the winning systems developed by the BUT team for the four tracks of the Second DIHARD Speech Diarization Challenge. For tracks 1 and 2…”
Conference Proceeding -
19
BUT VOiCES 2019 System Description
Published 13-07-2019 “…This is a description of our effort in VOiCES 2019 Speaker Recognition challenge. All systems in the fixed condition are based on the x-vector paradigm with…”
Journal Article -
20
BUT System for the Second DIHARD Speech Diarization Challenge
Published 26-02-2020 “…This paper describes the winning systems developed by the BUT team for the four tracks of the Second DIHARD Speech Diarization Challenge. For tracks 1 and 2…”
Journal Article