Search Results - "Silnova, Anna"
-
1
Advancing speaker embedding learning: Wespeaker toolkit for research and production
Published in Speech communication (01-07-2024)“…Speaker modeling plays a crucial role in various tasks, and fixed-dimensional vector representations, known as speaker embeddings, are the predominant modeling…”
Get full text
Journal Article -
2
Speaker Verification Using End-to-end Adversarial Language Adaptation
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)“…In this paper we investigate the use of adversarial domain adaptation for addressing the problem of language mismatch between speaker recognition corpora. In…”
Get full text
Conference Proceeding -
3
End-to-End DNN Based Speaker Recognition Inspired by I-Vector and PLDA
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2018)“…Recently, several end-to-end speaker verification systems based on deep neural networks (DNNs) have been proposed. These systems have been proven to be…”
Get full text
Conference Proceeding -
4
Discriminative Training of VBx Diarization
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)“…Bayesian HMM clustering of x-vector sequences (VBx) has become a widely adopted diarization baseline model in publications and challenges. It uses an HMM to…”
Get full text
Conference Proceeding -
5
Analysis of the but Diarization System for Voxconverse Challenge
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06-06-2021)“…This paper describes the system developed by the BUT team for the fourth track of the VoxCeleb Speaker Recognition Challenge, focusing on diarization on the…”
Get full text
Conference Proceeding -
6
Toroidal Probabilistic Spherical Discriminant Analysis
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)“…In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring back-ends are commonly used, namely cosine scoring…”
Get full text
Conference Proceeding -
7
End-to-end DNN based text-independent speaker recognition for long and short utterances
Published in Computer speech & language (01-01-2020)“…Recently several end-to-end speaker verification systems based on deep neural networks (DNNs) have been proposed. These systems have been proven to be…”
Get full text
Journal Article -
8
But System for the Second Dihard Speech Diarization Challenge
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…This paper describes the winning systems developed by the BUT team for the four tracks of the Second DIHARD Speech Diarization Challenge. For tracks 1 and 2…”
Get full text
Conference Proceeding -
9
13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE
Published in Computer speech & language (01-09-2020)“…•We present a “longitudinal study” of all important milestone techniques used in speaker recognition by evaluating on multiple NIST SREs.•We provide aa…”
Get full text
Journal Article -
10
Challenging margin-based speaker embedding extractors by using the variational information bottleneck
Published 18-06-2024“…Speaker embedding extractors are typically trained using a classification loss over the training speakers. During the last few years, the standard…”
Get full text
Journal Article -
11
Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization
Published 04-11-2024“…In spite of the popularity of end-to-end diarization systems nowadays, modular systems comprised of voice activity detection (VAD), speaker embedding…”
Get full text
Journal Article -
12
Leveraging Self-Supervised Learning for Speaker Diarization
Published 14-09-2024“…End-to-end neural diarization has evolved considerably over the past few years, but data scarcity is still a major obstacle for further improvements…”
Get full text
Journal Article -
13
Toroidal Probabilistic Spherical Discriminant Analysis
Published 27-10-2022“…In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring back-ends are commonly used, namely cosine scoring…”
Get full text
Journal Article -
14
Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?
Published 29-02-2024“…In this paper, we apply the variational information bottleneck approach to end-to-end neural diarization with encoder-decoder attractors (EEND-EDA). This…”
Get full text
Journal Article -
15
Discriminative Training of VBx Diarization
Published 04-10-2023“…Bayesian HMM clustering of x-vector sequences (VBx) has become a widely adopted diarization baseline model in publications and challenges. It uses an HMM to…”
Get full text
Journal Article -
16
BUT Systems and Analyses for the ASVspoof 5 Challenge
Published 20-08-2024“…This paper describes the BUT submitted systems for the ASVspoof 5 challenge, along with analyses. For the conventional deepfake detection task, we use ResNet18…”
Get full text
Journal Article -
17
Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
Published 22-05-2023“…Combining end-to-end neural speaker diarization (EEND) with vector clustering (VC), known as EEND-VC, has gained interest for leveraging the strengths of both…”
Get full text
Journal Article -
18
Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries
Published 29-03-2022“…In this paper, we demonstrate a method for training speaker embedding extractors using weak annotation. More specifically, we are using the full VoxCeleb…”
Get full text
Journal Article -
19
Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings
Published 28-03-2022“…In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring backends are commonly used, namely cosine scoring…”
Get full text
Journal Article -
20
Probabilistic embeddings for speaker diarization
Published 06-04-2020“…Speaker embeddings (x-vectors) extracted from very short segments of speech have recently been shown to give competitive performance in speaker diarization. We…”
Get full text
Journal Article