Search Results - "Diez, Mireia"

  1. 1
  2. 2

    High-performance Query-by-Example Spoken Term Detection on the SWS 2013 evaluation by Rodriguez-Fuentes, Luis J., Varona, Amparo, Penagarikano, Mikel, Bordel, German, Diez, Mireia

    “…In the last years, the task of Query-by-Example Spoken Term Detection (QbE-STD), which aims to find occurrences of a spoken query in a set of audio documents,…”
    Conference Proceeding
  3. 3

    Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks by Landini, Federico, Profant, Ján, Diez, Mireia, Burget, Lukáš

    Published in Computer speech & language (01-01-2022)
    “…The recently proposed VBx diarization method uses a Bayesian hidden Markov model to find speaker clusters in a sequence of x-vectors. In this work we perform…”
    Journal Article
  4. 4

    DiaPer: End-to-End Neural Diarization With Perceiver-Based Attractors by Landini, Federico, Diez, Mireia, Stafylakis, Themos, Burget, Lukas

    “…Until recently, the field of speaker diarization was dominated by cascaded systems. Due to their limitations, mainly regarding overlapped speech and cumbersome…”
    Journal Article
  5. 5

    Analysis of Speaker Diarization Based on Bayesian HMM With Eigenvoice Priors by Diez, Mireia, Burget, Lukas, Landini, Federico, Cernocky, Jan

    “…In our previous work, we introduced our Bayesian Hidden Markov Model with eigenvoice priors, which has been recently recognized as the state-of-the-art model…”
    Journal Article
  6. 6

    Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization by Landini, Federico, Diez, Mireia, Lozano-Diez, Alicia, Burget, Lukas

    “…End-to-end diarization presents an attractive alternative to standard cascaded diarization systems because a single system can handle all aspects of the task…”
    Conference Proceeding
  7. 7

    End-to-End DNN Based Speaker Recognition Inspired by I-Vector and PLDA by Rohdin, Johan, Silnova, Anna, Diez, Mireia, Plchot, Oldrich, Matejka, Pavel, Burget, Lukas

    “…Recently, several end-to-end speaker verification systems based on deep neural networks (DNNs) have been proposed. These systems have been proven to be…”
    Conference Proceeding
  8. 8

    Optimizing Bayesian HMM Based X-Vector Clustering for the Second DIHARD Speech Diarization Challenge by Diez, Mireia, Burget, Lukas, Landini, Federico, Wang, Shuai, Cernocky, Honza

    “…This paper presents an analysis of our diarization system winning the second DIHARD speech diarization challenge, track 1. This system is based on clustering…”
    Conference Proceeding
  9. 9

    Discriminative Training of VBx Diarization by Klement, Dominik, Diez, Mireia, Landini, Federico, Burget, Lukas, Silnova, Anna, Delcroix, Marc, Tawara, Naohiro

    “…Bayesian HMM clustering of x-vector sequences (VBx) has become a widely adopted diarization baseline model in publications and challenges. It uses an HMM to…”
    Conference Proceeding
  10. 10

    DiaCorrect: Error Correction Back-End for Speaker Diarization by Han, Jiangyu, Landini, Federico, Rohdin, Johan, Diez, Mireia, Burget, Lukas, Cao, Yuhang, Lu, Heng, Cernocky, Jan

    “…In this work, we propose an error correction framework, named DiaCorrect, to refine the output of a diarization system in a simple yet effective way. This…”
    Conference Proceeding
  11. 11

    Analysis of the BUT Diarization System for VoxConverse Challenge by Landini, Federico, Glembek, Ondrej, Matejka, Pavel, Rohdin, Johan, Burget, Lukas, Diez, Mireia, Silnova, Anna

    “…This paper describes the system developed by the BUT team for the fourth track of the VoxCeleb Speaker Recognition Challenge, focusing on diarization on the…”
    Conference Proceeding
  12. 12

    End-to-end DNN based text-independent speaker recognition for long and short utterances by Rohdin, Johan, Silnova, Anna, Diez, Mireia, Plchot, Oldřich, Matějka, Pavel, Burget, Lukáš, Glembek, Ondřej

    Published in Computer speech & language (01-01-2020)
    “…Recently several end-to-end speaker verification systems based on deep neural networks (DNNs) have been proposed. These systems have been proven to be…”
    Journal Article
  13. 13

    BUT System for the Second DIHARD Speech Diarization Challenge by Landini, Federico, Wang, Shuai, Diez, Mireia, Burget, Lukas, Matejka, Pavel, Zmolikova, Katerina, Mosner, Ladislav, Silnova, Anna, Plchot, Oldrich, Novotny, Ondrej, Zeinali, Hossein, Rohdin, Johan

    “…This paper describes the winning systems developed by the BUT team for the four tracks of the Second DIHARD Speech Diarization Challenge. For tracks 1 and 2…”
    Conference Proceeding
  14. 14

    13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE by Matějka, Pavel, Plchot, Oldřich, Glembek, Ondřej, Burget, Lukáš, Rohdin, Johan, Zeinali, Hossein, Mošner, Ladislav, Silnova, Anna, Novotný, Ondřej, Diez, Mireia, “Honza” Černocký, Jan

    Published in Computer speech & language (01-09-2020)
    “…• We present a “longitudinal study” of all important milestone techniques used in speaker recognition by evaluating on multiple NIST SREs. • We provide a…”
    Journal Article
  15. 15

    On the Projection of PLLRs for Unbounded Feature Distributions in Spoken Language Recognition by Diez, Mireia, Varona, Amparo, Penagarikano, Mikel, Rodriguez-Fuentes, Luis Javier, Bordel, German

    Published in IEEE signal processing letters (01-09-2014)
    “…The so called Phone Log-Likelihood Ratio (PLLR) features have been recently introduced as a novel and effective way of retrieving acoustic-phonetic information…”
    Journal Article
  16. 16

    On the Complementarity of Phone Posterior Probabilities for Improved Speaker Recognition by Diez, Mireia, Varona, Amparo, Penagarikano, Mikel, Rodriguez-Fuentes, Luis Javier, Bordel, German

    Published in IEEE signal processing letters (01-06-2014)
    “…In this letter, we apply Phone Log-Likelihood Ratio (PLLR) features to the task of speaker recognition. PLLRs, which are computed on the phone posterior…”
    Journal Article
  17. 17

    KALAKA-3: a database for the assessment of spoken language recognition technology on YouTube audios by Rodríguez-Fuentes, Luis Javier, Penagarikano, Mikel, Varona, Amparo, Diez, Mireia, Bordel, Germán

    Published in Language Resources and Evaluation (01-06-2016)
    “…KALAKA-3 is a speech database specifically designed for the development and evaluation of Spoken Language Recognition (SLR) systems. The database provides TV…”
    Journal Article
  18. 18

    DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors by Landini, Federico, Diez, Mireia, Stafylakis, Themos, Burget, Lukáš

    Published 07-12-2023
    “…Until recently, the field of speaker diarization was dominated by cascaded systems. Due to their limitations, mainly regarding overlapped speech and cumbersome…”
    Journal Article
  19. 19

    Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization by Pálka, Petr, Landini, Federico, Klement, Dominik, Diez, Mireia, Silnova, Anna, Delcroix, Marc, Burget, Lukáš

    Published 04-11-2024
    “…In spite of the popularity of end-to-end diarization systems nowadays, modular systems comprised of voice activity detection (VAD), speaker embedding…”
    Journal Article
  20. 20

    Leveraging Self-Supervised Learning for Speaker Diarization by Han, Jiangyu, Landini, Federico, Rohdin, Johan, Silnova, Anna, Diez, Mireia, Burget, Lukas

    Published 14-09-2024
    “…End-to-end neural diarization has evolved considerably over the past few years, but data scarcity is still a major obstacle for further improvements…”
    Journal Article