Search Results - "Mošner, Ladislav"

1
Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing by Kakouros, Sofoklis, Stafylakis, Themos, Mosner, Ladislav, Burget, Lukas

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…When recognizing emotions from speech, we encounter two common problems: how to optimally capture emotion-relevant information from the speech signal and how…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
2
Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning by Mosner, Ladislav, Wu, Minhua, Raju, Anirudh, Krishnan Parthasarathi, Sree Hari, Kumatani, Kenichi, Sundaram, Shiva, Maas, Roland, Hoffmeister, Bjorn

Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)
“…For real-world speech recognition applications, noise robustness is still a challenge. In this work, we adopt the teacher-student (T/S) learning technique…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
3
13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE by Matějka, Pavel, Plchot, Oldřich, Glembek, Ondřej, Burget, Lukáš, Rohdin, Johan, Zeinali, Hossein, Mošner, Ladislav, Silnova, Anna, Novotný, Ondřej, Diez, Mireia, “Honza” Černocký, Jan

Published in Computer speech & language (01-09-2020)
“…•We present a “longitudinal study” of all important milestone techniques used in speaker recognition by evaluating on multiple NIST SREs.•We provide aa…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
4
Multisv: Dataset for Far-Field Multi-Channel Speaker Verification by Mosner, Ladislav, Plchot, Oldrich, Burget, Lukas, Cernocky, Jan Honza

Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)
“…Motivated by unconsolidated data situation and the lack of a standard benchmark in the field, we complement our previous efforts and present a comprehensive…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
5
Dereverberation and Beamforming in Far-Field Speaker Recognition by Mosner, Ladislav, Matejka, Pavel, Novotny, Ondrej, Cernocky, Jan Honza

Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2018)
“…This paper deals with far-field speaker recognition. On a corpus of NIST SRE 2010 data retransmitted in a real room with multiple microphones, we first…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
6
Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters by Peng, Junyi, Stafylakis, Themos, Gu, Rongzhi, Plchot, Oldrich, Mosner, Ladislav, Burget, Lukas, Cernocky, Jan

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…Recently, the pre-trained Transformer models have received a rising interest in the field of speech processing thanks to their great success in various…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
7
Building and evaluation of a real room impulse response dataset by Szoke, Igor, Skacel, Miroslav, Mosner, Ladislav, Paliesek, Jakub, Cernocky, Jan

Published in IEEE journal of selected topics in signal processing (01-08-2019)
“…This paper presents BUT ReverbDB-a dataset of real room impulse responses (RIR), background noises, and retransmitted speech data. The retransmitted data…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
8
State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data by Barahona, Sara, Mošner, Ladislav, Stafylakis, Themos, Plchot, Oldřich, Peng, Junyi, Burget, Lukáš, Černocký, Jan

Published 03-10-2024
“…In this paper, we refine and validate our method for training speaker embedding extractors using weak annotations. More specifically, we use only the audio…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
9
CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification by Peng, Junyi, Mošner, Ladislav, Zhang, Lin, Plchot, Oldřich, Stafylakis, Themos, Burget, Lukáš, Černocký, Jan

Published 23-09-2024
“…Self-supervised learning (SSL) models for speaker verification (SV) have gained significant attention in recent years. However, existing SSL-based SV systems…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
10
BUT CHiME-7 system description by Karafiát, Martin, Veselý, Karel, Szöke, Igor, Mošner, Ladislav, Beneš, Karel, Witkowski, Marcin, Barchi, Germán, Pepino, Leonardo

Published 18-10-2023
“…This paper describes the joint effort of Brno University of Technology (BUT), AGH University of Krakow and University of Buenos Aires on the development of…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
11
MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification by Mošner, Ladislav, Plchot, Oldřich, Burget, Lukáš, Černocký, Jan

Published 11-11-2021
“…Motivated by unconsolidated data situation and the lack of a standard benchmark in the field, we complement our previous efforts and present a comprehensive…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
12
Improving Speaker Verification with Self-Pretrained Transformer Models by Peng, Junyi, Plchot, Oldřich, Stafylakis, Themos, Mošner, Ladislav, Burget, Lukáš, Černocký, Jan

Published 17-05-2023
“…Recently, fine-tuning large pre-trained Transformer models using downstream datasets has received a rising interest. Despite their success, it is still…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
13
Parameter-efficient transfer learning of pre-trained Transformer models for speaker verification using adapters by Peng, Junyi, Stafylakis, Themos, Gu, Rongzhi, Plchot, Oldřich, Mošner, Ladislav, Burget, Lukáš, Černocký, Jan

Published 28-10-2022
“…Recently, the pre-trained Transformer models have received a rising interest in the field of speech processing thanks to their great success in various…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
14
Analysis of impact of emotions on target speech extraction and speech separation by Švec, Ján, Žmolíková, Kateřina, Kocour, Martin, Delcroix, Marc, Ochiai, Tsubasa, Mošner, Ladislav, Černocký, Jan

Published 15-08-2022
“…Recently, the performance of blind speech separation (BSS) and target speech extraction (TSE) has greatly progressed. Most works, however, focus on relatively…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
15
Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries by Stafylakis, Themos, Mošner, Ladislav, Plchot, Oldřich, Rohdin, Johan, Silnova, Anna, Burget, Lukáš, Černocký, Jan "Honza''

Published 29-03-2022
“…In this paper, we demonstrate a method for training speaker embedding extractors using weak annotation. More specifically, we are using the full VoxCeleb…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
16
Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings by Brümmer, Niko, Swart, Albert, Mošner, Ladislav, Silnova, Anna, Plchot, Oldřich, Stafylakis, Themos, Burget, Lukáš

Published 28-03-2022
“…In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring backends are commonly used, namely cosine scoring…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
17
Multi-Channel Speaker Verification with Conv-Tasnet Based Beamformer by Mosner, Ladislav, Plchot, Oldrich, Burget, Lukas, Cernocky, Jan Honza

Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)
“…We focus on the problem of speaker recognition in far-field multichannel data. The main contribution is introducing an alternative way of predicting spatial…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
18
But System for the Second Dihard Speech Diarization Challenge by Landini, Federico, Wang, Shuai, Diez, Mireia, Burget, Lukas, Matejka, Pavel, Zmolikova, Katerina, Mosner, Ladislav, Silnova, Anna, Plchot, Oldrich, Novotny, Ondrej, Zeinali, Hossein, Rohdin, Johan

Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)
“…This paper describes the winning systems developed by the BUT team for the four tracks of the Second DIHARD Speech Diarization Challenge. For tracks 1 and 2…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
19
BUT VOiCES 2019 System Description by Zeinali, Hossein, Matějka, Pavel, Mošner, Ladislav, Plchot, Oldřich, Silnova, Anna, Novotný, Ondřej, Profant, Ján, Glembek, Ondřej, Burget, Lukáš

Published 13-07-2019
“…This is a description of our effort in VOiCES 2019 Speaker Recognition challenge. All systems in the fixed condition are based on the x-vector paradigm with…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
20
BUT System for the Second DIHARD Speech Diarization Challenge by Landini, Federico, Wang, Shuai, Diez, Mireia, Burget, Lukáš, Matějka, Pavel, Žmolíková, Kateřina, Mošner, Ladislav, Silnova, Anna, Plchot, Oldřich, Novotný, Ondřej, Zeinali, Hossein, Rohdin, Johan

Published 26-02-2020
“…This paper describes the winning systems developed by the BUT team for the four tracks of the Second DIHARD Speech Diarization Challenge. For tracks 1 and 2…”

Get full text

Journal Article
QR Code
Save to List

Saved in:

Search Results - "Mošner, Ladislav"

Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing by Kakouros, Sofoklis, Stafylakis, Themos, Mosner, Ladislav, Burget, Lukas

Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning by Mosner, Ladislav, Wu, Minhua, Raju, Anirudh, Krishnan Parthasarathi, Sree Hari, Kumatani, Kenichi, Sundaram, Shiva, Maas, Roland, Hoffmeister, Bjorn

13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE by Matějka, Pavel, Plchot, Oldřich, Glembek, Ondřej, Burget, Lukáš, Rohdin, Johan, Zeinali, Hossein, Mošner, Ladislav, Silnova, Anna, Novotný, Ondřej, Diez, Mireia, “Honza” Černocký, Jan

Multisv: Dataset for Far-Field Multi-Channel Speaker Verification by Mosner, Ladislav, Plchot, Oldrich, Burget, Lukas, Cernocky, Jan Honza

Dereverberation and Beamforming in Far-Field Speaker Recognition by Mosner, Ladislav, Matejka, Pavel, Novotny, Ondrej, Cernocky, Jan Honza

Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters by Peng, Junyi, Stafylakis, Themos, Gu, Rongzhi, Plchot, Oldrich, Mosner, Ladislav, Burget, Lukas, Cernocky, Jan

Building and evaluation of a real room impulse response dataset by Szoke, Igor, Skacel, Miroslav, Mosner, Ladislav, Paliesek, Jakub, Cernocky, Jan

State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data by Barahona, Sara, Mošner, Ladislav, Stafylakis, Themos, Plchot, Oldřich, Peng, Junyi, Burget, Lukáš, Černocký, Jan

CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification by Peng, Junyi, Mošner, Ladislav, Zhang, Lin, Plchot, Oldřich, Stafylakis, Themos, Burget, Lukáš, Černocký, Jan

BUT CHiME-7 system description by Karafiát, Martin, Veselý, Karel, Szöke, Igor, Mošner, Ladislav, Beneš, Karel, Witkowski, Marcin, Barchi, Germán, Pepino, Leonardo

MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification by Mošner, Ladislav, Plchot, Oldřich, Burget, Lukáš, Černocký, Jan

Improving Speaker Verification with Self-Pretrained Transformer Models by Peng, Junyi, Plchot, Oldřich, Stafylakis, Themos, Mošner, Ladislav, Burget, Lukáš, Černocký, Jan

Parameter-efficient transfer learning of pre-trained Transformer models for speaker verification using adapters by Peng, Junyi, Stafylakis, Themos, Gu, Rongzhi, Plchot, Oldřich, Mošner, Ladislav, Burget, Lukáš, Černocký, Jan

Analysis of impact of emotions on target speech extraction and speech separation by Švec, Ján, Žmolíková, Kateřina, Kocour, Martin, Delcroix, Marc, Ochiai, Tsubasa, Mošner, Ladislav, Černocký, Jan

Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries by Stafylakis, Themos, Mošner, Ladislav, Plchot, Oldřich, Rohdin, Johan, Silnova, Anna, Burget, Lukáš, Černocký, Jan "Honza''

Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings by Brümmer, Niko, Swart, Albert, Mošner, Ladislav, Silnova, Anna, Plchot, Oldřich, Stafylakis, Themos, Burget, Lukáš

Multi-Channel Speaker Verification with Conv-Tasnet Based Beamformer by Mosner, Ladislav, Plchot, Oldrich, Burget, Lukas, Cernocky, Jan Honza

But System for the Second Dihard Speech Diarization Challenge by Landini, Federico, Wang, Shuai, Diez, Mireia, Burget, Lukas, Matejka, Pavel, Zmolikova, Katerina, Mosner, Ladislav, Silnova, Anna, Plchot, Oldrich, Novotny, Ondrej, Zeinali, Hossein, Rohdin, Johan

BUT VOiCES 2019 System Description by Zeinali, Hossein, Matějka, Pavel, Mošner, Ladislav, Plchot, Oldřich, Silnova, Anna, Novotný, Ondřej, Profant, Ján, Glembek, Ondřej, Burget, Lukáš

BUT System for the Second DIHARD Speech Diarization Challenge by Landini, Federico, Wang, Shuai, Diez, Mireia, Burget, Lukáš, Matějka, Pavel, Žmolíková, Kateřina, Mošner, Ladislav, Silnova, Anna, Plchot, Oldřich, Novotný, Ondřej, Zeinali, Hossein, Rohdin, Johan

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication