Search Results - "Mosner, Ladislav"

1
Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing by Kakouros, Sofoklis, Stafylakis, Themos, Mosner, Ladislav, Burget, Lukas

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…When recognizing emotions from speech, we encounter two common problems: how to optimally capture emotion-relevant information from the speech signal and how…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
2
Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning by Mosner, Ladislav, Wu, Minhua, Raju, Anirudh, Krishnan Parthasarathi, Sree Hari, Kumatani, Kenichi, Sundaram, Shiva, Maas, Roland, Hoffmeister, Bjorn

Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)
“…For real-world speech recognition applications, noise robustness is still a challenge. In this work, we adopt the teacher-student (T/S) learning technique…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
3
13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE by Matějka, Pavel, Plchot, Oldřich, Glembek, Ondřej, Burget, Lukáš, Rohdin, Johan, Zeinali, Hossein, Mošner, Ladislav, Silnova, Anna, Novotný, Ondřej, Diez, Mireia, “Honza” Černocký, Jan

Published in Computer speech & language (01-09-2020)
“…•We present a “longitudinal study” of all important milestone techniques used in speaker recognition by evaluating on multiple NIST SREs.•We provide aa…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
4
Multisv: Dataset for Far-Field Multi-Channel Speaker Verification by Mosner, Ladislav, Plchot, Oldrich, Burget, Lukas, Cernocky, Jan Honza

Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)
“…Motivated by unconsolidated data situation and the lack of a standard benchmark in the field, we complement our previous efforts and present a comprehensive…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
5
Dereverberation and Beamforming in Far-Field Speaker Recognition by Mosner, Ladislav, Matejka, Pavel, Novotny, Ondrej, Cernocky, Jan Honza

Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2018)
“…This paper deals with far-field speaker recognition. On a corpus of NIST SRE 2010 data retransmitted in a real room with multiple microphones, we first…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
6
Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters by Peng, Junyi, Stafylakis, Themos, Gu, Rongzhi, Plchot, Oldrich, Mosner, Ladislav, Burget, Lukas, Cernocky, Jan

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…Recently, the pre-trained Transformer models have received a rising interest in the field of speech processing thanks to their great success in various…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
7
Building and evaluation of a real room impulse response dataset by Szoke, Igor, Skacel, Miroslav, Mosner, Ladislav, Paliesek, Jakub, Cernocky, Jan

Published in IEEE journal of selected topics in signal processing (01-08-2019)
“…This paper presents BUT ReverbDB-a dataset of real room impulse responses (RIR), background noises, and retransmitted speech data. The retransmitted data…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
8
Multi-Channel Speaker Verification with Conv-Tasnet Based Beamformer by Mosner, Ladislav, Plchot, Oldrich, Burget, Lukas, Cernocky, Jan Honza

Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)
“…We focus on the problem of speaker recognition in far-field multichannel data. The main contribution is introducing an alternative way of predicting spatial…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
9
But System for the Second Dihard Speech Diarization Challenge by Landini, Federico, Wang, Shuai, Diez, Mireia, Burget, Lukas, Matejka, Pavel, Zmolikova, Katerina, Mosner, Ladislav, Silnova, Anna, Plchot, Oldrich, Novotny, Ondrej, Zeinali, Hossein, Rohdin, Johan

Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)
“…This paper describes the winning systems developed by the BUT team for the four tracks of the Second DIHARD Speech Diarization Challenge. For tracks 1 and 2…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
10
Speech-based emotion recognition with self-supervised models using attentive channel-wise correlations and label smoothing by Kakouros, Sofoklis, Stafylakis, Themos, Mosner, Ladislav, Burget, Lukas

Published 03-11-2022
“…When recognizing emotions from speech, we encounter two common problems: how to optimally capture emotion-relevant information from the speech signal and how…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
11
Extracting Speaker and Emotion Information from Self-Supervised Speech Models via Channel-Wise Correlations by Stafylakis, Themos, Mosner, Ladislav, Kakouros, Sofoklis, Plchot, Oldrich, Burget, Lukas, Cernocky, Jan

Published in 2022 IEEE Spoken Language Technology Workshop (SLT) (09-01-2023)
“…Self-supervised learning of speech representations from large amounts of unlabeled data has enabled state-of-the-art results in several speech processing…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
12
An Attention-Based Backend Allowing Efficient Fine-Tuning of Transformer Models for Speaker Verification by Peng, Junyi, Plchot, Oldrich, Stafylakis, Themos, Mosner, Ladislav, Burget, Lukas, Cernocky, Jan

Published in 2022 IEEE Spoken Language Technology Workshop (SLT) (09-01-2023)
“…In recent years, self-supervised learning paradigm has received extensive attention due to its great success in various down-stream tasks. However, the…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
13
Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations by Stafylakis, Themos, Mosner, Ladislav, Kakouros, Sofoklis, Plchot, Oldrich, Burget, Lukas, Cernocky, Jan

Published 15-10-2022
“…Self-supervised learning of speech representations from large amounts of unlabeled data has enabled state-of-the-art results in several speech processing…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
14
An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification by Peng, Junyi, Plchot, Oldrich, Stafylakis, Themos, Mosner, Ladislav, Burget, Lukas, Cernocky, Jan

Published 03-10-2022
“…In recent years, self-supervised learning paradigm has received extensive attention due to its great success in various down-stream tasks. However, the…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
15
Speaker Verification with Application-Aware Beamforming by Mosner, Ladislav, Plchot, Oldrich, Rohdin, Johan, Burget, Lukas, Cernocky, Jan

Published in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01-12-2019)
“…Multichannel speech processing applications usually employ beamformers as means of speech enhancement through spatial filtering. Beamformers with learnable…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
16
Analysis of Impact of Emotions on Target Speech Extraction and Speech Separation by Svec, Jan, Zmolikova, Katerina, Kocour, Martin, Delcroix, Marc, Ochiai, Tsubasa, Mosner, Ladislav, Cernocky, Jan Honza

Published in 2022 International Workshop on Acoustic Signal Enhancement (IWAENC) (05-09-2022)
“…Recently, the performance of blind speech separation (BSS) and target speech extraction (TSE) has greatly progressed. Most works, however, focus on relatively…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
17
Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch by Silnova, Anna, Stafylakis, Themos, Mosner, Ladislav, Plchot, Oldrich, Rohdin, Johan, Matejka, Pavel, Burget, Lukas, Glembek, Ondrej, Brummer, Niko

Published 19-03-2022
“…In this paper, we analyze the behavior and performance of speaker embeddings and the back-end scoring model under domain and language mismatch. We present our…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
18
Building and Evaluation of a Real Room Impulse Response Dataset by Szoke, Igor, Skacel, Miroslav, Mosner, Ladislav, Paliesek, Jakub, Cernocky, Jan "Honza"

Published 30-05-2019
“…This paper presents BUT ReverbDB - a dataset of real room impulse responses (RIR), background noises and re-transmitted speech data. The retransmitted data…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
19
State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data by Barahona, Sara, Mošner, Ladislav, Stafylakis, Themos, Plchot, Oldřich, Peng, Junyi, Burget, Lukáš, Černocký, Jan

Published 03-10-2024
“…In this paper, we refine and validate our method for training speaker embedding extractors using weak annotations. More specifically, we use only the audio…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
20
CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification by Peng, Junyi, Mošner, Ladislav, Zhang, Lin, Plchot, Oldřich, Stafylakis, Themos, Burget, Lukáš, Černocký, Jan

Published 23-09-2024
“…Self-supervised learning (SSL) models for speaker verification (SV) have gained significant attention in recent years. However, existing SSL-based SV systems…”

Get full text

Journal Article
QR Code
Save to List

Saved in:

Search Results - "Mosner, Ladislav"

Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing by Kakouros, Sofoklis, Stafylakis, Themos, Mosner, Ladislav, Burget, Lukas

Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning by Mosner, Ladislav, Wu, Minhua, Raju, Anirudh, Krishnan Parthasarathi, Sree Hari, Kumatani, Kenichi, Sundaram, Shiva, Maas, Roland, Hoffmeister, Bjorn

13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE by Matějka, Pavel, Plchot, Oldřich, Glembek, Ondřej, Burget, Lukáš, Rohdin, Johan, Zeinali, Hossein, Mošner, Ladislav, Silnova, Anna, Novotný, Ondřej, Diez, Mireia, “Honza” Černocký, Jan

Multisv: Dataset for Far-Field Multi-Channel Speaker Verification by Mosner, Ladislav, Plchot, Oldrich, Burget, Lukas, Cernocky, Jan Honza

Dereverberation and Beamforming in Far-Field Speaker Recognition by Mosner, Ladislav, Matejka, Pavel, Novotny, Ondrej, Cernocky, Jan Honza

Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters by Peng, Junyi, Stafylakis, Themos, Gu, Rongzhi, Plchot, Oldrich, Mosner, Ladislav, Burget, Lukas, Cernocky, Jan

Building and evaluation of a real room impulse response dataset by Szoke, Igor, Skacel, Miroslav, Mosner, Ladislav, Paliesek, Jakub, Cernocky, Jan

Multi-Channel Speaker Verification with Conv-Tasnet Based Beamformer by Mosner, Ladislav, Plchot, Oldrich, Burget, Lukas, Cernocky, Jan Honza

But System for the Second Dihard Speech Diarization Challenge by Landini, Federico, Wang, Shuai, Diez, Mireia, Burget, Lukas, Matejka, Pavel, Zmolikova, Katerina, Mosner, Ladislav, Silnova, Anna, Plchot, Oldrich, Novotny, Ondrej, Zeinali, Hossein, Rohdin, Johan

Speech-based emotion recognition with self-supervised models using attentive channel-wise correlations and label smoothing by Kakouros, Sofoklis, Stafylakis, Themos, Mosner, Ladislav, Burget, Lukas

Extracting Speaker and Emotion Information from Self-Supervised Speech Models via Channel-Wise Correlations by Stafylakis, Themos, Mosner, Ladislav, Kakouros, Sofoklis, Plchot, Oldrich, Burget, Lukas, Cernocky, Jan

An Attention-Based Backend Allowing Efficient Fine-Tuning of Transformer Models for Speaker Verification by Peng, Junyi, Plchot, Oldrich, Stafylakis, Themos, Mosner, Ladislav, Burget, Lukas, Cernocky, Jan

Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations by Stafylakis, Themos, Mosner, Ladislav, Kakouros, Sofoklis, Plchot, Oldrich, Burget, Lukas, Cernocky, Jan

An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification by Peng, Junyi, Plchot, Oldrich, Stafylakis, Themos, Mosner, Ladislav, Burget, Lukas, Cernocky, Jan

Speaker Verification with Application-Aware Beamforming by Mosner, Ladislav, Plchot, Oldrich, Rohdin, Johan, Burget, Lukas, Cernocky, Jan

Analysis of Impact of Emotions on Target Speech Extraction and Speech Separation by Svec, Jan, Zmolikova, Katerina, Kocour, Martin, Delcroix, Marc, Ochiai, Tsubasa, Mosner, Ladislav, Cernocky, Jan Honza

Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch by Silnova, Anna, Stafylakis, Themos, Mosner, Ladislav, Plchot, Oldrich, Rohdin, Johan, Matejka, Pavel, Burget, Lukas, Glembek, Ondrej, Brummer, Niko

Building and Evaluation of a Real Room Impulse Response Dataset by Szoke, Igor, Skacel, Miroslav, Mosner, Ladislav, Paliesek, Jakub, Cernocky, Jan "Honza"

State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data by Barahona, Sara, Mošner, Ladislav, Stafylakis, Themos, Plchot, Oldřich, Peng, Junyi, Burget, Lukáš, Černocký, Jan

CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification by Peng, Junyi, Mošner, Ladislav, Zhang, Lin, Plchot, Oldřich, Stafylakis, Themos, Burget, Lukáš, Černocký, Jan

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication