Search Results - "Zaiem, Salah"
-
1
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Published in Transactions of the Association for Computational Linguistics (19-09-2022)“…Finding word boundaries in continuous speech is challenging as there is little or no equivalent of a ‘space’ delimiter between words. Popular Bayesian…”
Get full text
Journal Article -
2
Pretext Tasks Selection for Multitask Self-Supervised Audio Representation Learning
Published in IEEE journal of selected topics in signal processing (01-10-2022)“…Through solving pretext tasks, self-supervised learning leverages unlabeled data to extract useful latent representations replacing traditional input features…”
Get full text
Journal Article -
3
Speech self-supervised representations benchmarking: A case for larger probing heads
Published in Computer speech & language (01-01-2025)“…Self-supervised learning (SSL) leverages large datasets of unlabeled speech to reach impressive performance with reduced amounts of annotated data. The high…”
Get full text
Journal Article -
4
Pretext Tasks selection for multitask self-supervised speech representation learning
Published in IEEE journal of selected topics in signal processing (01-10-2022)“…Through solving pretext tasks, self-supervised learning leverages unlabeled data to extract useful latent representations replacing traditional input features…”
Get full text
Journal Article -
5
Leveraging Data Collection and Unsupervised Learning for Code-Switched Tunisian Arabic Automatic Speech Recognition
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)“…Crafting an effective Automatic Speech Recognition (ASR) solution for dialects demands innovative approaches that not only address the data scarcity issue but…”
Get full text
Conference Proceeding -
6
CL-MASR: A Continual Learning Benchmark for Multilingual ASR
Published in IEEE/ACM transactions on audio, speech, and language processing (2024)“…Modern multilingual automatic speech recognition (ASR) systems like Whisper have made it possible to transcribe audio in multiple languages with a single…”
Get full text
Journal Article -
7
End-to-End Speech Recognition from Federated Acoustic Models
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…Training Automatic Speech Recognition (ASR) models under federated learning (FL) settings has attracted a lot of attention recently. However, the FL scenarios…”
Get full text
Conference Proceeding -
8
Less Forgetting for Better Generalization: Exploring Continual-learning Fine-tuning Methods for Speech Self-supervised Representations
Published 30-06-2024“…Despite being trained on massive and diverse datasets, speech self-supervised encoders are generally used for downstream purposes as mere frozen feature…”
Get full text
Journal Article -
9
Fine-Tuning Strategies for Faster Inference Using Speech Self-Supervised Models: A Comparative Study
Published in 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW) (04-06-2023)“…Self-supervised learning (SSL) has allowed substantial progress in Automatic Speech Recognition (ASR) performance in low-resource settings. In this context, it…”
Get full text
Conference Proceeding -
10
Big model only for hard audios: Sample dependent Whisper model selection for efficient inferences
Published 22-09-2023“…Recent progress in Automatic Speech Recognition (ASR) has been coupled with a substantial increase in the model sizes, which may now contain billions of…”
Get full text
Journal Article -
11
Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations
Published 01-06-2023“…Self-Supervised Learning (SSL) has allowed leveraging large amounts of unlabeled speech data to improve the performance of speech recognition models even with…”
Get full text
Journal Article -
12
Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning
Published 08-04-2022“…Contrastive learning enables learning useful audio and speech representations without ground-truth labels by maximizing the similarity between latent…”
Get full text
Journal Article -
13
Sequence to Sequence Learning for Query Expansion
Published 25-12-2018“…Using sequence to sequence algorithms for query expansion has not been explored yet in Information Retrieval literature nor in Question-Answering's. We tried…”
Get full text
Journal Article -
14
Leveraging Data Collection and Unsupervised Learning for Code-switched Tunisian Arabic Automatic Speech Recognition
Published 20-09-2023“…Crafting an effective Automatic Speech Recognition (ASR) solution for dialects demands innovative approaches that not only address the data scarcity issue but…”
Get full text
Journal Article -
15
Speech Self-Supervised Representations Benchmarking: a Case for Larger Probing Heads
Published 28-08-2023“…Self-supervised learning (SSL) leverages large datasets of unlabeled speech to reach impressive performance with reduced amounts of annotated data. The high…”
Get full text
Journal Article -
16
Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?
Published 01-06-2023“…INTERSPEECH 2023 Self-supervised learning (SSL) has recently allowed leveraging large datasets of unlabeled speech signals to reach impressive performance on…”
Get full text
Journal Article -
17
Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative Study
Published 12-03-2023“…Self-supervised learning (SSL) has allowed substantial progress in Automatic Speech Recognition (ASR) performance in low-resource settings. In this context, it…”
Get full text
Journal Article -
18
Conditional independence for pretext task selection in Self-supervised speech representation learning
Published 01-07-2021“…Through solving pretext tasks, self-supervised learning (SSL) leverages unlabeled data to extract useful latent representations replacing traditional input…”
Get full text
Journal Article -
19
Pretext Tasks selection for multitask self-supervised speech representation learning
Published 11-11-2022“…Through solving pretext tasks, self-supervised learning leverages unlabeled data to extract useful latent representations replacing traditional input features…”
Get full text
Journal Article -
20
How Should We Extract Discrete Audio Tokens from Self-Supervised Models?
Published 15-06-2024“…Discrete audio tokens have recently gained attention for their potential to bridge the gap between audio and language processing. Ideal audio tokens must…”
Get full text
Journal Article