Search Results - "Illina, Irina"
-
1
Topic segmentation in ASR transcripts using bidirectional RNNS for change detection
Published in 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01-12-2017)“…Topic segmentation methods are mostly based on the idea of lexical cohesion, in which lexical distributions are analysed across the document and segment…”
Get full text
Conference Proceeding -
2
Training RNN language models on uncertain ASR hypotheses in limited data scenarios
Published in Computer speech & language (01-01-2024)“…Training domain-specific automatic speech recognition (ASR) systems requires a suitable amount of data comprising the target domain. In several scenarios, such…”
Get full text
Journal Article -
3
DNN Uncertainty Propagation Using GMM-Derived Uncertainty Features for Noise Robust ASR
Published in IEEE signal processing letters (01-03-2018)“…The uncertainty decoding framework is known to improve the deep neural network (DNN)-based automatic speech recognition (ASR) performance in noisy…”
Get full text
Journal Article -
4
A combined evaluation of established and new approaches for speech recognition in varied reverberation conditions
Published in Computer speech & language (01-11-2017)“…•A detailed study of various techniques for reverberation robust ASR is conducted.•Performances are evaluated and compared for new as well as established…”
Get full text
Journal Article -
5
Dynamic adjustment of language models for automatic speech recognition using word similarity
Published in 2016 IEEE Spoken Language Technology Workshop (SLT) (01-12-2016)“…Out-of-vocabulary (OOV) words can pose a particular problem for automatic speech recognition (ASR) of broadcast news. The language models (LMs) of ASR systems…”
Get full text
Conference Proceeding -
6
DNN-Based Mask Estimation for Distributed Speech Enhancement in Spatially Unconstrained Microphone Arrays
Published in IEEE/ACM transactions on audio, speech, and language processing (2021)“…Deep neural network (DNN)-based speech enhancement algorithms in microphone arrays have now proven to be efficient solutions to speech understanding and speech…”
Get full text
Journal Article -
7
VoiceHome-2, an extended corpus for multichannel speech processing in real homes
Published in Speech communication (01-01-2019)“…We present a new, extended version of the voiceHome corpus for distant-microphone speech processing in domestic environments. This 5-hour corpus includes short…”
Get full text
Journal Article -
8
Distributed Speech Separation in Spatially Unconstrained Microphone Arrays
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06-06-2021)“…Speech separation with several speakers is a challenging task because of the non-stationarity of the speech and the strong signal similarity between…”
Get full text
Conference Proceeding -
9
DNN-based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…Multichannel processing is widely used for speech enhancement but several limitations appear when trying to deploy these solutions in the real world…”
Get full text
Conference Proceeding -
10
Discriminative importance weighting of augmented training data for acoustic model training
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)“…DNN based acoustic models require a large amount of training data. Parametric data augmentation techniques such as adding noise, reverberation, or changing the…”
Get full text
Conference Proceeding -
11
BERT and fastText Embeddings for Automatic Detection of Toxic Speech
Published in 2020 International Multi-Conference on: “Organization of Knowledge and Advanced Technologies” (OCTA) (01-02-2020)“…With the expansion of Internet usage, catering to the dissemination of thoughts and expressions of an individual, there has been an immense increase in the…”
Get full text
Conference Proceeding -
12
Modelling Semantic Context of OOV Words in Large Vocabulary Continuous Speech Recognition
Published in IEEE/ACM transactions on audio, speech, and language processing (01-03-2017)“…The diachronic nature of broadcast news data leads to the problem of out-of-vocabulary (OOV) words in large vocabulary continuous speech recognition (LVCSR)…”
Get full text
Journal Article -
13
OOV Proper Name retrieval using topic and lexical context models
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)“…Retrieving Proper Names (PNs) specific to an audio document can be useful for vocabulary selection and OOV recovery in speech recognition, as well as in…”
Get full text
Conference Proceeding -
14
Document level semantic context for retrieving OOV proper names
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2016)“…Recognition of Proper Names (PNs) in speech is important for content based indexing and browsing of audio-video data. However, many PNs are Out-Of-Vocabulary…”
Get full text
Conference Proceeding Journal Article -
15
A wavelet-based parameterization for speech/music discrimination
Published in Computer speech & language (01-04-2010)“…This paper addresses the problem of parameterization for speech/music discrimination. The current successful parameterization based on cepstral coefficients…”
Get full text
Journal Article -
16
DNN-Based Semantic Model for Rescoring N-best Speech Recognition List
Published 02-11-2020“…The word error rate (WER) of an automatic speech recognition (ASR) system increases when a mismatch occurs between the training and the testing conditions due…”
Get full text
Journal Article -
17
Transferring Knowledge via Neighborhood-Aware Optimal Transport for Low-Resource Hate Speech Detection
Published 17-10-2022“…The concerning rise of hateful content on online platforms has increased the attention towards automatic hate speech detection, commonly formulated as a…”
Get full text
Journal Article -
18
Evaluating grapheme-to-phoneme converters in automatic speech recognition context
Published in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2012)“…This paper deals with the evaluation of grapheme-to-phoneme (G2P) converters in a speech recognition context. The precision and recall rates are investigated…”
Get full text
Conference Proceeding -
19
SAMbA: Speech enhancement with Asynchronous ad-hoc Microphone Arrays
Published 31-07-2023“…Speech enhancement in ad-hoc microphone arrays is often hindered by the asynchronization of the devices composing the microphone array. Asynchronization comes…”
Get full text
Journal Article -
20
A Wavelet-Based Parameterization for Speech/Music Discrimination
Published in Computer speech & language (16-01-2010)“…This paper addresses the problem of parameterization for speech/music discrimination. The current successful parameterization based on cepstral coefficients…”
Get full text
Journal Article