Search Results - "Pelecanos, Jason"
-
1
Speaker age estimation on conversational telephone speech using senone posterior based i-vectors
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2016)“…Automatic age estimation from speech has a variety of applications including natural human-computer interaction, targeted advertising, customer-agent pairing…”
Get full text
Conference Proceeding Journal Article -
2
Nearest neighbor discriminant analysis for language recognition
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)“…Many state-of-the-art i-vector based voice biometric systems use linear discriminant analysis (LDA) as a post-processing stage to increase the computational…”
Get full text
Conference Proceeding -
3
Nearest neighbor based i-vector normalization for robust speaker recognition under unseen channel conditions
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)“…Many state-of-the-art speaker recognition engines use i-vectors to represent variable-length acoustic signals in a fixed low-dimensional total variability…”
Get full text
Conference Proceeding -
4
Unsupervised channel adaptation for language identification using co-training
Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01-05-2013)“…Language identification (LID) of speech signals in conditions like adverse radio communication channel is a challenging problem. In this paper, we address the…”
Get full text
Conference Proceeding -
5
Enhancing Frequency Shifted Speech Signals in Single Side-Band Communication
Published in IEEE signal processing letters (01-12-2013)“…The spectral quality of speech signals communicated over high-frequency single side band (HF-SSB) radio channels is affected by acoustic artifacts like linear…”
Get full text
Journal Article -
6
A Bayesian Attention Neural Network Layer for Speaker Recognition
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)“…Neural network based attention modeling has found utility in areas such as visual analysis, speech recognition and more recently speaker recognition. Attention…”
Get full text
Conference Proceeding -
7
Online speaker diarization using adapted i-vector transforms
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2016)“…Many speaker diarization systems operate in an off-line mode. Such systems typically find homogeneous segments and then cluster these segments according to…”
Get full text
Conference Proceeding Journal Article -
8
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)“…We introduce a multilingual speaker change detection model (USM-SCD) that can simultaneously detect speaker turns and perform ASR for 96 languages. This model…”
Get full text
Conference Proceeding -
9
Using Polynomial Kernel Support Vector Machines for Speaker Verification
Published in IEEE signal processing letters (01-09-2013)“…In this letter, we propose a discriminative modeling approach for the speaker verification problem that uses polynomial kernel support vector machines…”
Get full text
Journal Article -
10
Feature normalization for speaker verification in room reverberation
Published in 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2011)“…The performance of a typical speaker verification system degrades significantly in reverberant environments. This degradation is partly due to the conventional…”
Get full text
Conference Proceeding -
11
A novel approach to detecting non-native speakers and their native language
Published in 2010 IEEE International Conference on Acoustics, Speech and Signal Processing (01-03-2010)“…Speech contains valuable information regarding the traits of speakers. This paper investigates two aspects of this information. The first is automatic…”
Get full text
Conference Proceeding -
12
Speaker diarization: A perspective on challenges and opportunities from theory to practice
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)“…This paper discusses some challenges and opportunities in developing a speaker diarization system for operation on real world call center telephony data. We…”
Get full text
Conference Proceeding -
13
Unifying PLDA and polynomial kernel SVMS
Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01-05-2013)“…Probabilistic linear discriminant analysis (PLDA) is a generative model to explain between and within class variations. When the underlying latent variables…”
Get full text
Conference Proceeding -
14
Synth2Aug: Cross-Domain Speaker Recognition with TTS Synthesized Speech
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19-01-2021)“…In recent years, Text-To-Speech (TTS) has been used as a data augmentation technique for speech recognition to help complement inadequacies in the training…”
Get full text
Conference Proceeding -
15
Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition
Published 05-04-2021“…Many neural network speaker recognition systems model each speaker using a fixed-dimensional embedding vector. These embeddings are generally compared using…”
Get full text
Journal Article -
16
Parameter-Free Attentive Scoring for Speaker Verification
Published 10-03-2022“…This paper presents a novel study of parameter-free attentive scoring for speaker verification. Parameter-free scoring provides the flexibility of comparing…”
Get full text
Journal Article -
17
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models
Published 14-09-2023“…We introduce a multilingual speaker change detection model (USM-SCD) that can simultaneously detect speaker turns and perform ASR for 96 languages. This model…”
Get full text
Journal Article -
18
SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System
Published 05-04-2021“…In this paper, we describe SpeakerStew - a hybrid system to perform speaker verification on 46 languages. Two core ideas were explored in this system: (1)…”
Get full text
Journal Article -
19
Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech
Published 23-11-2020“…In recent years, Text-To-Speech (TTS) has been used as a data augmentation technique for speech recognition to help complement inadequacies in the training…”
Get full text
Journal Article -
20
Keyword-conditioned phone N-gram modeling with contextual information for speaker verification
Published in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2012)“…In this paper we present our current work on automatic speaker recognition using keyword-conditioned phone N-gram modeling. We propose the use of contextual…”
Get full text
Conference Proceeding