Search Results - "Pelecanos, Jason"

1
Speaker age estimation on conversational telephone speech using senone posterior based i-vectors by Sadjadi, Seyed Omid, Ganapathy, Sriram, Pelecanos, Jason W.

Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2016)
“…Automatic age estimation from speech has a variety of applications including natural human-computer interaction, targeted advertising, customer-agent pairing…”

Get full text

Conference Proceeding Journal Article
QR Code
Save to List

Saved in:
2
Nearest neighbor discriminant analysis for language recognition by Sadjadi, Seyed Omid, Pelecanos, Jason W., Ganapathy, Sriram

Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)
“…Many state-of-the-art i-vector based voice biometric systems use linear discriminant analysis (LDA) as a post-processing stage to increase the computational…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
3
Nearest neighbor based i-vector normalization for robust speaker recognition under unseen channel conditions by Weizhong Zhu, Sadjadi, Seyed Omid, Pelecanos, Jason W.

Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2015)
“…Many state-of-the-art speaker recognition engines use i-vectors to represent variable-length acoustic signals in a fixed low-dimensional total variability…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
4
Unsupervised channel adaptation for language identification using co-training by Ganapathy, Sriram, Omar, Mohamed, Pelecanos, Jason

Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01-05-2013)
“…Language identification (LID) of speech signals in conditions like adverse radio communication channel is a challenging problem. In this paper, we address the…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
5
Enhancing Frequency Shifted Speech Signals in Single Side-Band Communication by Ganapathy, Sriram, Pelecanos, Jason

Published in IEEE signal processing letters (01-12-2013)
“…The spectral quality of speech signals communicated over high-frequency single side band (HF-SSB) radio channels is affected by acoustic artifacts like linear…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
6
A Bayesian Attention Neural Network Layer for Speaker Recognition by Zhu, Weizhong, Pelecanos, Jason

Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)
“…Neural network based attention modeling has found utility in areas such as visual analysis, speech recognition and more recently speaker recognition. Attention…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
7
Online speaker diarization using adapted i-vector transforms by Weizhong Zhu, Pelecanos, Jason

Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2016)
“…Many speaker diarization systems operate in an off-line mode. Such systems typically find homogeneous segments and then cluster these segments according to…”

Get full text

Conference Proceeding Journal Article
QR Code
Save to List

Saved in:
8
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models by Zhao, Guanlong, Wang, Yongqiang, Pelecanos, Jason, Zhang, Yu, Liao, Hank, Huang, Yiling, Lu, Han, Wang, Quan

Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)
“…We introduce a multilingual speaker change detection model (USM-SCD) that can simultaneously detect speaker turns and perform ASR for 96 languages. This model…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
9
Using Polynomial Kernel Support Vector Machines for Speaker Verification by Yaman, S., Pelecanos, J.

Published in IEEE signal processing letters (01-09-2013)
“…In this letter, we propose a discriminative modeling approach for the speaker verification problem that uses polynomial kernel support vector machines…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
10
Feature normalization for speaker verification in room reverberation by Ganapathy, Sriram, Pelecanos, Jason, Omar, Mohamed Kamal

Published in 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2011)
“…The performance of a typical speaker verification system degrades significantly in reverberant environments. This degradation is partly due to the conventional…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
11
A novel approach to detecting non-native speakers and their native language by Omar, Mohamed Kamal, Pelecanos, Jason

Published in 2010 IEEE International Conference on Acoustics, Speech and Signal Processing (01-03-2010)
“…Speech contains valuable information regarding the traits of speakers. This paper investigates two aspects of this information. The first is automatic…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
12
Speaker diarization: A perspective on challenges and opportunities from theory to practice by Church, Kenneth, Weizhong Zhu, Vopicka, Josef, Pelecanos, Jason, Dimitriadis, Dimitrios, Fousek, Petr

Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)
“…This paper discusses some challenges and opportunities in developing a speaker diarization system for operation on real world call center telephony data. We…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
13
Unifying PLDA and polynomial kernel SVMS by Yaman, Sibel, Pelecanos, Jason, Weizhong Zhu

Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01-05-2013)
“…Probabilistic linear discriminant analysis (PLDA) is a generative model to explain between and within class variations. When the underlying latent variables…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
14
Synth2Aug: Cross-Domain Speaker Recognition with TTS Synthesized Speech by Huang, Yiling, Chen, Yutian, Pelecanos, Jason, Wang, Quan

Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19-01-2021)
“…In recent years, Text-To-Speech (TTS) has been used as a data augmentation technique for speech recognition to help complement inadequacies in the training…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
15
Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition by Pelecanos, Jason, Wang, Quan, Moreno, Ignacio Lopez

Published 05-04-2021
“…Many neural network speaker recognition systems model each speaker using a fixed-dimensional embedding vector. These embeddings are generally compared using…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
16
Parameter-Free Attentive Scoring for Speaker Verification by Pelecanos, Jason, Wang, Quan, Huang, Yiling, Moreno, Ignacio Lopez

Published 10-03-2022
“…This paper presents a novel study of parameter-free attentive scoring for speaker verification. Parameter-free scoring provides the flexibility of comparing…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
17
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models by Zhao, Guanlong, Wang, Yongqiang, Pelecanos, Jason, Zhang, Yu, Liao, Hank, Huang, Yiling, Lu, Han, Wang, Quan

Published 14-09-2023
“…We introduce a multilingual speaker change detection model (USM-SCD) that can simultaneously detect speaker turns and perform ASR for 96 languages. This model…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
18
SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System by Chojnacka, Roza, Pelecanos, Jason, Wang, Quan, Moreno, Ignacio Lopez

Published 05-04-2021
“…In this paper, we describe SpeakerStew - a hybrid system to perform speaker verification on 46 languages. Two core ideas were explored in this system: (1)…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
19
Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech by Huang, Yiling, Chen, Yutian, Pelecanos, Jason, Wang, Quan

Published 23-11-2020
“…In recent years, Text-To-Speech (TTS) has been used as a data augmentation technique for speech recognition to help complement inadequacies in the training…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
20
Keyword-conditioned phone N-gram modeling with contextual information for speaker verification by Han, K. J., Pelecanos, J., Omar, M. K.

Published in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2012)
“…In this paper we present our current work on automatic speaker recognition using keyword-conditioned phone N-gram modeling. We propose the use of contextual…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:

Search Results - "Pelecanos, Jason"

Speaker age estimation on conversational telephone speech using senone posterior based i-vectors by Sadjadi, Seyed Omid, Ganapathy, Sriram, Pelecanos, Jason W.

Nearest neighbor discriminant analysis for language recognition by Sadjadi, Seyed Omid, Pelecanos, Jason W., Ganapathy, Sriram

Nearest neighbor based i-vector normalization for robust speaker recognition under unseen channel conditions by Weizhong Zhu, Sadjadi, Seyed Omid, Pelecanos, Jason W.

Unsupervised channel adaptation for language identification using co-training by Ganapathy, Sriram, Omar, Mohamed, Pelecanos, Jason

Enhancing Frequency Shifted Speech Signals in Single Side-Band Communication by Ganapathy, Sriram, Pelecanos, Jason

A Bayesian Attention Neural Network Layer for Speaker Recognition by Zhu, Weizhong, Pelecanos, Jason

Online speaker diarization using adapted i-vector transforms by Weizhong Zhu, Pelecanos, Jason

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models by Zhao, Guanlong, Wang, Yongqiang, Pelecanos, Jason, Zhang, Yu, Liao, Hank, Huang, Yiling, Lu, Han, Wang, Quan

Using Polynomial Kernel Support Vector Machines for Speaker Verification by Yaman, S., Pelecanos, J.

Feature normalization for speaker verification in room reverberation by Ganapathy, Sriram, Pelecanos, Jason, Omar, Mohamed Kamal

A novel approach to detecting non-native speakers and their native language by Omar, Mohamed Kamal, Pelecanos, Jason

Speaker diarization: A perspective on challenges and opportunities from theory to practice by Church, Kenneth, Weizhong Zhu, Vopicka, Josef, Pelecanos, Jason, Dimitriadis, Dimitrios, Fousek, Petr

Unifying PLDA and polynomial kernel SVMS by Yaman, Sibel, Pelecanos, Jason, Weizhong Zhu

Synth2Aug: Cross-Domain Speaker Recognition with TTS Synthesized Speech by Huang, Yiling, Chen, Yutian, Pelecanos, Jason, Wang, Quan

Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition by Pelecanos, Jason, Wang, Quan, Moreno, Ignacio Lopez

Parameter-Free Attentive Scoring for Speaker Verification by Pelecanos, Jason, Wang, Quan, Huang, Yiling, Moreno, Ignacio Lopez

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models by Zhao, Guanlong, Wang, Yongqiang, Pelecanos, Jason, Zhang, Yu, Liao, Hank, Huang, Yiling, Lu, Han, Wang, Quan

SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System by Chojnacka, Roza, Pelecanos, Jason, Wang, Quan, Moreno, Ignacio Lopez

Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech by Huang, Yiling, Chen, Yutian, Pelecanos, Jason, Wang, Quan

Keyword-conditioned phone N-gram modeling with contextual information for speaker verification by Han, K. J., Pelecanos, J., Omar, M. K.

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication