Comparison of the automatic speaker recognition performance over standard features
This paper presents a study of speaker recognition accuracy depending on the choice of features, window width and model complexity. The standard features were considered, such as linear and perceptual prediction coefficients (LPC and PLP) and mel-frequency cepstral coefficients (MFCC). Gaussian mixt...
Saved in:
Published in: | 2012 IEEE 10th Jubilee International Symposium on Intelligent Systems and Informatics pp. 341 - 344 |
---|---|
Main Authors: | , , , |
Format: | Conference Proceeding |
Language: | English |
Published: |
IEEE
01-09-2012
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | This paper presents a study of speaker recognition accuracy depending on the choice of features, window width and model complexity. The standard features were considered, such as linear and perceptual prediction coefficients (LPC and PLP) and mel-frequency cepstral coefficients (MFCC). Gaussian mixture model (GMM), with the use of HTK tools, was chosen for speaker modelling. Speech database S70W100s120, recorded at the Electrical Engineering Department of Belgrade University, was used for purposes of system training and testing. Ten speaker models and the universal background model (UBM) were trained. |
---|---|
ISBN: | 1467347515 9781467347518 |
ISSN: | 1949-047X 1949-0488 |
DOI: | 10.1109/SISY.2012.6339541 |