Improvement of phone recognition accuracy using source and system features
The goal of this work is to improve phone recognition accuracy using combination of source and system features. As speech is produced by exciting time varying vocal tract system with time varying excitation, we want to explore both source and system components of speech production system for phone r...
Saved in:
Published in: | 2015 International Conference on Signal Processing and Communication Engineering Systems pp. 501 - 505 |
---|---|
Main Authors: | , , |
Format: | Conference Proceeding |
Language: | English |
Published: |
IEEE
01-01-2015
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The goal of this work is to improve phone recognition accuracy using combination of source and system features. As speech is produced by exciting time varying vocal tract system with time varying excitation, we want to explore both source and system components of speech production system for phone recognition. The excitation source information is derived by processing linear prediction residual of speech signal. Mel-frequency cepstral coefficient features are used for capturing vocal tract information. The Phone Recognition Systems (PRSs) are developed using hidden Markov models. The proposed PRSs are developed for English and an Indian language Bengali using TEVIIT and Phonetic, Prosodically Rich Transcribed speech corpora, respectively. We have also developed tandem PRSs using the phone posteriors obtained from feedforward neural networks. The tandem PRSs developed using combination of excitation source and system features, outperform the conventional tandem systems developed using system features alone. |
---|---|
DOI: | 10.1109/SPACES.2015.7058205 |