Improvement of phone recognition accuracy using source and system features
The goal of this work is to improve phone recognition accuracy using combination of source and system features. As speech is produced by exciting time varying vocal tract system with time varying excitation, we want to explore both source and system components of speech production system for phone r...
Saved in:
Published in: | 2015 International Conference on Signal Processing and Communication Engineering Systems pp. 501 - 505 |
---|---|
Main Authors: | , , |
Format: | Conference Proceeding |
Language: | English |
Published: |
IEEE
01-01-2015
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Abstract | The goal of this work is to improve phone recognition accuracy using combination of source and system features. As speech is produced by exciting time varying vocal tract system with time varying excitation, we want to explore both source and system components of speech production system for phone recognition. The excitation source information is derived by processing linear prediction residual of speech signal. Mel-frequency cepstral coefficient features are used for capturing vocal tract information. The Phone Recognition Systems (PRSs) are developed using hidden Markov models. The proposed PRSs are developed for English and an Indian language Bengali using TEVIIT and Phonetic, Prosodically Rich Transcribed speech corpora, respectively. We have also developed tandem PRSs using the phone posteriors obtained from feedforward neural networks. The tandem PRSs developed using combination of excitation source and system features, outperform the conventional tandem systems developed using system features alone. |
---|---|
AbstractList | The goal of this work is to improve phone recognition accuracy using combination of source and system features. As speech is produced by exciting time varying vocal tract system with time varying excitation, we want to explore both source and system components of speech production system for phone recognition. The excitation source information is derived by processing linear prediction residual of speech signal. Mel-frequency cepstral coefficient features are used for capturing vocal tract information. The Phone Recognition Systems (PRSs) are developed using hidden Markov models. The proposed PRSs are developed for English and an Indian language Bengali using TEVIIT and Phonetic, Prosodically Rich Transcribed speech corpora, respectively. We have also developed tandem PRSs using the phone posteriors obtained from feedforward neural networks. The tandem PRSs developed using combination of excitation source and system features, outperform the conventional tandem systems developed using system features alone. |
Author | Rao, K. Sreenivasa Reddy, M. Gurunath Manjunath, K. E. |
Author_xml | – sequence: 1 givenname: K. E. surname: Manjunath fullname: Manjunath, K. E. email: ke.manjunath@gmail.com organization: Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India – sequence: 2 givenname: K. Sreenivasa surname: Rao fullname: Rao, K. Sreenivasa email: ksrao@iitkgp.ac.in organization: Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India – sequence: 3 givenname: M. Gurunath surname: Reddy fullname: Reddy, M. Gurunath email: mgurunathreddy@gmail.com organization: Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India |
BookMark | eNotj9FKwzAUQCMo6Oa-YC_5gdabNGmax1GmTgYK0-cRk5sZsWlJWqF_78A9nYcDB86CXMc-IiFrBiVjoB8Ob5t2eyg5MFkqkA0HeUUWTCit67MXt2SV8zcAMC0UiOaOvOy6IfW_2GEcae_p8HUu0oS2P8Uwhj5SY-2UjJ3plEM80dxPySI10dE85xE76tGMU8J8T268-cm4unBJPh637-1zsX992rWbfREYr8bCKo9OKGas0ozXRjSSSahN473TDpiohVW1-DQNCum1Ae408MpDxZ1q0FZLsv7vBkQ8Dil0Js3Hy271B0MCTkI |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/SPACES.2015.7058205 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library Online IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE url: http://ieeexplore.ieee.org/Xplore/DynWel.jsp sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering |
EISBN | 1479961094 9781479961092 |
EndPage | 505 |
ExternalDocumentID | 7058205 |
Genre | orig-research |
GroupedDBID | 6IE 6IF 6IK 6IL 6IN AAJGR ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK IEGSK IERZE OCL RIE RIL |
ID | FETCH-LOGICAL-i123t-c7fed471ac79126a4851506a8ffd9d01464c764ba8e45f9a02d9023f032d78ec3 |
IEDL.DBID | RIE |
IngestDate | Thu Jun 29 18:38:07 EDT 2023 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i123t-c7fed471ac79126a4851506a8ffd9d01464c764ba8e45f9a02d9023f032d78ec3 |
PageCount | 5 |
ParticipantIDs | ieee_primary_7058205 |
PublicationCentury | 2000 |
PublicationDate | 20150101 |
PublicationDateYYYYMMDD | 2015-01-01 |
PublicationDate_xml | – month: 01 year: 2015 text: 20150101 day: 01 |
PublicationDecade | 2010 |
PublicationTitle | 2015 International Conference on Signal Processing and Communication Engineering Systems |
PublicationTitleAbbrev | SPACES |
PublicationYear | 2015 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0001947048 |
Score | 1.6218963 |
Snippet | The goal of this work is to improve phone recognition accuracy using combination of source and system features. As speech is produced by exciting time varying... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 501 |
SubjectTerms | Accuracy excitation source features Hidden Markov models Mel frequency cepstral coefficient phone posteriors phone recognition system RMFCCs Speech Speech processing Speech recognition tandem systems Training |
Title | Improvement of phone recognition accuracy using source and system features |
URI | https://ieeexplore.ieee.org/document/7058205 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NSwMxEB1sT3rxoxW_ycGjabe7yWZzlNpSPEihCt5KkpmIl63U7sF_b7K7tApevIVASJiEvPDy5g3AbZJrg1ZI7oRBHm4_y60ny4Uji1IRWhkThWcL9fRaPEyiTc7dNheGiGrxGQ1is_7Lx5WrIlU2VIkMgCU70FG6aHK1dnyKFiqcxtZYaJTo4WJ-P54sonpLDtqRv0qo1AgyPfzf3EfQ36XisfkWZI5hj8oTOPjhItiDx4YYqHk-tvIsqs2JbZVBq5IZ56q1cV8sqtzfWEPYM1Mia4ycmafa3_OzDy_TyfN4xtsSCfw9QM6GO-UJQ4SNU3qU5kaEB5RMclN4jxqjMYxwKhfWFCSk1yZJUQeU9kmWoirIZafQLcOizoC53OEosZiaaIKFxha5y2J5dGUzK7U9h16MyvKjccFYtgG5-Lv7EvZj4Buy4gq6m3VF19D5xOqm3rdvxgWbNg |
link.rule.ids | 310,311,782,786,791,792,798,27934,54768 |
linkProvider | IEEE |
linkToHtml | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NTwIxEJ0IHtSLH2D8tgePLiy77XZ7NAhBRUICJt5I22mJl8UAe_Df2-5uQBMv3pomTZtp09e8vnkDcBcmQqKiLNBUYuBuPxUoa1RAtVHIuEHFfKLwYMJH7-ljz9vk3G9yYYwxhfjMtHyz-MvHhc49VdbmIXOAxWqwy6h7JpfZWltGRVDuzmNlLdQJRXsyfuj2Jl6_xVrV2F9FVAoM6R_-b_YjaG6T8ch4AzPHsGOyEzj44SPYgOeSGiiYPrKwxOvNDdlogxYZkVrnS6m_iNe5z0lJ2ROZISmtnIk1hcPnqglv_d60OwiqIgnBhwOddaC5NehiLDUXnSiR1D2hWJjI1FoU6K1hqOYJVTI1lFkhwwiFw2kbxhHy1Oj4FOqZW9QZEJ1o7IQKI-ltsFCqNNGxL5DOVayYUOfQ8FGZfZY-GLMqIBd_d9_C3mD6OpwNn0Yvl7DvN6GkLq6gvl7m5hpqK8xvij38BiRNnog |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2015+International+Conference+on+Signal+Processing+and+Communication+Engineering+Systems&rft.atitle=Improvement+of+phone+recognition+accuracy+using+source+and+system+features&rft.au=Manjunath%2C+K.+E.&rft.au=Rao%2C+K.+Sreenivasa&rft.au=Reddy%2C+M.+Gurunath&rft.date=2015-01-01&rft.pub=IEEE&rft.spage=501&rft.epage=505&rft_id=info:doi/10.1109%2FSPACES.2015.7058205&rft.externalDocID=7058205 |