Improvement of phone recognition accuracy using source and system features

The goal of this work is to improve phone recognition accuracy using combination of source and system features. As speech is produced by exciting time varying vocal tract system with time varying excitation, we want to explore both source and system components of speech production system for phone r...

Full description

Saved in:
Bibliographic Details
Published in:2015 International Conference on Signal Processing and Communication Engineering Systems pp. 501 - 505
Main Authors: Manjunath, K. E., Rao, K. Sreenivasa, Reddy, M. Gurunath
Format: Conference Proceeding
Language:English
Published: IEEE 01-01-2015
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract The goal of this work is to improve phone recognition accuracy using combination of source and system features. As speech is produced by exciting time varying vocal tract system with time varying excitation, we want to explore both source and system components of speech production system for phone recognition. The excitation source information is derived by processing linear prediction residual of speech signal. Mel-frequency cepstral coefficient features are used for capturing vocal tract information. The Phone Recognition Systems (PRSs) are developed using hidden Markov models. The proposed PRSs are developed for English and an Indian language Bengali using TEVIIT and Phonetic, Prosodically Rich Transcribed speech corpora, respectively. We have also developed tandem PRSs using the phone posteriors obtained from feedforward neural networks. The tandem PRSs developed using combination of excitation source and system features, outperform the conventional tandem systems developed using system features alone.
AbstractList The goal of this work is to improve phone recognition accuracy using combination of source and system features. As speech is produced by exciting time varying vocal tract system with time varying excitation, we want to explore both source and system components of speech production system for phone recognition. The excitation source information is derived by processing linear prediction residual of speech signal. Mel-frequency cepstral coefficient features are used for capturing vocal tract information. The Phone Recognition Systems (PRSs) are developed using hidden Markov models. The proposed PRSs are developed for English and an Indian language Bengali using TEVIIT and Phonetic, Prosodically Rich Transcribed speech corpora, respectively. We have also developed tandem PRSs using the phone posteriors obtained from feedforward neural networks. The tandem PRSs developed using combination of excitation source and system features, outperform the conventional tandem systems developed using system features alone.
Author Rao, K. Sreenivasa
Reddy, M. Gurunath
Manjunath, K. E.
Author_xml – sequence: 1
  givenname: K. E.
  surname: Manjunath
  fullname: Manjunath, K. E.
  email: ke.manjunath@gmail.com
  organization: Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
– sequence: 2
  givenname: K. Sreenivasa
  surname: Rao
  fullname: Rao, K. Sreenivasa
  email: ksrao@iitkgp.ac.in
  organization: Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
– sequence: 3
  givenname: M. Gurunath
  surname: Reddy
  fullname: Reddy, M. Gurunath
  email: mgurunathreddy@gmail.com
  organization: Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
BookMark eNotj9FKwzAUQCMo6Oa-YC_5gdabNGmax1GmTgYK0-cRk5sZsWlJWqF_78A9nYcDB86CXMc-IiFrBiVjoB8Ob5t2eyg5MFkqkA0HeUUWTCit67MXt2SV8zcAMC0UiOaOvOy6IfW_2GEcae_p8HUu0oS2P8Uwhj5SY-2UjJ3plEM80dxPySI10dE85xE76tGMU8J8T268-cm4unBJPh637-1zsX992rWbfREYr8bCKo9OKGas0ozXRjSSSahN473TDpiohVW1-DQNCum1Ae408MpDxZ1q0FZLsv7vBkQ8Dil0Js3Hy271B0MCTkI
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/SPACES.2015.7058205
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library Online
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE
  url: http://ieeexplore.ieee.org/Xplore/DynWel.jsp
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISBN 1479961094
9781479961092
EndPage 505
ExternalDocumentID 7058205
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
IEGSK
IERZE
OCL
RIE
RIL
ID FETCH-LOGICAL-i123t-c7fed471ac79126a4851506a8ffd9d01464c764ba8e45f9a02d9023f032d78ec3
IEDL.DBID RIE
IngestDate Thu Jun 29 18:38:07 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i123t-c7fed471ac79126a4851506a8ffd9d01464c764ba8e45f9a02d9023f032d78ec3
PageCount 5
ParticipantIDs ieee_primary_7058205
PublicationCentury 2000
PublicationDate 20150101
PublicationDateYYYYMMDD 2015-01-01
PublicationDate_xml – month: 01
  year: 2015
  text: 20150101
  day: 01
PublicationDecade 2010
PublicationTitle 2015 International Conference on Signal Processing and Communication Engineering Systems
PublicationTitleAbbrev SPACES
PublicationYear 2015
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0001947048
Score 1.6218963
Snippet The goal of this work is to improve phone recognition accuracy using combination of source and system features. As speech is produced by exciting time varying...
SourceID ieee
SourceType Publisher
StartPage 501
SubjectTerms Accuracy
excitation source features
Hidden Markov models
Mel frequency cepstral coefficient
phone posteriors
phone recognition system
RMFCCs
Speech
Speech processing
Speech recognition
tandem systems
Training
Title Improvement of phone recognition accuracy using source and system features
URI https://ieeexplore.ieee.org/document/7058205
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NSwMxEB1sT3rxoxW_ycGjabe7yWZzlNpSPEihCt5KkpmIl63U7sF_b7K7tApevIVASJiEvPDy5g3AbZJrg1ZI7oRBHm4_y60ny4Uji1IRWhkThWcL9fRaPEyiTc7dNheGiGrxGQ1is_7Lx5WrIlU2VIkMgCU70FG6aHK1dnyKFiqcxtZYaJTo4WJ-P54sonpLDtqRv0qo1AgyPfzf3EfQ36XisfkWZI5hj8oTOPjhItiDx4YYqHk-tvIsqs2JbZVBq5IZ56q1cV8sqtzfWEPYM1Mia4ycmafa3_OzDy_TyfN4xtsSCfw9QM6GO-UJQ4SNU3qU5kaEB5RMclN4jxqjMYxwKhfWFCSk1yZJUQeU9kmWoirIZafQLcOizoC53OEosZiaaIKFxha5y2J5dGUzK7U9h16MyvKjccFYtgG5-Lv7EvZj4Buy4gq6m3VF19D5xOqm3rdvxgWbNg
link.rule.ids 310,311,782,786,791,792,798,27934,54768
linkProvider IEEE
linkToHtml http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NTwIxEJ0IHtSLH2D8tgePLiy77XZ7NAhBRUICJt5I22mJl8UAe_Df2-5uQBMv3pomTZtp09e8vnkDcBcmQqKiLNBUYuBuPxUoa1RAtVHIuEHFfKLwYMJH7-ljz9vk3G9yYYwxhfjMtHyz-MvHhc49VdbmIXOAxWqwy6h7JpfZWltGRVDuzmNlLdQJRXsyfuj2Jl6_xVrV2F9FVAoM6R_-b_YjaG6T8ch4AzPHsGOyEzj44SPYgOeSGiiYPrKwxOvNDdlogxYZkVrnS6m_iNe5z0lJ2ROZISmtnIk1hcPnqglv_d60OwiqIgnBhwOddaC5NehiLDUXnSiR1D2hWJjI1FoU6K1hqOYJVTI1lFkhwwiFw2kbxhHy1Oj4FOqZW9QZEJ1o7IQKI-ltsFCqNNGxL5DOVayYUOfQ8FGZfZY-GLMqIBd_d9_C3mD6OpwNn0Yvl7DvN6GkLq6gvl7m5hpqK8xvij38BiRNnog
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2015+International+Conference+on+Signal+Processing+and+Communication+Engineering+Systems&rft.atitle=Improvement+of+phone+recognition+accuracy+using+source+and+system+features&rft.au=Manjunath%2C+K.+E.&rft.au=Rao%2C+K.+Sreenivasa&rft.au=Reddy%2C+M.+Gurunath&rft.date=2015-01-01&rft.pub=IEEE&rft.spage=501&rft.epage=505&rft_id=info:doi/10.1109%2FSPACES.2015.7058205&rft.externalDocID=7058205