Estimation of place of articulation of fricatives from spectral features

Bibliographic Details
Published in: International Journal of Speech Technology, Vol. 26, no. 4, pp. 1061-1078
Main Authors: Nataraj, K. S.; Pandey, Prem C.; Dasgupta, Hirak
Format: Journal Article
Language: English
Published: New York Springer US 01-12-2023
Springer Nature B.V
Description
Summary: An investigation is carried out for speaker-independent acoustic-to-articulatory mapping for fricative utterances using simultaneously acquired speech signals and articulatory data. The relation of the place of articulation with the spectral characteristics is examined using several earlier reported spectral features and six proposed spectral features (maximum-sum segment centroid, normalized sum of absolute spectral slopes, and four spectral energy features). A method is presented for estimating the place of articulation using a feedforward neural network. It is evaluated using a dataset comprising utterances with a mix of phonetic contexts and from multiple speakers, five-fold cross-validation, and networks with different hidden layers and neurons. The six proposed spectral features used as the input feature set resulted in the lowest estimation error and low sensitivity to the training data size. Estimation using this feature set with an optimal network provided a correlation coefficient of 0.978 and an RMS error of 2.54 mm. The errors were smaller than the differences between the adjacent places, indicating that the method may be helpful in providing visual feedback of articulatory efforts in speech training aids.
ISSN: 1381-2416, 1572-8110
DOI: 10.1007/s10772-023-10076-3
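
The summary reports two evaluation figures, a correlation coefficient and an RMS error in mm, between estimated and reference places of articulation. As a minimal sketch of how such figures are commonly computed, the Python below assumes the correlation is the Pearson coefficient; the record does not say so. The function names and the sample values are hypothetical and are not taken from the article or its dataset.

```python
import math


def rms_error(estimated, reference):
    """Root-mean-square error between two equal-length sequences (same unit, e.g. mm)."""
    if len(estimated) != len(reference) or len(estimated) == 0:
        raise ValueError("sequences must be non-empty and of equal length")
    squared = [(e - r) ** 2 for e, r in zip(estimated, reference)]
    return math.sqrt(sum(squared) / len(squared))


def correlation_coefficient(estimated, reference):
    """Pearson correlation coefficient (assumed definition, not confirmed by the record)."""
    n = len(estimated)
    if n != len(reference) or n < 2:
        raise ValueError("need two equal-length sequences with at least two values")
    mean_e = sum(estimated) / n
    mean_r = sum(reference) / n
    cov = sum((e - mean_e) * (r - mean_r) for e, r in zip(estimated, reference))
    var_e = sum((e - mean_e) ** 2 for e in estimated)
    var_r = sum((r - mean_r) ** 2 for r in reference)
    if var_e == 0 or var_r == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_e * var_r)


# Hypothetical place-of-articulation values in mm, for illustration only.
reference_mm = [10.0, 20.0, 30.0, 40.0]
estimated_mm = [12.0, 18.0, 33.0, 39.0]

print("RMS error (mm):", rms_error(estimated_mm, reference_mm))
print("Correlation:", correlation_coefficient(estimated_mm, reference_mm))
```

In a five-fold cross-validation such as the one the summary mentions, these two measures would typically be computed on each held-out fold and then pooled or averaged. How the article aggregates them is not stated in this record.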