PDF optimized parametric vector quantization of speech line spectral frequencies

A computationally efficient, high quality, vector quantization scheme based on a parametric probability density function (PDF) is developed for encoding speech line spectral frequencies (LSF). For this purpose, speech LSFs are modeled as i.i.d realizations of a multivariate normal mixture density. T...

Full description

Saved in:
Bibliographic Details
Published in:2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421) pp. 87 - 89
Main Authors: Subramaniam, A.D., Rao, B.D.
Format: Conference Proceeding
Language:English
Published: IEEE 2000
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:A computationally efficient, high quality, vector quantization scheme based on a parametric probability density function (PDF) is developed for encoding speech line spectral frequencies (LSF). For this purpose, speech LSFs are modeled as i.i.d realizations of a multivariate normal mixture density. The mixture model parameters are efficiently estimated from the training data using the expectation maximization (EM) algorithm. The estimated density is suitably quantized using transform coding and bit-allocation techniques for both fixed rate and variable rate systems. Source encoding using the resultant codebook involves no searches and its computational complexity is minimal and independent of the rate of the system. Experimental results show that the proposed scheme provides 2-3 bits gain over conventional MSVQ schemes. The proposed memoryless quantizer is enhanced to form a quantizer with memory. The quantizer with memory provides transparent quality speech at 20 bits/frame.
ISBN:9780780364165
0780364163
DOI:10.1109/SCFT.2000.878407