Query-by-example spoken term detection using phonetic posteriorgram templates

Bibliographic Details
Published in: 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, pp. 421-426
Main Authors: Hazen, T.J., Shen, W., White, C.
Format: Conference Proceeding
Language: English
Published: IEEE 01-12-2009
Description
Summary: This paper examines a query-by-example approach to spoken term detection in audio files. The approach is designed for low-resource situations in which limited or no in-domain training material is available and accurate word-based speech recognition capability is unavailable. Instead of using word or phone strings as search terms, the user presents the system with audio snippets of desired search terms to act as the queries. Query and test materials are represented using phonetic posteriorgrams obtained from a phonetic recognition system. Query matches in the test data are located using a modified dynamic time warping search between query templates and test utterances. Experiments using this approach are presented using data from the Fisher corpus.
ISBN: 1424454786, 9781424454785
DOI: 10.1109/ASRU.2009.5372889
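
Illustration: The matching step named in the summary (a dynamic time warping search between a query posteriorgram and a test utterance posteriorgram) can be sketched roughly as follows. This is a plain subsequence DTW, not the authors' modified variant, whose details this record does not give. The frame distance (negative log of the inner product of two posterior vectors), the allowed path steps, and the function names `frame_distance` and `subsequence_dtw` are all assumptions made for illustration. A posteriorgram is represented here as a list of frames, each frame a list of phone posterior probabilities.

```python
import math


def frame_distance(p, q, eps=1e-10):
    """Distance between two posterior vectors (assumed: -log of their inner product)."""
    dot = sum(a * b for a, b in zip(p, q))
    return -math.log(max(dot, eps))  # eps avoids log(0) for disjoint posteriors


def subsequence_dtw(query, test):
    """Find the test segment that best aligns with the whole query.

    query, test: posteriorgrams as lists of frames (lists of posteriors).
    Returns (accumulated_cost, start_frame, end_frame) of the best match in test.
    The cost is not normalized by path length; a real system would likely normalize.
    """
    n_q, n_t = len(query), len(test)
    dist = [[frame_distance(q, t) for t in test] for q in query]
    cost = [[math.inf] * n_t for _ in range(n_q)]
    start = [[0] * n_t for _ in range(n_q)]

    # The first query frame may align with any test frame at no entry penalty,
    # so a match can begin anywhere in the test utterance.
    for j in range(n_t):
        cost[0][j] = dist[0][j]
        start[0][j] = j

    for i in range(1, n_q):
        # In the first test column only a vertical step is possible.
        cost[i][0] = cost[i - 1][0] + dist[i][0]
        start[i][0] = start[i - 1][0]
        for j in range(1, n_t):
            # Assumed step set: diagonal, vertical, horizontal (diagonal wins ties).
            steps = ((i - 1, j - 1), (i - 1, j), (i, j - 1))
            pi, pj = min(steps, key=lambda ij: cost[ij[0]][ij[1]])
            cost[i][j] = cost[pi][pj] + dist[i][j]
            start[i][j] = start[pi][pj]  # remember where this path began

    # The match may end at any test frame; pick the cheapest end point.
    end = min(range(n_t), key=lambda j: cost[-1][j])
    return cost[-1][end], start[-1][end], end
```

Carrying the start frame along each path avoids a separate backtracking pass when only the segment boundaries are needed. In practice one would probably rank several low-cost end points rather than return a single best match, and apply a detection threshold to the length-normalized score.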