Acoustic reduction in conversational Dutch: A quantitative analysis based on automatically generated segmental transcriptions

In spontaneous, conversational speech, words are often reduced compared to their citation forms, such that a word like yesterday may sound like [ ’ j ε ʃ e i ] . The present chapter investigates such acoustic reduction. The study of reduction needs large corpora that are transcribed phonetically. Th...

Full description

Saved in:
Bibliographic Details
Published in:Journal of phonetics Vol. 39; no. 1; pp. 96 - 109
Main Authors: Schuppler, Barbara, Ernestus, Mirjam, Scharenborg, Odette, Boves, Lou
Format: Journal Article
Language:English
Published: Elsevier Ltd 2011
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In spontaneous, conversational speech, words are often reduced compared to their citation forms, such that a word like yesterday may sound like [ ’ j ε ʃ e i ] . The present chapter investigates such acoustic reduction. The study of reduction needs large corpora that are transcribed phonetically. The first part of this chapter describes an automatic transcription procedure used to obtain such a large phonetically transcribed corpus of Dutch spontaneous dialogues, which is subsequently used for the investigation of acoustic reduction. First, the orthographic transcriptions were adapted for automatic processing. Next, the phonetic transcription of the corpus was created by means of a forced alignment using a lexicon with multiple pronunciation variants per word. These variants were generated by applying phonological and reduction rules to the canonical phonetic transcriptions of the words. The second part of this chapter reports the results of a quantitative analysis of reduction in the corpus on the basis of the generated transcriptions and gives an inventory of segmental reductions in standard Dutch. Overall, we found that reduction is more pervasive in spontaneous Dutch than previously documented. ►A Dutch spontaneous speech corpus was automatically transcribed with an ASR system. ►By applying 31 reduction rules to canonical pronunciations, variants were created. ►The transcriptions reached a high agreement with manual transcriptions (85.9%). ►40.3% of the word tokens were reduced, 9.1% of all syllabic nuclei were absent. ►We analyze the impact of segmental context on vowel and consonant reductions.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0095-4470
1095-8576
DOI:10.1016/j.wocn.2010.11.006