OrienTel: Turkish telephone speech database

Bibliographic Details
Published in: Proceedings of the IEEE 12th Signal Processing and Communications Applications Conference, 2004, pp. 280-283
Main Authors: Ciloglu, T., Acar, D., Tokatli, A.
Format: Conference Proceeding
Language: English, Turkish
Published: IEEE 2004
Description
Summary: This paper describes a Turkish telephone speech database created within the framework of OrienTel (IST-2000-28373), a 5th Framework project. OrienTel aims to collect telephone speech data in 21 languages. The Turkish database was completed in 16 months. The work comprises the recording, annotation and documentation of 1700 recording sessions. The speaker distribution is balanced with respect to criteria such as age, sex, dialect, calling environment and network. The database contains digits, numbers, times, dates, words and sentences. It is the first Turkish speech database of its size, and the first to follow such a detailed, systematic procedure in its preparation and validation.
ISBN: 0780383184, 9780780383180
DOI: 10.1109/SIU.2004.1338314