OrienTel: Turkish telephone speech database
Published in: | Proceedings of the IEEE 12th Signal Processing and Communications Applications Conference, 2004, pp. 280-283 |
---|---|
Format: | Conference Proceeding |
Language: | English, Turkish |
Published: | IEEE, 2004 |
Summary: | This paper describes a Turkish telephone speech database created within the framework of OrienTel (IST-2000-28373), a 5th Framework project. OrienTel aims to collect telephone speech data for 21 languages. The Turkish database was completed in 16 months. The work comprises the recording, annotation and documentation of 1700 recording sessions. The speaker distribution is balanced with respect to criteria such as age, sex, dialect, calling environment and network. The database contains digits, numbers, times, dates, words and sentences. It is the first Turkish speech database of this size, and the first to be prepared and validated in such a detailed, systematic manner. |
ISBN: | 0780383184, 9780780383180 |
DOI: | 10.1109/SIU.2004.1338314 |