14th International Congress of Phonetic Sciences (ICPhS-14)

San Francisco, CA, USA
August 1-7, 1999


Experiences from Building Two Large Telephone Speech Databases for Swedish

Kjell Elenius

Department of Speech, Music and Hearing, KTH, Stockholm, Sweden

The objective of the EU-funded SpeechDat project was to create large-scale speech databases for voice-driven teleservices. The paper deals with the design of two such Swedish resources: 5000 speakers over the fixed telephone network, and 1000 over the mobile network. It also reports on experiences from speaker recruitment and presents statistics on speaker distribution. Results regarding orthographic labelling of pronunciation, pronunciation errors and non-speech events are also included.

Full Paper

Bibliographic reference.  Elenius, Kjell (1999): "Experiences from building two large telephone speech databases for Swedish", In ICPhS-14, 1741-1744.