15th International Congress of Phonetic Sciences (ICPhS-15)

Barcelona, Spain
August 3-9, 2003


Automatic Modelling of Rhythm and Intonation for Language Identification

Jean-Luc Rouas (1), Jerome Farinas (1), François Pellegrino (2)

(1) IRIT-CNRS, France
(2) DDL-CNRS/Université Lumière Lyon 2, France

This paper deals with an approach to Automatic Language Identification using only prosodic modeling. The traditional approach for language identification focuses mainly on phonotactics because it gives the best results. Recent studies reveal that humans use different levels of perception to identify a language, in particular prosodic cues.
   Among prosodic features, rhythm is known to carry a substantial information about language identity. Rhythm is produced by the periodicity of a pattern that can be syllable, which is a language specific unit. That is why we introduced the notion of "Pseudo-Syllable", derived from the most frequent syllable structure in the world, the Consonant-Vowel structure. In this paper, an automatic and language independent rhythmic units extraction algorithm is described: using a vowel detection algorithm, rhythmic units matching the Consonant-Vowel structure are segmented.
   Two models describing rhythm and intonation of each language are then learned using Gaussian Mixtures.

Full Paper

Bibliographic reference.  Rouas, Jean-Luc / Farinas, Jerome / Pellegrino, François (2003): "Automatic modelling of rhythm and intonation for language identification", In ICPhS-15, 567-570.