15th International Congress of Phonetic Sciences (ICPhS-15)
This study explores a multilingual phonotactic approach to automatic
language identification using Broadcast News data. The definition of
a multilingual phoneset is discussed and an upper limit on the performance
of the phonotactic approach is estimated by eliminating any degradation
due to recognition errors. This upper bound is compared to automatic
language identification based on a phonotactic approach. The eight
languages of interest are: Arabic, Mandarin, English, French, German,
Italian, Portuguese and Spanish. A perceptual test has been carried
out to compare human and machine performance in similar configurations.
Different phoneset classes have been experimented with, ranging from a binary C/V distinction to a shared phone set of 70 phones. Experiments show that phonotactic constraints are in theory able to identify a language (among 8) with close to 100% on very short sequences of 1-2 seconds. Automatic and human performances on very short sequences both remain below the theoretical performances.
Bibliographic reference. Adda-Decker, Martine / Antoine, Fabien / Boula de Mareüil, Philippe / Vasilescu, Ioana / Lamel, Lori / Vaissière, Jacqueline / Geoffrois, Edouard / Liénard, Jean-Sylvain (2003): "Phonetic knowledge, phonotactics and perceptual validation for automatic language identification", In ICPhS-15, 747-750.