14th International Congress of Phonetic Sciences (ICPhS-14)
San Francisco, CA, USA
This paper explores the nature of the linguistic information present in the temporal structure of speech. We derive TempoRAl Patterns (TRAPs), which describe the temporal evolution of different phonemes in conversational speech. The temporal spread of the TRAPs in the regions preceding and following the occurrence of a phoneme suggests that information about the phoneme is available over a rather long time span around it. We propose an approach that uses TRAPs for phonetic feature extraction in speech recognition instead of the conventional spectral-based features. The resulting temporal-based automatic speech recognition (ASR) system yields recognition performance comparable to that of the conventional spectral-based ASR system, and it is inherently robust to certain types of noise.
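As an illustration of what a temporal pattern for one phoneme might look like, the following minimal sketch averages fixed-length windows of a single band's energy trajectory centered on the frames where the phoneme occurs. It is not the authors' implementation: the function names, the window half-width, the per-window mean removal, and the choice to skip windows that run past the ends of the trajectory are all assumptions made for this example, since the abstract does not specify them.

```python
from statistics import fmean


def extract_window(trajectory, center, half_width):
    """Return the mean-removed window of 2*half_width + 1 frames around `center`.

    Returns None when the window would run past either end of the trajectory
    (assumption: incomplete windows are skipped rather than padded).
    """
    start, stop = center - half_width, center + half_width + 1
    if start < 0 or stop > len(trajectory):
        return None
    window = trajectory[start:stop]
    mean = fmean(window)
    # Assumption: subtract the window mean so only the temporal shape remains.
    return [x - mean for x in window]


def mean_temporal_pattern(trajectory, centers, half_width):
    """Average the mean-removed windows over all occurrences of one phoneme.

    `trajectory` is a hypothetical per-frame energy sequence for one frequency
    band; `centers` are the frame indices labeled with the phoneme of interest.
    """
    windows = []
    for center in centers:
        window = extract_window(trajectory, center, half_width)
        if window is not None:
            windows.append(window)
    if not windows:
        raise ValueError("no complete windows available for this phoneme")
    # Average frame by frame across occurrences.
    return [fmean(column) for column in zip(*windows)]


if __name__ == "__main__":
    band_energy = [float(i) for i in range(10)]  # toy trajectory
    print(mean_temporal_pattern(band_energy, centers=[2, 5, 9], half_width=2))
```

In this sketch, a longer `half_width` corresponds to examining a longer time span around the phoneme, which is the property the abstract highlights.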
Bibliographic reference. Sharma, Sangita / Hermansky, Hynek (1999): "Speech recognition from temporal patterns", In ICPhS-14, 1661-1664.