15th International Congress of Phonetic Sciences (ICPhS-15)Barcelona, Spain |
In this paper, we describe the modeling of the F0 curve of a female
voice in several Restricted-Domains and in a general-domain, aimed
at developing a Speech Synthesis System for Spanish. For modeling F0,
we have used Multi-Layer Perceptrons, based on our previous experience
with a male voice.
For isolated speech and for continuous speech,
the use of specialized MLPs is always preferred. The main difference
between restricted and non-restricted domains is the relevance of the
number of the recording carrier sentence for predicting F0 in a restricted
domain. The most relevant predicting parameters are stress, the position
in the intonation group and the type of the group.
Bibliographic reference. Montero, J. M. / D'Haro, L. F. / Córdoba, R. / Vallejo, J. A. / Gutiérrez-Arriola, J. / Pardo, J. M. (2003): "ANN F0 modeling for female-voice synthesis in Spanish: restricted and non-restricted domains", In ICPhS-15, 563-566.