14th International Congress of Phonetic Sciences (ICPhS-14)
San Francisco, CA, USA
This paper uses quantitative data to investigate the relationship between a speaker's voice and the information about the speaker's physical appearance that the voice carries.
Speech samples from 8 males and 8 females were recorded. Two pictures were taken of each speaker: one showing the whole body against a reference background, so that physical characteristics could be easily compared, and the other showing only the face.
Subjects were then asked to match the voices with the corresponding pictures. The results showed that, with the whole-body pictures, 6 females and 4 males were correctly identified, while with the face-only pictures, 3 females and 2 males were correctly matched. However, even in cases of mismatch, there was a strong tendency for subjects to agree on which voice should correspond to which picture.
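For readers who want the counts above as proportions, the arithmetic can be sketched as follows. This is a minimal illustration, not part of the original study: the counts are taken from the abstract, while the names `correct` and `match_rates` are hypothetical, and the sketch assumes that "correctly identified" counts speakers out of the 8 recorded per gender.

```python
from fractions import Fraction

# Number of speakers recorded per gender (from the abstract).
SPEAKERS_PER_GENDER = 8

# Correctly matched speakers per picture condition (from the abstract).
correct = {
    "whole-body": {"female": 6, "male": 4},
    "face-only": {"female": 3, "male": 2},
}

def match_rates(counts, n_per_gender=SPEAKERS_PER_GENDER):
    """Return the per-gender and overall proportion of correctly matched speakers."""
    rates = {gender: Fraction(k, n_per_gender) for gender, k in counts.items()}
    # Overall rate pools both genders: total correct over total speakers.
    rates["overall"] = Fraction(sum(counts.values()), n_per_gender * len(counts))
    return rates

for condition, counts in correct.items():
    rates = match_rates(counts)
    summary = ", ".join(f"{name}: {float(rate):.1%}" for name, rate in rates.items())
    print(f"{condition}: {summary}")
```

Under that assumption, 10 of the 16 speakers were matched correctly from whole-body pictures and 5 of 16 from face-only pictures. The abstract does not report the number of listeners or the exact matching procedure, so these proportions should not be read as per-listener accuracy.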
It seems that voice does give the listener information about the speaker's physical characteristics, with greater accuracy than we expected.
Bibliographic reference. Moon, Seung-Jae (1999): "What you hear is what you see?: a progress report", In ICPhS-14, 1241-1244.