Context-Independent Multilingual Emotion Recognition from Speech Signals

Cited by: 42
Authors
Vladimir Hozjan
Zdravko Kačič
Affiliations
University of Maribor, Faculty of Electrical Engineering and Computer Science
Keywords
emotions; speech; emotion recognition; cross language emotion recognition
DOI
10.1023/A:1023426522496
Abstract
This paper presents and discusses an analysis of multilingual emotion recognition from speech with database-specific emotional features. Recognition was performed on the English, Slovenian, Spanish, and French InterFace emotional speech databases. The InterFace databases include several neutral speaking styles and six emotions: disgust, surprise, joy, fear, anger, and sadness. Speech features for emotion recognition were determined in two steps. In the first step, low-level features were defined; in the second, high-level features were calculated from the low-level features. The low-level features comprise pitch, the derivative of pitch, energy, the derivative of energy, and the duration of speech segments. The high-level features are statistical representations of the low-level features. Database-specific emotional features were selected from the high-level features that carry the most information about emotions in speech. Speaker-dependent and monolingual emotion recognisers were defined, as well as multilingual recognisers. Emotion recognition was performed using artificial neural networks. The achieved recognition accuracy was highest for speaker-dependent emotion recognition, lower for monolingual emotion recognition, and lowest for multilingual recognition. The database-specific emotional features are the most convenient for use in multilingual emotion recognition. Among speaker-dependent, monolingual, and multilingual emotion recognition, the difference between recognition with all high-level features and recognition with database-specific emotional features is smallest for multilingual emotion recognition, at 3.84%.
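The abstract's two-step feature scheme (low-level prosodic contours first, then high-level statistics over them) can be illustrated with a short sketch. This is not the authors' code: the use of librosa, the specific statistics (mean, standard deviation, min, max, range), the pitch-range settings, and the file name "utterance.wav" are all illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch of the two-step feature scheme described in the abstract:
# low-level contours (pitch, energy, and their derivatives) are extracted
# first, then high-level features are computed as statistics over them.
# Library choice, statistics, and parameter values are assumptions.
import numpy as np
import librosa

def low_level_features(wav_path, sr=16000):
    """Extract frame-level pitch and energy contours and their derivatives."""
    y, sr = librosa.load(wav_path, sr=sr)
    f0, voiced_flag, _ = librosa.pyin(
        y, fmin=librosa.note_to_hz("C2"), fmax=librosa.note_to_hz("C6"), sr=sr
    )
    f0 = f0[~np.isnan(f0)]                # keep voiced frames only
    energy = librosa.feature.rms(y=y)[0]  # frame-level RMS energy
    return {
        "pitch": f0,
        "pitch_delta": np.diff(f0) if f0.size > 1 else np.zeros(1),
        "energy": energy,
        "energy_delta": np.diff(energy),
    }

def high_level_features(contours):
    """Statistics (mean, std, min, max, range) over each low-level contour."""
    stats = []
    for values in contours.values():
        stats.extend([
            values.mean(), values.std(),
            values.min(), values.max(),
            values.max() - values.min(),
        ])
    return np.array(stats)

# Example use (file name is hypothetical):
# vector = high_level_features(low_level_features("utterance.wav"))
# Per the abstract, vectors like this would be fed to an artificial neural
# network classifier (e.g. sklearn.neural_network.MLPClassifier).
```

Feature selection of the database-specific emotional features (choosing the high-level statistics most informative about emotion) would then be applied on top of such vectors before training the recognisers.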
Pages: 311 - 320
Number of pages: 9
Related Papers
50 records in total
  • [1] Context-independent acoustic models for Thai speech recognition
    Kasuriya, S
    Kanokphara, S
    Thatphithakkul, N
    Cotsomrong, P
    Sunpethniyom, T
    IEEE INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2004 (ISCIT 2004), PROCEEDINGS, VOLS 1 AND 2: SMART INFO-MEDIA SYSTEMS, 2004, : 991 - 994
  • [2] Emotion Recognition from Speech Signal in Multilingual
    Albu, Corina
    Lupu, Eugen
    Arsinte, Radu
    6TH INTERNATIONAL CONFERENCE ON ADVANCEMENTS OF MEDICINE AND HEALTH CARE THROUGH TECHNOLOGY, MEDITECH 2018, 2019, 71 : 157 - 161
  • [3] MULTI-SPEAKER AND CONTEXT-INDEPENDENT ACOUSTICAL CUES FOR AUTOMATIC SPEECH RECOGNITION
    ROSSI, M
    NISHINUMA, Y
    MERCIER, G
    SPEECH COMMUNICATION, 1983, 2 (2-3) : 215 - 217
  • [4] Emotion recognition from Mandarin speech signals
    Pao, TL
    Chen, YT
    Yeh, JH
    2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 301 - 304
  • [5] Separability and recognition of emotion states in multilingual speech
    Jiang, XQ
    Tian, L
    Han, M
    2005 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS, VOLS 1 AND 2, PROCEEDINGS: VOL 1: COMMUNICATION THEORY AND SYSTEMS, 2005, : 861 - 864
  • [6] Integrating Language and Emotion Features for Multilingual Speech Emotion Recognition
    Heracleous, Panikos
    Mohammad, Yasser
    Yoneyama, Akio
    HUMAN-COMPUTER INTERACTION. MULTIMODAL AND NATURAL INTERACTION, HCI 2020, PT II, 2020, 12182 : 187 - 196
  • [7] Emotion recognition and evaluation from Mandarin speech signals
    Pao, Tsanglong
    Chen, Yute
    Yeh, Junheng
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2008, 4 (07): : 1695 - 1709
  • [8] Improving Automatic Emotion Recognition from Speech Signals
    Bozkurt, Elif
    Erzin, Engin
    Erdem, Cigdem Eroglu
    Erdem, A. Tanju
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 312 - +
  • [9] CONTEXT-INDEPENDENT SOLVERS
    Marti, Rafael
    23RD EUROPEAN CONFERENCE ON MODELLING AND SIMULATION (ECMS 2009), 2009, : 7 - 8
  • [10] Enhancing multilingual recognition of emotion in speech by language identification
    Sagha, Hesam
    Matejka, Pavel
    Gavryukova, Maryna
    Povolny, Filip
    Marchi, Erik
    Schuller, Bjoern
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2949 - 2953