COMPARISON OF SPEAKER DEPENDENT AND SPEAKER INDEPENDENT EMOTION RECOGNITION

被引:25
|
作者
Rybka, Jan [1 ]
Janicki, Artur [2 ]
机构
[1] Warsaw Univ Technol, Inst Comp Sci, PL-00665 Warsaw, Poland
[2] Warsaw Univ Technol, Inst Telecommun, PL-00665 Warsaw, Poland
关键词
speech processing; emotion recognition; EMO-DB; support vector machines; artificial neural networks; SPEECH;
D O I
10.2478/amcs-2013-0060
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes a study of emotion recognition based on speech analysis. The introduction to the theory contains a review of emotion inventories used in various studies of emotion recognition as well as the speech corpora applied, methods of speech parametrization, and the most commonly employed classification algorithms. In the current study the EMO-DB speech corpus and three selected classifiers, the k-Nearest Neighbor (k-NN), the Artificial Neural Network (ANN) and Support Vector Machines (SVMs), were used in experiments. SVMs turned out to provide the best classification accuracy of 75.44% in the speaker dependent mode, that is, when speech samples from the same speaker were included in the training corpus. Various speaker dependent and speaker independent configurations were analyzed and compared. Emotion recognition in speaker dependent conditions usually yielded higher accuracy results than a similar but speaker independent configuration. The improvement was especially well observed if the base recognition ratio of a given speaker was low. Happiness and anger, as well as boredom and neutrality, proved to be the pairs of emotions most often confused.
引用
收藏
页码:797 / 808
页数:12
相关论文
共 50 条
  • [21] TEXT INDEPENDENT SPEAKER RECOGNITION
    FOIL, JT
    JOHNSON, DH
    IEEE COMMUNICATIONS MAGAZINE, 1983, 21 (09) : 22 - 25
  • [22] Comparison of Gender- and Speaker-adaptive Emotion Recognition
    Sidorov, Maxim
    Ultes, Stefan
    Schmitt, Alexander
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 3476 - 3480
  • [23] Speaker independent emotion recognition based on SVM/HMMs fusion system
    Fu, Liqin
    Mao, Xia
    Chen, Lijiang
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 61 - 65
  • [24] Speaker-Independent Emotion Recognition based on Feature Vector Classification
    Park, Jeong-Sik
    Kim, Ji-Hwan
    Yoon, Sang-Min
    Oh, Yung-Hwan
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2775 - +
  • [25] DYNAMIC SPEAKER ADAPTATION IN SPEAKER-INDEPENDENT WORD RECOGNITION
    HEWETT, AJ
    HOLMES, G
    YOUNG, SJ
    PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 275 - 282
  • [26] Speaker Awareness for Speech Emotion Recognition
    Assuncao, Gustavo
    Menezes, Paulo
    Perdigao, Fernando
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2020, 16 (04) : 15 - 22
  • [27] Sound Processing Features for Speaker-Dependent and Phrase-Independent Emotion Recognition in Berlin Database
    Anagnostopoulos, Christos Nikolaos
    Vovoli, Eftichia
    INFORMATION SYSTEMS DEVELOPMENT: TOWARDS A SERVICE PROVISION SOCIETY, 2009, : 413 - 421
  • [28] Speaker Attentive Speech Emotion Recognition
    Le Moine, Clement
    Obin, Nicolas
    Roebel, Axel
    INTERSPEECH 2021, 2021, : 2866 - 2870
  • [29] Speaker-specific mapping for text-independent speaker recognition
    Misra, H
    Ikbal, S
    Yegnanarayana, B
    SPEECH COMMUNICATION, 2003, 39 (3-4) : 301 - 310
  • [30] Speaker and Channel Factors in Text-Dependent Speaker Recognition
    Stafylakis, Themos
    Kenny, Patrick
    Alam, Md. Jahangir
    Kockmann, Marcel
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (01) : 65 - 78