A Comparative Study on Different Labelling Schemes and Cross-Corpus Experiments in Speech Emotion Recognition

被引:0
|
作者
Baki, Pinar [1 ]
Erden, Berna [1 ]
Oncul, Serkan [1 ]
机构
[1] Arcel Arastirma Gelistirme Merkezi, Istanbul, Turkey
关键词
speech emotion recognition; cross-corpus training; emotion categories; audio classification;
D O I
10.1109/SIU53274.2021.9477924
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Performance of the speech emotion recognition systems depends on many factors such as quality of the speech data, environment, cultural differences, language, emotion categorization scheme, etc. In this work, we create a baseline speech emotion recognition model based on convolutional neural networks using the RAVDESS dataset. First, we compare the performance of the model with different labeling schemes. Then, we perform cross-corpus experiments on datasets recorded in different languages. The results show that emotion groups with common arousal or valence categories are often confused and using multiple corpora in training improves the generalization capacity of the model.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] Multi-scale discrepancy adversarial network for cross-corpus speech emotion recognition
    Wanlu ZHENG
    Wenming ZHENG
    Yuan ZONG
    虚拟现实与智能硬件(中英文), 2021, 3 (01) : 65 - 75
  • [42] Improving Cross-Corpus Speech Emotion Recognition with Adversarial Discriminative Domain Generalization (ADDoG)
    Gideon, John
    McInnis, Melvin G.
    Provost, Emily Mower
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2021, 12 (04) : 1055 - 1068
  • [43] Low-rank joint distribution adaptation for cross-corpus speech emotion recognition
    Li, Sunan
    Lu, Cheng
    Zhao, Yan
    Lian, Hailun
    Qi, Tianhua
    Zong, Yuan
    KNOWLEDGE-BASED SYSTEMS, 2025, 315
  • [44] CROSS-CORPUS EEG-BASED EMOTION RECOGNITION
    Rayatdoost, Soheil
    Soleymani, Mohammad
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [45] Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition
    Latif, Siddique
    Rana, Rajib
    Khalifa, Sara
    Jurdak, Raja
    Schuller, Bjorn
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 1912 - 1926
  • [46] Layer-Adapted Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition
    Zhao, Yan
    Zong, Yuan
    Wang, Jincen
    Lian, Hailun
    Lu, Cheng
    Zhao, Li
    Zheng, Wenming
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 5419 - 5430
  • [47] Cross-corpus speech emotion recognition using semi-supervised domain adaptation network
    Zhang, Yumei
    Jia, Maoshen
    Cao, Xuan
    Ru, Jiawei
    Zhang, Xinfeng
    SPEECH COMMUNICATION, 2025, 168
  • [48] Accuracy of Automatic Cross-Corpus Emotion Labeling for Conversational Speech Corpus Commonization
    Mori, Hiroki
    Nagaoka, Atsushi
    Arimoto, Yoshiko
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 4019 - 4023
  • [49] MBDA: A Multi-scale Bidirectional Perception Approach for Cross-Corpus Speech Emotion Recognition
    Li, Jiayang
    Wang, Xiaoye
    Li, Siyuan
    Shi, Jia
    Xiao, Yingyuan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14877 : 329 - 341
  • [50] Fusing Visual Attention CNN and Bag of Visual Words for Cross-Corpus Speech Emotion Recognition
    Seo, Minji
    Kim, Myungho
    SENSORS, 2020, 20 (19) : 1 - 21