Cross-Corpus Speech Emotion Recognition Based on Causal Emotion Information Representation

Times cited: 0
Authors
Fu, Hongliang [1]
Li, Qianqian [1]
Tao, Huawei [1]
Zhu, Chunhua [1]
Xie, Yue [2]
Guo, Ruxue [3]
Affiliations
[1] Henan Univ Technol, Key Lab Grain Informat Proc & Control, Minist Educ, Zhengzhou 450001, Peoples R China
[2] Nanjing Inst Technol, Sch Commun Engn, Nanjing 211167, Peoples R China
[3] IFLYTEK Res, Hefei 230088, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
cross-corpus speech emotion recognition; causal representation learning; domain adaptation
DOI
10.1587/transinf.2023EDL8087
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Speech emotion recognition (SER) is a key technology for realizing third-generation artificial intelligence and is widely used in human-computer interaction, emotion diagnosis, interpersonal communication, and other fields. However, the aliasing of language and semantic information in speech tends to distort the alignment of emotion features, which degrades the performance of cross-corpus SER systems. This paper proposes a cross-corpus SER model based on causal emotion information representation (CEIR). The model uses the reconstruction loss of a deep autoencoder network together with source-domain label information to achieve a preliminary separation of causal features. A causal correlation matrix is then constructed and combined with local maximum mean discrepancy (LMMD) feature alignment so that the causal features of different dimensions become jointly independent in distribution. Finally, supervised fine-tuning on labeled data is used to extract the causal emotion information effectively. Experimental results show that the proposed algorithm improves the average unweighted average recall (UAR) by 3.4% to 7.01% compared with several recent algorithms in the field.
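The abstract reports results as unweighted average recall (UAR). As a minimal sketch of how that metric is commonly computed (the mean of per-class recall, so that every emotion class counts equally regardless of how many utterances it has), the following may help. The function name and the toy labels are illustrative assumptions and are not taken from the paper.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Return the mean of per-class recall over the classes present in y_true.

    Hypothetical helper for illustration; not the authors' evaluation code.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    total = defaultdict(int)    # number of samples per true class
    correct = defaultdict(int)  # correctly recognized samples per true class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1

    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Toy, made-up labels: an imbalanced set with three emotion classes.
y_true = ["angry", "angry", "angry", "angry", "happy", "sad"]
y_pred = ["angry", "angry", "angry", "happy", "happy", "angry"]

uar = unweighted_average_recall(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"UAR = {uar:.4f}, accuracy = {accuracy:.4f}")
```

In this toy case the per-class recalls are 3/4, 1/1, and 0/1, so UAR is lower than plain accuracy because the missed minority class weighs as much as the majority class. This is why UAR is often preferred for imbalanced emotion corpora.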
Pages: 1097-1100
Number of pages: 4
Related papers
50 in total
  • [21] Cross-Corpus Speech Emotion Recognition Based on Joint Transfer Subspace Learning and Regression
    Zhang, Weijian
    Song, Peng
    Chen, Dongliang
    Sheng, Chao
    Zhang, Wenjing
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) : 588 - 598
  • [22] Cross-Corpus Arabic and English Emotion Recognition
    Meftah, Ali
    Seddiq, Yasser
    Alotaibi, Yousef
    Selouani, Sid-Ahmed
    2017 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2017, : 377 - 381
  • [23] Unsupervised Domain Adaptation Integrating Transformers and Mutual Information for Cross-Corpus Speech Emotion Recognition
    Zhang, Shiqing
    Liu, Ruixin
    Yang, Yijiao
    Zhao, Xiaoming
    Yu, Jun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [24] Deep Transductive Transfer Regression Network for Cross-Corpus Speech Emotion Recognition
    Zhao, Yan
    Wang, Jincen
    Ye, Ru
    Zong, Yuan
    Zheng, Wenming
    Zhao, Li
    INTERSPEECH 2022, 2022, : 371 - 375
  • [25] Towards Domain-Specific Cross-Corpus Speech Emotion Recognition Approach
    Zhao, Yan
    Zong, Yuan
    Lian, Hailun
    Lu, Cheng
    Shi, Jingang
    Zheng, Wenming
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,
  • [26] Target-Adapted Subspace Learning for Cross-Corpus Speech Emotion Recognition
    Chen, Xiuzhen
    Zhou, Xiaoyan
    Lu, Cheng
    Zong, Yuan
    Zheng, Wenming
    Tang, Chuangao
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (12) : 2632 - 2636
  • [27] An adaptation framework with unified embedding reconstruction for cross-corpus speech emotion recognition
    Zhang, Ruiteng
    Wei, Jianguo
    Lu, Xugang
    Li, Yongwei
    Lu, Wenhuan
    Zhang, Lin
    Xu, Junhai
    APPLIED SOFT COMPUTING, 2025, 174
  • [29] Cross-corpus speech emotion recognition using subspace learning and domain adaption
    Cao, Xuan
    Jia, Maoshen
    Ru, Jiawei
    Pai, Tun-wen
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2022, 2022 (01)
  • [30] Progressive distribution adapted neural networks for cross-corpus speech emotion recognition
    Zong, Yuan
    Lian, Hailun
    Zhang, Jiacheng
    Feng, Ercui
    Lu, Cheng
    Chang, Hongli
    Tang, Chuangao
    FRONTIERS IN NEUROROBOTICS, 2022, 16