Transferable discriminant linear regression for cross-corpus speech emotion recognition

被引:7
|
作者
Li, Shaokai [1 ]
Song, Peng [1 ]
Zhang, Wenjing [1 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China
基金
中国国家自然科学基金;
关键词
Linear regression; Speech emotion recognition; Category space; Transfer learning; LEAST-SQUARES REGRESSION; GENERAL FRAMEWORK; FEATURES; REGULARIZATION; CLASSIFICATION; ADAPTATION; DATABASES;
D O I
10.1016/j.apacoust.2022.108919
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech emotion recognition (SER) has attracted much interest recently due to its wide applications. However, it should be noted that most SER methods are conducted on the assumption that the training and testing data are from the same database. In real applications, this assumption does not hold, and the recognition performance will be significantly degraded. To solve this problem, we present a novel trans-ferable discriminant linear regression (TDLR) approach for cross-corpus SER. Specifically, first, we intro-duce a non-negative label relaxation linear regression on source corpus to help learn transferable feature representations. Second, we propose a simple but effective strategy to keep the linear relationship between the labels of source and target corpora. Meanwhile, we utilize the discriminative maximum mean discrepancy (MMD) as the distance metric between two databases. Furthermore, we use the graph Laplacian to preserve the geometric structure of samples, which can further reduce the distribution gap between the two databases. Additionally, to better obtain the intrinsic properties of data and make the model robust, we impose an '2;1-norm on the transformation matrices. Extensive experiments have been carried out on several standard databases, and the results show that TDLR can obtain better recognition performance than several state-of-the-art algorithms. (C) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Multi-scale discrepancy adversarial network for cross-corpus speech emotion recognition
    Wanlu ZHENG
    Wenming ZHENG
    Yuan ZONG
    虚拟现实与智能硬件(中英文), 2021, 3 (01) : 65 - 75
  • [42] Improving Cross-Corpus Speech Emotion Recognition with Adversarial Discriminative Domain Generalization (ADDoG)
    Gideon, John
    McInnis, Melvin G.
    Provost, Emily Mower
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2021, 12 (04) : 1055 - 1068
  • [43] Low-rank joint distribution adaptation for cross-corpus speech emotion recognition
    Li, Sunan
    Lu, Cheng
    Zhao, Yan
    Lian, Hailun
    Qi, Tianhua
    Zong, Yuan
    KNOWLEDGE-BASED SYSTEMS, 2025, 315
  • [44] CROSS-CORPUS EEG-BASED EMOTION RECOGNITION
    Rayatdoost, Soheil
    Soleymani, Mohammad
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [45] Robust Transferable Subspace Learning for Cross-Corpus Facial Expression Recognition
    Chen, Dongliang
    Song, Peng
    Zhang, Wenjing
    Zhang, Weijian
    Xu, Bingui
    Zhou, Xuan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (10): : 2241 - 2245
  • [46] Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition
    Latif, Siddique
    Rana, Rajib
    Khalifa, Sara
    Jurdak, Raja
    Schuller, Bjorn
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 1912 - 1926
  • [47] Layer-Adapted Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition
    Zhao, Yan
    Zong, Yuan
    Wang, Jincen
    Lian, Hailun
    Lu, Cheng
    Zhao, Li
    Zheng, Wenming
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 5419 - 5430
  • [48] Cross-corpus speech emotion recognition using semi-supervised domain adaptation network
    Zhang, Yumei
    Jia, Maoshen
    Cao, Xuan
    Ru, Jiawei
    Zhang, Xinfeng
    SPEECH COMMUNICATION, 2025, 168
  • [49] Learning Transferable Sparse Representations for Cross-Corpus Facial Expression Recognition
    Chen, Dongliang
    Song, Peng
    Zheng, Wenming
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1322 - 1333
  • [50] Accuracy of Automatic Cross-Corpus Emotion Labeling for Conversational Speech Corpus Commonization
    Mori, Hiroki
    Nagaoka, Atsushi
    Arimoto, Yoshiko
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 4019 - 4023