Transferable discriminant linear regression for cross-corpus speech emotion recognition

被引：7

作者：

Li, Shaokai ^{[1
]}

Song, Peng ^{[1
]}

Zhang, Wenjing ^{[1
]}

机构：

[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China

来源：

APPLIED ACOUSTICS | 2022年 / 197卷

基金：

中国国家自然科学基金;

关键词：

Linear regression; Speech emotion recognition; Category space; Transfer learning; LEAST-SQUARES REGRESSION; GENERAL FRAMEWORK; FEATURES; REGULARIZATION; CLASSIFICATION; ADAPTATION; DATABASES;

D O I：

10.1016/j.apacoust.2022.108919

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Speech emotion recognition (SER) has attracted much interest recently due to its wide applications. However, it should be noted that most SER methods are conducted on the assumption that the training and testing data are from the same database. In real applications, this assumption does not hold, and the recognition performance will be significantly degraded. To solve this problem, we present a novel trans-ferable discriminant linear regression (TDLR) approach for cross-corpus SER. Specifically, first, we intro-duce a non-negative label relaxation linear regression on source corpus to help learn transferable feature representations. Second, we propose a simple but effective strategy to keep the linear relationship between the labels of source and target corpora. Meanwhile, we utilize the discriminative maximum mean discrepancy (MMD) as the distance metric between two databases. Furthermore, we use the graph Laplacian to preserve the geometric structure of samples, which can further reduce the distribution gap between the two databases. Additionally, to better obtain the intrinsic properties of data and make the model robust, we impose an '2;1-norm on the transformation matrices. Extensive experiments have been carried out on several standard databases, and the results show that TDLR can obtain better recognition performance than several state-of-the-art algorithms. (C) 2022 Elsevier Ltd. All rights reserved.

引用

页数：11

共 50 条

[41] Multi-scale discrepancy adversarial network for cross-corpus speech emotion recognition
Wanlu ZHENG
Wenming ZHENG
Yuan ZONG
虚拟现实与智能硬件(中英文), 2021, 3 (01) : 65 - 75
[42] Improving Cross-Corpus Speech Emotion Recognition with Adversarial Discriminative Domain Generalization (ADDoG)
Gideon, John
McInnis, Melvin G.
Provost, Emily Mower
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2021, 12 (04) : 1055 - 1068
[43] Low-rank joint distribution adaptation for cross-corpus speech emotion recognition
Li, Sunan
Lu, Cheng
Zhao, Yan
Lian, Hailun
Qi, Tianhua
Zong, Yuan
KNOWLEDGE-BASED SYSTEMS, 2025, 315
[44] CROSS-CORPUS EEG-BASED EMOTION RECOGNITION
Rayatdoost, Soheil
Soleymani, Mohammad
2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
[45] Robust Transferable Subspace Learning for Cross-Corpus Facial Expression Recognition
Chen, Dongliang
Song, Peng
Zhang, Wenjing
Zhang, Weijian
Xu, Bingui
Zhou, Xuan
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (10): : 2241 - 2245
[46] Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition
Latif, Siddique
Rana, Rajib
Khalifa, Sara
Jurdak, Raja
Schuller, Bjorn
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 1912 - 1926
[47] Layer-Adapted Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition
Zhao, Yan
Zong, Yuan
Wang, Jincen
Lian, Hailun
Lu, Cheng
Zhao, Li
Zheng, Wenming
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 5419 - 5430
[48] Cross-corpus speech emotion recognition using semi-supervised domain adaptation network
Zhang, Yumei
Jia, Maoshen
Cao, Xuan
Ru, Jiawei
Zhang, Xinfeng
SPEECH COMMUNICATION, 2025, 168
[49] Learning Transferable Sparse Representations for Cross-Corpus Facial Expression Recognition
Chen, Dongliang
Song, Peng
Zheng, Wenming
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1322 - 1333
[50] Accuracy of Automatic Cross-Corpus Emotion Labeling for Conversational Speech Corpus Commonization
Mori, Hiroki
Nagaoka, Atsushi
Arimoto, Yoshiko
LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 4019 - 4023

← 1 2 3 4 5 →