Facial Depression Recognition by Deep Joint Label Distribution and Metric Learning

Cited by: 29

Authors:

Zhou, Xiuzhuang [1]

Wei, Zeqiang [1]

Xu, Min [2]

Qu, Shan [3]

Guo, Guodong [4,5]

Affiliations:

[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China

[2] Capital Normal Univ, Coll Informat & Engn, Beijing 100048, Peoples R China

[3] Peking Univ Peoples Hosp, Dept Psychiat, Beijing 100044, Peoples R China

[4] Baidu Res, Inst Deep Learning, Beijing, Peoples R China

[5] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100085, Peoples R China

Source:

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING | 2022 / Volume 13 / Issue 03

Funding:

National Natural Science Foundation of China; Beijing Natural Science Foundation;

Keywords:

Feature extraction; Face recognition; Measurement; Predictive models; Histograms; Spatiotemporal phenomena; Faces; Depression recognition; label distribution learning; metric learning; label-aware histogram loss; spatiotemporal feature; SCALE;

DOI:

10.1109/TAFFC.2020.3022732

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

While existing prediction models built on popular deep architectures have shown promising results in facial depression recognition, they still lack sufficient discriminative power because of two issues: 1) the limited amount of labeled depression data for deep representation learning, and 2) the large variation in facial expression across different persons with the same depression score, together with the subtle difference in facial expression across different depression levels. In this article, we formulate facial depression recognition as a label distribution learning (LDL) problem, and propose a deep joint label distribution and metric learning (DJ-LDML) method to address these issues. In DJ-LDML, LDL exploits the label relevance inherent in depression data to implicitly increase the amount of training data associated with each depression level without actually enlarging the dataset, while deep metric learning (DML) aims at learning a deep ordinal embedding with a specifically designed label-aware histogram loss, allowing the semantic similarity between video sequences (described by ordinal labels) to be preserved for discriminative feature learning. The two learning modules in DJ-LDML work collaboratively to enhance the representation ability and discriminative power of the deeply learned spatiotemporal feature, leading to improved depression prediction. We empirically evaluate our method on two benchmark datasets, and the results demonstrate the effectiveness of our formulation.
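
The abstract's idea of treating a depression score as a label distribution can be illustrated with a minimal sketch. It is not the authors' DJ-LDML implementation. The Gaussian shape, the `sigma` width, the 64 score levels, and all function names below are assumptions chosen for illustration, and the label-aware histogram loss is not reproduced here.

```python
import math


def gaussian_label_distribution(score, num_levels=64, sigma=2.0):
    """Spread a scalar score over discrete levels 0..num_levels-1.

    Assumption: a discretized Gaussian centered on the score. The abstract
    does not state which distribution shape or width is used.
    """
    weights = [
        math.exp(-((level - score) ** 2) / (2.0 * sigma ** 2))
        for level in range(num_levels)
    ]
    total = sum(weights)
    return [w / total for w in weights]


def kl_divergence(target, predicted, eps=1e-12):
    """KL(target || predicted), a common loss for label distribution learning."""
    return sum(
        t * math.log((t + eps) / (p + eps))
        for t, p in zip(target, predicted)
        if t > 0.0
    )


def expected_score(distribution):
    """Recover a scalar prediction as the expectation over levels."""
    return sum(level * p for level, p in enumerate(distribution))


# A sample labeled 20 also contributes probability mass to levels 19, 21, ...,
# which is how neighboring levels gain training signal without new data.
target = gaussian_label_distribution(20)
uniform = [1.0 / len(target)] * len(target)
print(round(expected_score(target), 3))
print(kl_divergence(target, uniform) > 0.0)
```

In this sketch a model would output a distribution over levels, be trained against `target` with `kl_divergence`, and report `expected_score` of its output as the predicted depression score.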

Citation:

Pages: 1605-1618

Number of pages: 14

