Facial Depression Recognition by Deep Joint Label Distribution and Metric Learning

Cited by: 29
Authors
Zhou, Xiuzhuang [1]
Wei, Zeqiang [1]
Xu, Min [2]
Qu, Shan [3]
Guo, Guodong [4, 5]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Capital Normal Univ, Coll Informat & Engn, Beijing 100048, Peoples R China
[3] Peking Univ Peoples Hosp, Dept Psychiat, Beijing 100044, Peoples R China
[4] Baidu Res, Inst Deep Learning, Beijing, Peoples R China
[5] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100085, Peoples R China
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation
Keywords
Feature extraction; Face recognition; Measurement; Predictive models; Histograms; Spatiotemporal phenomena; Faces; Depression recognition; label distribution learning; metric learning; label-aware histogram loss; spatiotemporal feature; SCALE;
DOI
10.1109/TAFFC.2020.3022732
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
While existing prediction models built on popular deep architectures have shown promising results in facial depression recognition, they still lack sufficient discriminative power due to two issues: 1) the limited amount of labeled depression data for deep representation learning, and 2) the large variation in facial expression across different persons with the same depression score, together with the subtle differences in facial expression across different depression levels. In this article, we formulate facial depression recognition as a label distribution learning (LDL) problem and propose a deep joint label distribution and metric learning (DJ-LDML) method to address these issues. In DJ-LDML, LDL exploits the label relevance inherent in depression data to implicitly increase the amount of training data associated with each depression level without actually enlarging the dataset, while deep metric learning (DML) aims at learning a deep ordinal embedding with a specifically designed label-aware histogram loss, allowing the semantic similarity between video sequences (described by ordinal labels) to be preserved for discriminative feature learning. The two learning modules in DJ-LDML work collaboratively to enhance the representation ability and discriminative power of the deeply learned spatiotemporal feature, leading to improved depression prediction. We empirically evaluate our method on two benchmark datasets, and the results demonstrate the effectiveness of our formulation.
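The label distribution idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Gaussian shape, the 64-level score scale, the `sigma` value, the KL-divergence objective, and all function names below are assumptions chosen for illustration. The sketch shows how a single scalar score can be spread over neighbouring levels, so that each training sample also contributes to adjacent depression levels.

```python
import math


def gaussian_label_distribution(score, num_levels=64, sigma=2.0):
    """Spread a scalar score over discrete levels 0..num_levels-1.

    Assumption: a discretised Gaussian centred on the score. Neighbouring
    levels receive non-zero probability, which is how one sample can
    inform several adjacent levels.
    """
    weights = [
        math.exp(-((level - score) ** 2) / (2.0 * sigma ** 2))
        for level in range(num_levels)
    ]
    total = sum(weights)
    return [w / total for w in weights]


def expected_score(distribution):
    """Decode a distribution back to a scalar as its expected level."""
    return sum(level * p for level, p in enumerate(distribution))


def kl_divergence(target, predicted, eps=1e-12):
    """KL(target || predicted), a common (assumed) LDL training objective."""
    return sum(
        t * math.log((t + eps) / (p + eps))
        for t, p in zip(target, predicted)
    )


if __name__ == "__main__":
    target = gaussian_label_distribution(20)
    near = gaussian_label_distribution(21)
    far = gaussian_label_distribution(40)
    # A prediction centred one level away is penalised far less than
    # one centred twenty levels away.
    print(kl_divergence(target, near), kl_divergence(target, far))
    print(expected_score(target))
```

In this sketch the loss grows with the distance between the predicted and target levels, which reflects the ordinal label relevance that the abstract says LDL exploits.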
Pages: 1605-1618
Number of pages: 14