Multi-threshold deep metric learning for facial expression recognition

被引:0
|
作者
Yang, Wenwu [1 ]
Yu, Jinyi [1 ]
Chen, Tuo [1 ]
Liu, Zhenguang [2 ]
Wang, Xun [1 ]
Shen, Jianbing [3 ]
机构
[1] Zhejiang GongShang Univ, Hangzhou 310018, Peoples R China
[2] Zhejiang Univ, Hangzhou 310012, Peoples R China
[3] Univ Macau, Taipa 999078, Macau, Peoples R China
关键词
Facial expression recognition; Triplet loss learning; Multiple thresholds;
D O I
10.1016/j.patcog.2024.110711
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature representations generated through triplet-based deep metric learning offer significant advantages for facial expression recognition (FER). Each threshold in triplet loss inherently shapes a distinct distribution of inter-class variations, leading to unique representations of expression features. Nonetheless, pinpointing the optimal threshold for triplet loss presents a formidable challenge, as the ideal threshold varies not only across different datasets but also among classes within the same dataset. In this paper, we propose a novel multi-threshold deep metric learning approach that bypasses the complex process of threshold validation and markedly improves the effectiveness in creating expression feature representations. Instead of choosing a single optimal threshold from a valid range, we comprehensively sample thresholds throughout this range, which ensures that the representation characteristics exhibited by the thresholds within this spectrum are fully captured and utilized for enhancing FER. Specifically, we segment the embedding layer of the deep metric learning network into multiple slices, with each slice representing a specific threshold sample. We subsequently train these embedding slices in an end-to-end fashion, applying triplet loss at its associated threshold to each slice, which results in a collection of unique expression features corresponding to each embedding slice. Moreover, we identify the issue that the traditional triplet loss may struggle to converge when employing the widely-used Batch Hard strategy for mining informative triplets, and introduce a novel loss termed dual triplet loss to address it. Extensive evaluations demonstrate the superior performance of the proposed approach on both posed and spontaneous facial expression datasets.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Joint Deep Learning of Facial Expression Synthesis and Recognition
    Yan, Yan
    Huang, Ying
    Chen, Si
    Shen, Chunhua
    Wang, Hanzi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) : 2792 - 2807
  • [22] Dynamic Facial Expression Recognition Based on Deep Learning
    Deng, Liwei
    Wang, Qian
    Yuan, Ding
    14TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND EDUCATION (ICCSE 2019), 2019, : 32 - 37
  • [23] A discriminative deep association learning for facial expression recognition
    Jin, Xing
    Sun, Wenyun
    Jin, Zhong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (04) : 779 - 793
  • [24] A survey of facial expression recognition based on deep learning
    Wei, Heng
    Zhang, Zhi
    PROCEEDINGS OF THE 15TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2020), 2020, : 90 - 94
  • [25] Deep Learning for Illumination Invariant Facial Expression Recognition
    Ruiz-Garcia, Ariel
    Palade, Vasile
    Elshaw, Mark
    Almakky, Ibrahim
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 202 - 207
  • [26] Automatic Facial Expression Recognition Using Deep Learning
    Prasad, M. S. Guru
    Prithviraj
    Choudhury, Tanupriya
    Kotecha, Ketan
    Jain, Deepak
    Yeole, Ashwini N.
    INTELLIGENT AND FUZZY SYSTEMS, INFUS 2024 CONFERENCE, VOL 1, 2024, 1088 : 414 - 426
  • [27] A discriminative deep association learning for facial expression recognition
    Xing Jin
    Wenyun Sun
    Zhong Jin
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 779 - 793
  • [28] Sparse deep feature learning for facial expression recognition
    Xie, Weicheng
    Jia, Xi
    Shen, Linlin
    Yang, Meng
    PATTERN RECOGNITION, 2019, 96
  • [29] Research of Facial Expression Recognition Based on Deep Learning
    Zhang, Linhao
    Yang, Yuliang
    Li, Wanchong
    Dang, Shuai
    Zhu, Mengyu
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 688 - 691
  • [30] MULTI-THRESHOLD THRESHOLD ELEMENTS
    HARING, DR
    IEEE TRANSACTIONS ON ELECTRONIC COMPUTERS, 1966, EC15 (01): : 45 - &