Multi-threshold deep metric learning for facial expression recognition

Cited by: 0
Authors
Yang, Wenwu [1]
Yu, Jinyi [1]
Chen, Tuo [1]
Liu, Zhenguang [2]
Wang, Xun [1]
Shen, Jianbing [3]
Affiliations
[1] Zhejiang Gongshang University, Hangzhou 310018, People's Republic of China
[2] Zhejiang University, Hangzhou 310012, People's Republic of China
[3] University of Macau, Taipa 999078, Macau, People's Republic of China
Keywords
Facial expression recognition; Triplet loss learning; Multiple thresholds
DOI
10.1016/j.patcog.2024.110711
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Feature representations generated through triplet-based deep metric learning offer significant advantages for facial expression recognition (FER). Each threshold in triplet loss inherently shapes a distinct distribution of inter-class variations, leading to unique representations of expression features. Nonetheless, pinpointing the optimal threshold for triplet loss presents a formidable challenge, as the ideal threshold varies not only across different datasets but also among classes within the same dataset. In this paper, we propose a novel multi-threshold deep metric learning approach that bypasses the complex process of threshold validation and markedly improves the effectiveness in creating expression feature representations. Instead of choosing a single optimal threshold from a valid range, we comprehensively sample thresholds throughout this range, which ensures that the representation characteristics exhibited by the thresholds within this spectrum are fully captured and utilized for enhancing FER. Specifically, we segment the embedding layer of the deep metric learning network into multiple slices, with each slice representing a specific threshold sample. We subsequently train these embedding slices in an end-to-end fashion, applying triplet loss at its associated threshold to each slice, which results in a collection of unique expression features corresponding to each embedding slice. Moreover, we identify the issue that the traditional triplet loss may struggle to converge when employing the widely-used Batch Hard strategy for mining informative triplets, and introduce a novel loss termed dual triplet loss to address it. Extensive evaluations demonstrate the superior performance of the proposed approach on both posed and spontaneous facial expression datasets.
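The slicing idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the threshold range, the number of slices, the per-slice L2 normalization, and the summation of slice losses are all assumptions made for illustration, and the function names are hypothetical. The sketch uses the standard triplet loss with Batch Hard mining rather than the dual triplet loss, whose form the abstract does not give.

```python
import numpy as np

def batch_hard_triplet_loss(emb, labels, margin):
    """Standard triplet loss with Batch Hard mining (not the dual triplet loss)."""
    # Pairwise Euclidean distances between all embeddings in the batch.
    diff = emb[:, None, :] - emb[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1) + 1e-12)
    same = labels[:, None] == labels[None, :]
    pos_mask = same & ~np.eye(len(labels), dtype=bool)
    neg_mask = ~same
    # Hardest positive: farthest same-class sample.
    hardest_pos = np.where(pos_mask, dist, -np.inf).max(axis=1)
    # Hardest negative: closest sample from another class.
    hardest_neg = np.where(neg_mask, dist, np.inf).min(axis=1)
    # Only anchors with at least one positive and one negative contribute.
    valid = pos_mask.any(axis=1) & neg_mask.any(axis=1)
    per_anchor = np.maximum(hardest_pos - hardest_neg + margin, 0.0)
    return float(per_anchor[valid].mean()) if valid.any() else 0.0

def multi_threshold_loss(emb, labels, thresholds):
    """Split the embedding into one slice per threshold and sum the slice losses."""
    slices = np.array_split(emb, len(thresholds), axis=1)
    losses = []
    for piece, margin in zip(slices, thresholds):
        # Assumption: each slice is L2-normalized before distances are computed.
        piece = piece / (np.linalg.norm(piece, axis=1, keepdims=True) + 1e-12)
        losses.append(batch_hard_triplet_loss(piece, labels, margin))
    # Assumption: slice losses are summed with equal weight.
    return float(np.sum(losses)), losses

# Toy batch: 8 samples, 16-dimensional embedding, 4 expression classes.
rng = np.random.default_rng(0)
emb = rng.normal(size=(8, 16))
labels = np.array([0, 0, 1, 1, 2, 2, 3, 3])
# Assumption: thresholds are sampled uniformly from a hypothetical valid range.
thresholds = np.linspace(0.1, 0.7, 4)
total, per_slice = multi_threshold_loss(emb, labels, thresholds)
print(total, per_slice)
```

In a trainable setting the same structure would sit on top of the network's embedding layer, so that each slice receives gradients only from the triplet loss at its own threshold.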
Pages: 12