Multi-threshold deep metric learning for facial expression recognition

被引:0
|
作者
Yang, Wenwu [1 ]
Yu, Jinyi [1 ]
Chen, Tuo [1 ]
Liu, Zhenguang [2 ]
Wang, Xun [1 ]
Shen, Jianbing [3 ]
机构
[1] Zhejiang GongShang Univ, Hangzhou 310018, Peoples R China
[2] Zhejiang Univ, Hangzhou 310012, Peoples R China
[3] Univ Macau, Taipa 999078, Macau, Peoples R China
关键词
Facial expression recognition; Triplet loss learning; Multiple thresholds;
D O I
10.1016/j.patcog.2024.110711
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature representations generated through triplet-based deep metric learning offer significant advantages for facial expression recognition (FER). Each threshold in triplet loss inherently shapes a distinct distribution of inter-class variations, leading to unique representations of expression features. Nonetheless, pinpointing the optimal threshold for triplet loss presents a formidable challenge, as the ideal threshold varies not only across different datasets but also among classes within the same dataset. In this paper, we propose a novel multi-threshold deep metric learning approach that bypasses the complex process of threshold validation and markedly improves the effectiveness in creating expression feature representations. Instead of choosing a single optimal threshold from a valid range, we comprehensively sample thresholds throughout this range, which ensures that the representation characteristics exhibited by the thresholds within this spectrum are fully captured and utilized for enhancing FER. Specifically, we segment the embedding layer of the deep metric learning network into multiple slices, with each slice representing a specific threshold sample. We subsequently train these embedding slices in an end-to-end fashion, applying triplet loss at its associated threshold to each slice, which results in a collection of unique expression features corresponding to each embedding slice. Moreover, we identify the issue that the traditional triplet loss may struggle to converge when employing the widely-used Batch Hard strategy for mining informative triplets, and introduce a novel loss termed dual triplet loss to address it. Extensive evaluations demonstrate the superior performance of the proposed approach on both posed and spontaneous facial expression datasets.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Local Learning With Deep and Handcrafted Features for Facial Expression Recognition
    Georgescu, Mariana-Iuliana
    Ionescu, Radu Tudor
    Popescu, Marius
    IEEE ACCESS, 2019, 7 : 64827 - 64836
  • [42] Facial Expression Recognition System for Stress Detection with Deep Learning
    Almeida, Jose
    Rodrigues, Fatima
    PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS (ICEIS 2021), VOL 1, 2021, : 256 - 263
  • [43] Facial expression recognition using lightweight deep learning modeling
    Ahmad, Mubashir
    Saira
    Alfandi, Omar
    Khattak, Asad Masood
    Qadri, Syed Furqan
    Saeed, Iftikhar Ahmed
    Khan, Salabat
    Hayat, Bashir
    Ahmad, Arshad
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (05) : 8208 - 8225
  • [44] Facial Expression Recognition using Visual Saliency and Deep Learning
    Mavani, Viraj
    Raman, Shanmuganathan
    Miyapuram, Krishna P.
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 2783 - 2788
  • [45] Enhancing masked facial expression recognition with multimodal deep learning
    Shahzad, H. M.
    Bhatti, Sohail Masood
    Jaffar, Arfan
    Akram, Sheeraz
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 73911 - 73921
  • [46] Facial expression recognition via learning deep sparse autoencoders
    Zeng, Nianyin
    Zhang, Hong
    Song, Baoye
    Liu, Weibo
    Li, Yurong
    Dobaie, Abdullah M.
    NEUROCOMPUTING, 2018, 273 : 643 - 649
  • [47] Deep Disturbance-Disentangled Learning for Facial Expression Recognition
    Ruan, Delian
    Yan, Yan
    Chen, Si
    Xue, Jing-Hao
    Wang, Hanzi
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2833 - 2841
  • [48] Facial Expression Recognition of Animated Characters using Deep Learning
    Lakhani, Mohd Ismail
    McDermott, James
    Glavin, Frank G.
    Nagarajan, Sai Priya
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [49] A Compact Deep Learning Model for Robust Facial Expression Recognition
    Kuo, Chieh-Ming
    Lai, Shang-Hong
    Sarkis, Michel
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 2202 - 2210
  • [50] Real-World Facial Expression Recognition Using Metric Learning Method
    Liu, Zhiwen
    Li, Shan
    Deng, Weihong
    BIOMETRIC RECOGNITION, 2016, 9967 : 519 - 527