Multi-threshold deep metric learning for facial expression recognition

Cited by: 0

Authors:

Yang, Wenwu [1]

Yu, Jinyi [1]

Chen, Tuo [1]

Liu, Zhenguang [2]

Wang, Xun [1]

Shen, Jianbing [3]

Affiliations:

[1] Zhejiang GongShang Univ, Hangzhou 310018, Peoples R China

[2] Zhejiang Univ, Hangzhou 310012, Peoples R China

[3] Univ Macau, Taipa 999078, Macau, Peoples R China

Source:

PATTERN RECOGNITION | 2024 / Vol. 156

Keywords:

Facial expression recognition; Triplet loss learning; Multiple thresholds

DOI:

10.1016/j.patcog.2024.110711

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Feature representations generated through triplet-based deep metric learning offer significant advantages for facial expression recognition (FER). Each threshold in triplet loss inherently shapes a distinct distribution of inter-class variations, leading to unique representations of expression features. Nonetheless, pinpointing the optimal threshold for triplet loss presents a formidable challenge, as the ideal threshold varies not only across different datasets but also among classes within the same dataset. In this paper, we propose a novel multi-threshold deep metric learning approach that bypasses the complex process of threshold validation and markedly improves the effectiveness in creating expression feature representations. Instead of choosing a single optimal threshold from a valid range, we comprehensively sample thresholds throughout this range, which ensures that the representation characteristics exhibited by the thresholds within this spectrum are fully captured and utilized for enhancing FER. Specifically, we segment the embedding layer of the deep metric learning network into multiple slices, with each slice representing a specific threshold sample. We subsequently train these embedding slices in an end-to-end fashion, applying triplet loss at its associated threshold to each slice, which results in a collection of unique expression features corresponding to each embedding slice. Moreover, we identify the issue that the traditional triplet loss may struggle to converge when employing the widely-used Batch Hard strategy for mining informative triplets, and introduce a novel loss termed dual triplet loss to address it. Extensive evaluations demonstrate the superior performance of the proposed approach on both posed and spontaneous facial expression datasets.
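The slicing idea described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it uses the standard triplet loss with Batch Hard mining on each slice and does not attempt the dual triplet loss, which the abstract names but does not define. The function names, the equal-width slicing, the evenly spaced thresholds, and the summation of per-slice losses are all assumptions made for illustration. The code only evaluates the loss; actual end-to-end training would need an autodiff framework.

```python
import numpy as np

def batch_hard_triplet_loss(emb, labels, margin):
    """Standard triplet loss with Batch Hard mining on one embedding slice.

    Hypothetical helper: this is the conventional loss, not the paper's
    dual triplet loss.
    """
    # Pairwise Euclidean distances between all samples in the batch.
    diff = emb[:, None, :] - emb[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1) + 1e-12)
    same = labels[:, None] == labels[None, :]
    # Hardest positive: the farthest sample sharing the anchor's label.
    hardest_pos = np.where(same, dist, -np.inf).max(axis=1)
    # Hardest negative: the closest sample with a different label.
    hardest_neg = np.where(~same, dist, np.inf).min(axis=1)
    # Hinge on the margin (the "threshold" in the abstract's terms).
    return np.maximum(hardest_pos - hardest_neg + margin, 0.0).mean()

def multi_threshold_loss(embedding, labels, margins):
    """Split the embedding into equal slices, one per threshold sample.

    Assumes the embedding width is divisible by len(margins) and that
    per-slice losses are simply summed (both are assumptions).
    """
    slices = np.split(embedding, len(margins), axis=1)
    losses = [batch_hard_triplet_loss(s, labels, m)
              for s, m in zip(slices, margins)]
    return float(np.sum(losses)), losses

# Toy batch: 8 samples, 12-dimensional embedding, 4 expression classes.
rng = np.random.default_rng(0)
embedding = rng.normal(size=(8, 12))
labels = np.array([0, 0, 1, 1, 2, 2, 3, 3])
# Thresholds sampled evenly over an assumed valid range [0.1, 0.4].
margins = np.linspace(0.1, 0.4, 4)
total, per_slice = multi_threshold_loss(embedding, labels, margins)
```

Here each 3-dimensional slice is scored against its own threshold, so a single forward pass yields one loss term per threshold sample instead of requiring a separate validation run for each candidate threshold.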


Pages: 12
