Region and Temporal Dependency Fusion for Multi-label Action Unit Detection

被引:0
|
作者
Mei, Chuanneng [1 ]
Jiang, Fei [1 ]
Shen, Ruimin [1 ]
Hu, Qiaoping [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
关键词
FACIAL EXPRESSION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic Facial Action Unit (AU) detection from videos increases numerous interests over the past years due to its importance for analyzing facial expressions. Many proposed methods face challenges in detecting sparse face regions for different AUs, in the fusion of temporal dependency, and in learning multiple AUs simultaneously. In this paper, we propose a novel deep neural network architecture for AU detection to model above-mentioned challenges jointly. Firstly, to capture the region sparsity, we design a region pooling layer after a fully convolutional network to extract per-region features for each AU. Secondly, in order to integrate temporal dependency, Long Short Term Memory (LSTM) is stacked on the top of regional features. Finally, the regional features and outputs of LSTMs are utilized together to produce per-frame multi-label predictions. Experimental results on three large spontaneous AU datasets, BP4D, GFT and DISFA, have demonstrated our work outperforms state-of-the-art methods. On three datasets, our work has highest average F1 and AUC scores with an average F1 score improvement of 4.8% on BP4D, 12.7% on GFT and 14.3% on DISFA, and an average AUC score improvement of 27.4% on BP4D and 33.5% on DISFA.
引用
收藏
页码:848 / 853
页数:6
相关论文
共 50 条
  • [31] POSTER: Multi-Block Fusion Mechanism for Multi-label Vulnerability Detection in Smart Contracts
    Van Tong
    Cuong Dao
    Thep Dong
    Hai Anh Tran
    Duc Tran
    Tran, Truong X.
    PROCEEDINGS OF THE 19TH ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, ACM ASIACCS 2024, 2024, : 1955 - 1957
  • [32] Online multi-label dependency topic models for text classification
    Sophie Burkhardt
    Stefan Kramer
    Machine Learning, 2018, 107 : 859 - 886
  • [33] PAT: Position-Aware Transformer for Dense Multi-Label Action Detection
    Sardari, Faegheh
    Mustafa, Armin
    Jackson, Philip J. B.
    Hilton, Adrian
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2980 - 2989
  • [34] Limited-Supervised Multi-Label Learning with Dependency Noise
    Wang, Yejiang
    Zhao, Yuhai
    Wang, Zhengkui
    Shan, Wen
    Wang, Xingwei
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 15662 - 15670
  • [35] Semantic representation and dependency learning for multi-label image recognition
    Pu, Tao
    Sun, Mingzhan
    Wu, Hefeng
    Chen, Tianshui
    Tian, Ling
    Lin, Liang
    NEUROCOMPUTING, 2023, 526 : 121 - 130
  • [36] Multi-Label Hashing for Dependency Relations Among Multiple Objectives
    Peng, Liangkang
    Qian, Jiangbo
    Xu, Zhengtao
    Xin, Yu
    Guo, Lijun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1759 - 1773
  • [37] Online multi-label dependency topic models for text classification
    Burkhardt, Sophie
    Kramer, Stefan
    MACHINE LEARNING, 2018, 107 (05) : 859 - 886
  • [38] A survey on multi-label feature selection from perspectives of label fusion
    Qian, Wenbin
    Huang, Jintao
    Xu, Fankang
    Shu, Wenhao
    Ding, Weiping
    INFORMATION FUSION, 2023, 100
  • [39] Multi-Label Text Classification Based on Label Combination and Fusion of Attentions
    Wu, Xinke
    Sun, Jun
    Li, Zhihua
    Computer Engineering and Applications, 2023, 59 (06) : 125 - 133
  • [40] Multi-Label Multi-Class Action Recognition With Deep Spatio-Temporal Layers Based on Temporal Gaussian Mixtures
    Joefrie, Yuri Yudhaswana
    Aono, Masaki
    IEEE ACCESS, 2020, 8 : 173566 - 173575