Region and Temporal Dependency Fusion for Multi-label Action Unit Detection

Cited by: 0
Authors
Mei, Chuanneng [1]
Jiang, Fei [1]
Shen, Ruimin [1]
Hu, Qiaoping [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
Keywords
FACIAL EXPRESSION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Automatic Facial Action Unit (AU) detection from videos has attracted considerable interest over the past years due to its importance for analyzing facial expressions. Many proposed methods face challenges in detecting the sparse face regions relevant to different AUs, in fusing temporal dependency, and in learning multiple AUs simultaneously. In this paper, we propose a novel deep neural network architecture for AU detection that addresses these challenges jointly. First, to capture region sparsity, we design a region pooling layer after a fully convolutional network to extract per-region features for each AU. Second, to integrate temporal dependency, Long Short-Term Memory (LSTM) networks are stacked on top of the regional features. Finally, the regional features and the outputs of the LSTMs are used together to produce per-frame multi-label predictions. Experimental results on three large spontaneous AU datasets, BP4D, GFT and DISFA, demonstrate that our work outperforms state-of-the-art methods. On the three datasets, our work achieves the highest average F1 and AUC scores, with an average F1 score improvement of 4.8% on BP4D, 12.7% on GFT and 14.3% on DISFA, and an average AUC score improvement of 27.4% on BP4D and 33.5% on DISFA.
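The average F1 score quoted in the abstract is commonly computed in AU detection by scoring each AU as a separate binary problem over all frames and then averaging across AUs. The sketch below illustrates that convention on toy data. It is an assumption about the evaluation protocol, not a detail confirmed by this record; the function names `f1_score` and `average_f1` and the two-AU labels are hypothetical.

```python
from typing import List, Sequence


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Binary F1 for a single AU over all frames; 0.0 when undefined."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def average_f1(labels: List[List[int]], preds: List[List[int]]) -> float:
    """Mean of per-AU F1 scores. Rows are frames, columns are AUs."""
    num_aus = len(labels[0])
    per_au = [
        f1_score([row[k] for row in labels], [row[k] for row in preds])
        for k in range(num_aus)
    ]
    return sum(per_au) / num_aus


# Toy data (assumed for illustration): 4 frames, 2 hypothetical AUs.
labels = [[1, 0], [1, 1], [0, 1], [0, 0]]
preds = [[1, 0], [0, 1], [0, 1], [0, 1]]
score = average_f1(labels, preds)
print(f"average F1 over AUs: {score:.4f}")
```

Averaging per-AU scores gives every AU equal weight regardless of how often it occurs, which matters because AU occurrence rates are typically highly imbalanced.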
Pages: 848-853
Page count: 6