Region and Temporal Dependency Fusion for Multi-label Action Unit Detection

被引:0
|
作者
Mei, Chuanneng [1 ]
Jiang, Fei [1 ]
Shen, Ruimin [1 ]
Hu, Qiaoping [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
关键词
FACIAL EXPRESSION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic Facial Action Unit (AU) detection from videos increases numerous interests over the past years due to its importance for analyzing facial expressions. Many proposed methods face challenges in detecting sparse face regions for different AUs, in the fusion of temporal dependency, and in learning multiple AUs simultaneously. In this paper, we propose a novel deep neural network architecture for AU detection to model above-mentioned challenges jointly. Firstly, to capture the region sparsity, we design a region pooling layer after a fully convolutional network to extract per-region features for each AU. Secondly, in order to integrate temporal dependency, Long Short Term Memory (LSTM) is stacked on the top of regional features. Finally, the regional features and outputs of LSTMs are utilized together to produce per-frame multi-label predictions. Experimental results on three large spontaneous AU datasets, BP4D, GFT and DISFA, have demonstrated our work outperforms state-of-the-art methods. On three datasets, our work has highest average F1 and AUC scores with an average F1 score improvement of 4.8% on BP4D, 12.7% on GFT and 14.3% on DISFA, and an average AUC score improvement of 27.4% on BP4D and 33.5% on DISFA.
引用
收藏
页码:848 / 853
页数:6
相关论文
共 50 条
  • [21] Improved Conditional Dependency Networks for Multi-label Classification
    Guo Tao
    Li Guiyang
    2015 SEVENTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2015), 2015, : 561 - 565
  • [22] Semi-supervised Learning for Multi-label Video Action Detection
    Zhang, Hongcheng
    Zhao, Xu
    Wang, Dongqi
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2124 - 2134
  • [23] Region attention and label embedding for facial action unit detection
    Song, Wei
    Li, Dong
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [24] Multi-label Classification of Legal Text with Fusion of Label Relations
    Song Z.
    Li Y.
    Li D.
    Wang S.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (02): : 185 - 192
  • [25] Outliers Detection in Multi-label Datasets
    Bello, Marilyn
    Napoles, Gonzalo
    Morera, Rafael
    Vanhoof, Koen
    Bello, Rafael
    ADVANCES IN SOFT COMPUTING, MICAI 2020, PT I, 2020, 12468 : 65 - 75
  • [26] Multi-label learning with missing labels for image annotation and facial action unit recognition
    Wu, Baoyuan
    Lyu, Siwei
    Hu, Bao-Gang
    Ji, Qiang
    PATTERN RECOGNITION, 2015, 48 (07) : 2279 - 2289
  • [27] Joint Patch and Multi-label Learning for Facial Action Unit and Holistic Expression Recognition
    Zhao, Kaili
    Chu, Wen-Sheng
    De la Torre, Fernando
    Cohn, Jeffrey F.
    Zhang, Honggang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (08) : 3931 - 3946
  • [28] Source Detection With Multi-Label Classification
    Vijayamohanan, Jayakrishnan
    Gupta, Arjun
    Noakoasteen, Oameed
    Goudos, Sotirios K. K.
    Christodoulou, Christos G.
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2023, 4 : 336 - 345
  • [29] Detection and Multi-label Classification of Bats
    Dierckx, Lucile
    Beauvois, Melanie
    Nijssen, Siegfried
    ADVANCES IN INTELLIGENT DATA ANALYSIS XX, IDA 2022, 2022, 13205 : 53 - 65
  • [30] Action Unit Detection with Region Adaptation, Multi-labeling Learning and Optimal Temporal Fusing
    Li, Wei
    Abtahi, Farnaz
    Zhu, Zhigang
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6766 - 6775