Region and Temporal Dependency Fusion for Multi-label Action Unit Detection

被引：0

作者：

Mei, Chuanneng ^{[1
]}

Jiang, Fei ^{[1
]}

Shen, Ruimin ^{[1
]}

Hu, Qiaoping ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China

来源：

2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2018年

关键词：

FACIAL EXPRESSION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Automatic Facial Action Unit (AU) detection from videos increases numerous interests over the past years due to its importance for analyzing facial expressions. Many proposed methods face challenges in detecting sparse face regions for different AUs, in the fusion of temporal dependency, and in learning multiple AUs simultaneously. In this paper, we propose a novel deep neural network architecture for AU detection to model above-mentioned challenges jointly. Firstly, to capture the region sparsity, we design a region pooling layer after a fully convolutional network to extract per-region features for each AU. Secondly, in order to integrate temporal dependency, Long Short Term Memory (LSTM) is stacked on the top of regional features. Finally, the regional features and outputs of LSTMs are utilized together to produce per-frame multi-label predictions. Experimental results on three large spontaneous AU datasets, BP4D, GFT and DISFA, have demonstrated our work outperforms state-of-the-art methods. On three datasets, our work has highest average F1 and AUC scores with an average F1 score improvement of 4.8% on BP4D, 12.7% on GFT and 14.3% on DISFA, and an average AUC score improvement of 27.4% on BP4D and 33.5% on DISFA.

引用

页码：848 / 853

页数：6

共 50 条

[21] Improved Conditional Dependency Networks for Multi-label Classification
Guo Tao
Li Guiyang
2015 SEVENTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2015), 2015, : 561 - 565
[22] Semi-supervised Learning for Multi-label Video Action Detection
Zhang, Hongcheng
Zhao, Xu
Wang, Dongqi
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2124 - 2134
[23] Region attention and label embedding for facial action unit detection
Song, Wei
Li, Dong
MULTIMEDIA SYSTEMS, 2025, 31 (02)
[24] Multi-label Classification of Legal Text with Fusion of Label Relations
Song Z.
Li Y.
Li D.
Wang S.
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (02): : 185 - 192
[25] Outliers Detection in Multi-label Datasets
Bello, Marilyn
Napoles, Gonzalo
Morera, Rafael
Vanhoof, Koen
Bello, Rafael
ADVANCES IN SOFT COMPUTING, MICAI 2020, PT I, 2020, 12468 : 65 - 75
[26] Multi-label learning with missing labels for image annotation and facial action unit recognition
Wu, Baoyuan
Lyu, Siwei
Hu, Bao-Gang
Ji, Qiang
PATTERN RECOGNITION, 2015, 48 (07) : 2279 - 2289
[27] Joint Patch and Multi-label Learning for Facial Action Unit and Holistic Expression Recognition
Zhao, Kaili
Chu, Wen-Sheng
De la Torre, Fernando
Cohn, Jeffrey F.
Zhang, Honggang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (08) : 3931 - 3946
[28] Source Detection With Multi-Label Classification
Vijayamohanan, Jayakrishnan
Gupta, Arjun
Noakoasteen, Oameed
Goudos, Sotirios K. K.
Christodoulou, Christos G.
IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2023, 4 : 336 - 345
[29] Detection and Multi-label Classification of Bats
Dierckx, Lucile
Beauvois, Melanie
Nijssen, Siegfried
ADVANCES IN INTELLIGENT DATA ANALYSIS XX, IDA 2022, 2022, 13205 : 53 - 65
[30] Action Unit Detection with Region Adaptation, Multi-labeling Learning and Optimal Temporal Fusing
Li, Wei
Abtahi, Farnaz
Zhu, Zhigang
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6766 - 6775

← 1 2 3 4 5 →