Motion Complementary Network for Efficient Action Recognition

Cited by: 0

Authors:

Cheng, Ke [1,2,3]

Zhang, Yifan [1,2,3]

Li, Chenghua [1,2,3]

Cheng, Jian [1,2,3,4]

Lu, Hanqing [1,2,3]

Affiliations:

[1] Chinese Acad Sci, Inst Automat, NLPR, Beijing 100190, Peoples R China

[2] Chinese Acad Sci, Inst Automat, AIRIA, Beijing 100190, Peoples R China

[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

[4] CASIA, Res Ctr Brain Inspired Intelligence, Beijing 100190, Peoples R China

Source:

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021

DOI:

10.1109/ICPR48806.2021.9412783

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Two-stream ConvNets and 3D ConvNets are both widely used in action recognition. However, neither is efficient to deploy: computing optical flow is very slow, and 3D convolution is computationally expensive. Our key insight is that the motion information from optical flow maps is complementary to the motion information from a 3D ConvNet. Instead of simply combining the two methods, we propose two novel techniques that improve performance at lower computational cost: fixed-motion-accumulation and balanced-motion-policy. Building on these techniques, we propose a novel framework, the Efficient Motion Complementary Network (EMC-Net), that offers both high efficiency and high performance. We conduct extensive experiments on the Kinetics, UCF101, and Jester datasets. We achieve notably higher performance while consuming 4.7x less computation than I3D, 11.6x less computation than ECO, and 17.8x less computation than R(2+1)D. On the Kinetics dataset, we achieve 2.6% better performance than the recently proposed TSM while using 1.4x fewer FLOPs and running 10 ms faster on a K80 GPU.

Pages: 1543-1549

Page count: 7
