Motion Complementary Network for Efficient Action Recognition

被引:0
|
作者
Cheng, Ke [1 ,2 ,3 ]
Zhang, Yifan [1 ,2 ,3 ]
Li, Chenghua [1 ,2 ,3 ]
Cheng, Jian [1 ,2 ,3 ,4 ]
Lu, Hanqing [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Automat, NLPR, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Inst Automat, AIRIA, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[4] CASIA, Res Ctr Brain Inspired Intelligence, Beijing 100190, Peoples R China
关键词
D O I
10.1109/ICPR48806.2021.9412783
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Both two-stream ConvNet and 3D ConvNet are widely used in action recognition. However, both methods are not efficient for deployment: calculating optical flow is very slow, while 3D convolution is computationally expensive. Our key insight is that the motion information from optical flow maps is complementary to the motion information from 3D ConvNet. Instead of simply combining these two methods, we propose two novel techniques to enhance the performance with less computational cost: fixed-motion-accumulation and balanced-motion-policy. With these two techniques, we propose a novel framework called Efficient Motion Complementary Network(EMC-Net) that enjoys both high efficiency and high performance. We conduct extensive experiments on Kinetics, UCF101, and Jester datasets. We achieve notably higher performance while consuming 4.7x less computation than I3D, 11.6 x less computation than ECO, 17.8x less computation than R(2+1)D. On Kinetics dataset, we achieve 2.6% better performance than the recent proposed TSM with 1.4 x fewer FLOPs and 10ms faster on K80 GPU.
引用
收藏
页码:1543 / 1549
页数:7
相关论文
共 50 条
  • [1] Differential motion attention network for efficient action recognition
    Liu, Caifeng
    Gu, Fangjie
    VISUAL COMPUTER, 2025, 41 (03): : 1719 - 1731
  • [2] Mixed Resolution Network with hierarchical motion modeling for efficient action recognition
    Lu, Xiusheng
    Zhao, Sicheng
    Cheng, Lechao
    Zheng, Ying
    Fan, Xueqiao
    Song, Mingli
    KNOWLEDGE-BASED SYSTEMS, 2024, 294
  • [3] PAN: Persistent Appearance Network with an Efficient Motion Cue for Fast Action Recognition
    Zhang, Can
    Zou, Yuexian
    Chen, Guang
    Gan, Lei
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 500 - 509
  • [4] SkeletonCapsuleNet: An Efficient Network for Action Recognition
    Yu, Yue
    Tian, Niehao
    Chen, Xiangru
    Li, Ying
    2018 8TH INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV), 2018, : 74 - 77
  • [5] Motion Feature Network: Fixed Motion Filter for Action Recognition
    Lee, Myunggi
    Lee, Seungeui
    Son, Sungjoon
    Park, Gyutae
    Kwak, Nojun
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 392 - 408
  • [6] Action Keypoint Network for Efficient Video Recognition
    Chen, Xu
    Han, Yahong
    Wang, Xiaohan
    Sun, Yifan
    Yang, Yi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4980 - 4993
  • [7] UNSUPERVISED MOTION REPRESENTATION ENHANCED NETWORK FOR ACTION RECOGNITION
    Yang, Xiaohang
    Kong, Lingtong
    Yang, Jie
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2445 - 2449
  • [8] A motion-aware ConvLSTM network for action recognition
    Mahshid Majd
    Reza Safabakhsh
    Applied Intelligence, 2019, 49 : 2515 - 2521
  • [9] MDNET: MOTION DISTINCTION NETWORK FOR EFFECTIVE ACTION RECOGNITION
    Jin, Rongrong
    Ye, Weirong
    Wang, Xiao
    Yan, Yan
    Wang, Hanzi
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3236 - 3240
  • [10] A spatiotemporal and motion information extraction network for action recognition
    Wang, Wei
    Wang, Xianmin
    Zhou, Mingliang
    Wei, Xuekai
    Li, Jing
    Ren, Xiaojun
    Zong, Xuemei
    WIRELESS NETWORKS, 2024, 30 (06) : 5389 - 5405