Deep Feature Flow for Video Recognition

被引:416
|
作者
Zhu, Xizhou [1 ,2 ]
Xiong, Yuwen [2 ]
Dai, Jifeng [2 ]
Yuan, Lu [2 ]
Wei, Yichen [2 ]
机构
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[2] Microsoft Res, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR.2017.441
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep convolutional neutral networks have achieved great success on image recognition tasks. Yet, it is non-trivial to transfer the state-of-the-art image recognition networks to videos as per-frame evaluation is too slow and unaffordable. We present deep feature flow, a fast and accurate framework for video recognition. It runs the expensive convolutional sub-network only on sparse key frames and propagates their deep feature maps to other frames via a flow field. It achieves significant speedup as flow computation is relatively fast. The end-to-end training of the whole architecture significantly boosts the recognition accuracy. Deep feature flow is flexible and general. It is validated on two video datasets on object detection and semantic segmentation. It significantly advances the practice of video recognition tasks. Code would be released.
引用
收藏
页码:4141 / 4150
页数:10
相关论文
共 50 条
  • [21] An Interpretable Deep Learning-Based Feature Reduction in Video-Based Human Activity Recognition
    Dutt, Micheal
    Goodwin, Morten
    Omlin, Christian W.
    IEEE ACCESS, 2024, 12 : 187947 - 187963
  • [22] Collaborative Spatiotemporal Feature Learning for Video Action Recognition
    Li, Chao
    Zhong, Qiaoyong
    Xie, Di
    Pu, Shiliang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7864 - 7873
  • [23] System of Feature Extraction for Video Pattern Recognition on FPGA
    Sergiyenko, Anatolij
    Serhiienko, Pavlo
    Orlova, Maria
    Molchanov, Oleksii
    2019 IEEE 2ND UKRAINE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (UKRCON-2019), 2019, : 1175 - 1178
  • [24] An Adaptive Semisupervised Feature Analysis for Video Semantic Recognition
    Luo, Minnan
    Chang, Xiaojun
    Nie, Liqiang
    Yang, Yi
    Hauptmann, Alexander G.
    Zheng, Qinghua
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (02) : 648 - 660
  • [25] Motion Feature Combination for Human Action Recognition in Video
    Meng, Hongying
    Pears, Nick
    Bailey, Chris
    COMPUTER VISION AND COMPUTER GRAPHICS, 2008, 21 : 151 - +
  • [26] Facial Expression Recognition in Video with Multiple Feature Fusion
    Chen, Junkai
    Chen, Zenghai
    Chi, Zheru
    Fu, Hong
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (01) : 38 - 50
  • [27] Geometry Guided Feature Aggregation in Video Face Recognition
    Peng, Baoyun
    Jin, Xiao
    Wu, Yichao
    Li, Dongsheng
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2670 - 2677
  • [28] Deep Temporal Feature Encoding for Action Recognition
    Li, Lin
    Zhang, Zhaoxiang
    Huang, Yan
    Wang, Liang
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1109 - 1114
  • [29] DEEP SELECTIVE FEATURE LEARNING FOR ACTION RECOGNITION
    Li, Ziqiang
    Ge, Yongxin
    Feng, Jinyuan
    Qi, Xiaolei
    Yu, Jiaruo
    Yu, Hui
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [30] Unsupervised local deep feature for image recognition
    Wang, Yang
    Wang, Xinggang
    Liu, Wenyu
    INFORMATION SCIENCES, 2016, 351 : 67 - 75