Dynamic Scene Recognition with Complementary Spatiotemporal Features

Cited by: 24
Authors
Feichtenhofer, Christoph [1]
Pinz, Axel [1]
Wildes, Richard P. [2, 3]
Affiliations
[1] Graz Univ Technol, Inst Elect Measurement & Measurement Signal Proc, Graz, Austria
[2] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON, Canada
[3] York Univ, Ctr Vis Res, Toronto, ON, Canada
Funding
Austrian Science Fund (FWF); Natural Sciences and Engineering Research Council of Canada (NSERC);
Keywords
Dynamic scenes; feature representations; visual spacetime; image dynamics; spatiotemporal orientation; SPATIAL PYRAMIDS; IMAGE; REPRESENTATION; PERCEPTION; COMPACT;
DOI
10.1109/TPAMI.2016.2526008
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents Dynamically Pooled Complementary Features (DPCF), a unified approach to dynamic scene recognition that analyzes a short video clip in terms of its spatial, temporal and color properties. The complementarity of these properties is preserved through all main steps of processing, including primitive feature extraction, coding and pooling. In the feature extraction step, spatial orientations capture static appearance, spatiotemporal oriented energies capture image dynamics and color statistics capture chromatic information. Subsequently, primitive features are encoded into a mid-level representation that has been learned for the task of dynamic scene recognition. Finally, a novel dynamic spacetime pyramid is introduced. This dynamic pooling approach can handle both global as well as local motion by adapting to the temporal structure, as guided by pooling energies. The resulting system provides online recognition of dynamic scenes that is thoroughly evaluated on the two current benchmark datasets and yields best results to date on both datasets. In-depth analysis reveals the benefits of explicitly modeling feature complementarity in combination with the dynamic spacetime pyramid, indicating that this unified approach should be well-suited to many areas of video analysis.
Pages: 2389-2401
Number of pages: 13