Dynamic Scene Recognition with Complementary Spatiotemporal Features

被引:24
|
作者
Feichtenhofer, Christoph [1 ]
Pinz, Axel [1 ]
Wildes, Richard P. [2 ,3 ]
机构
[1] Graz Univ Technol, Inst Elect Measurement & Measurement Signal Proc, Graz, Austria
[2] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON, Canada
[3] York Univ, Ctr Vis Res, Toronto, ON, Canada
基金
奥地利科学基金会; 加拿大自然科学与工程研究理事会;
关键词
Dynamic scenes; feature representations; visual spacetime; image dynamics; spatiotemporal orientation; SPATIAL PYRAMIDS; IMAGE; REPRESENTATION; PERCEPTION; COMPACT;
D O I
10.1109/TPAMI.2016.2526008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents Dynamically Pooled Complementary Features (DPCF), a unified approach to dynamic scene recognition that analyzes a short video clip in terms of its spatial, temporal and color properties. The complementarity of these properties is preserved through all main steps of processing, including primitive feature extraction, coding and pooling. In the feature extraction step, spatial orientations capture static appearance, spatiotemporal oriented energies capture image dynamics and color statistics capture chromatic information. Subsequently, primitive features are encoded into a mid-level representation that has been learned for the task of dynamic scene recognition. Finally, a novel dynamic spacetime pyramid is introduced. This dynamic pooling approach can handle both global as well as local motion by adapting to the temporal structure, as guided by pooling energies. The resulting system provides online recognition of dynamic scenes that is thoroughly evaluated on the two current benchmark datasets and yields best results to date on both datasets. In-depth analysis reveals the benefits of explicitly modeling feature complementarity in combination with the dynamic spacetime pyramid, indicating that this unified approach should be well-suited to many areas of video analysis.
引用
收藏
页码:2389 / 2401
页数:13
相关论文
共 50 条
  • [31] Dynamic Parallel Pyramid Networks for Scene Recognition
    Liu, Kai
    Moon, Seungbin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6591 - 6601
  • [32] Temporal Residual Networks for Dynamic Scene Recognition
    Feichtenhofer, Christoph
    Pinz, Axel
    Wildes, Richard P.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7435 - 7444
  • [33] Heterogeneous bag-of-features for object/scene recognition
    Nanni, Loris
    Lumini, Alessandra
    APPLIED SOFT COMPUTING, 2013, 13 (04) : 2171 - 2178
  • [34] Previously fixated visual features improve scene recognition
    Valuch, C.
    Ansorge, U.
    PERCEPTION, 2012, 41 : 124 - 125
  • [35] A Discriminative Representation of Convolutional Features for Indoor Scene Recognition
    Khan, Salman H.
    Hayat, Munawar
    Bennamoun, Mohammed
    Togneri, Roberto
    Sohel, Ferdous A.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3372 - 3383
  • [36] Enhancing Semantic Features with Compositional Analysis for Scene Recognition
    Redi, Miriam
    Merialdo, Bernard
    COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 446 - 455
  • [37] Scene modelling, recognition and tracking with invariant image features
    Skrypnyk, I
    Lowe, DG
    ISMAR 2004: THIRD IEEE AND ACM INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY, 2004, : 110 - 119
  • [38] Automatic Scene Recognition for Digital Camera by Semantic Features
    Li, Jiming
    Qian, Yunta
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1 AND 2, 2008, : 327 - 332
  • [39] Scene recognition based on saliency building and local features
    Chen S.
    Wu C.
    Yu X.
    Chen D.
    International Journal of Digital Content Technology and its Applications, 2011, 5 (10) : 112 - 118
  • [40] Fusing Attention Features and Contextual Information for Scene Recognition
    Peng, Yuqing
    Liu, Xianzi
    Wang, Chenxi
    Xiao, Tengfei
    Li, Tiejun
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (03)