Dynamic Scene Recognition with Complementary Spatiotemporal Features

被引:24
|
作者
Feichtenhofer, Christoph [1 ]
Pinz, Axel [1 ]
Wildes, Richard P. [2 ,3 ]
机构
[1] Graz Univ Technol, Inst Elect Measurement & Measurement Signal Proc, Graz, Austria
[2] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON, Canada
[3] York Univ, Ctr Vis Res, Toronto, ON, Canada
基金
奥地利科学基金会; 加拿大自然科学与工程研究理事会;
关键词
Dynamic scenes; feature representations; visual spacetime; image dynamics; spatiotemporal orientation; SPATIAL PYRAMIDS; IMAGE; REPRESENTATION; PERCEPTION; COMPACT;
D O I
10.1109/TPAMI.2016.2526008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents Dynamically Pooled Complementary Features (DPCF), a unified approach to dynamic scene recognition that analyzes a short video clip in terms of its spatial, temporal and color properties. The complementarity of these properties is preserved through all main steps of processing, including primitive feature extraction, coding and pooling. In the feature extraction step, spatial orientations capture static appearance, spatiotemporal oriented energies capture image dynamics and color statistics capture chromatic information. Subsequently, primitive features are encoded into a mid-level representation that has been learned for the task of dynamic scene recognition. Finally, a novel dynamic spacetime pyramid is introduced. This dynamic pooling approach can handle both global as well as local motion by adapting to the temporal structure, as guided by pooling energies. The resulting system provides online recognition of dynamic scenes that is thoroughly evaluated on the two current benchmark datasets and yields best results to date on both datasets. In-depth analysis reveals the benefits of explicitly modeling feature complementarity in combination with the dynamic spacetime pyramid, indicating that this unified approach should be well-suited to many areas of video analysis.
引用
收藏
页码:2389 / 2401
页数:13
相关论文
共 50 条
  • [1] Spacetime Forests with Complementary Features for Dynamic Scene Recognition
    Feichtenhofer, Christoph
    Pinz, Axel
    Wildes, Richard P.
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
  • [2] Dynamic Scene Recognition Using Spatiotemporal Based DLTP on Spark
    Uddin, Md Azher
    Akhond, Mostafijur Rahman
    Lee, Young-Koo
    IEEE ACCESS, 2018, 6 : 66123 - 66133
  • [3] Selective spatiotemporal features learning for dynamic gesture recognition
    Tang, Xianlun
    Yan, Zhenfu
    Peng, Jiangping
    Hao, Bohui
    Wang, Huiming
    Li, Jie
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 169
  • [4] The Spatiotemporal Dynamics of Scene Gist Recognition
    Larson, Adam M.
    Freeman, Tyler E.
    Ringer, Ryan V.
    Loschky, Lester C.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2014, 40 (02) : 471 - 487
  • [5] Static object imaging features recognition algorithm in dynamic scene mapping
    Junchai Gao
    Bing Han
    Keding Yan
    Multimedia Tools and Applications, 2019, 78 : 33885 - 33898
  • [6] Wide-Baseline Visible Features for Highly Dynamic Scene Recognition
    Kawewong, Aram
    Tangruamsub, Sirinart
    Hasegawa, Osamu
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2009, 5702 : 723 - 731
  • [7] Static object imaging features recognition algorithm in dynamic scene mapping
    Gao, Junchai
    Han, Bing
    Yan, Keding
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (23) : 33885 - 33898
  • [8] Portmanteauing Features for Scene Text Recognition
    Tan, Yew Lee
    Chew, Ernest Yu Kai
    Kong, Adams Wai-Kin
    Kim, Jung-Jae
    Lim, Joo Hwee
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1499 - 1505
  • [9] Convolutional Network Features for Scene Recognition
    Koskela, Markus
    Laaksonen, Jorma
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 1169 - 1172
  • [10] Spatiotemporal Dynamics of Orientation Processing During Scene Recognition
    Ismail, Ahamed Miflah Hussain
    Solomon, Joshua A.
    Hansard, Miles
    Mareschal, Isabelle
    PERCEPTION, 2017, 46 (10) : 1220 - 1221