Dynamic Scene Recognition with Complementary Spatiotemporal Features

被引：24

作者：

Feichtenhofer, Christoph ^{[1
]}

Pinz, Axel ^{[1
]}

Wildes, Richard P. ^{[2
,3
]}

机构：

[1] Graz Univ Technol, Inst Elect Measurement & Measurement Signal Proc, Graz, Austria

[2] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON, Canada

[3] York Univ, Ctr Vis Res, Toronto, ON, Canada

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2016年 / 38卷 / 12期

基金：

奥地利科学基金会; 加拿大自然科学与工程研究理事会;

关键词：

Dynamic scenes; feature representations; visual spacetime; image dynamics; spatiotemporal orientation; SPATIAL PYRAMIDS; IMAGE; REPRESENTATION; PERCEPTION; COMPACT;

D O I：

10.1109/TPAMI.2016.2526008

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents Dynamically Pooled Complementary Features (DPCF), a unified approach to dynamic scene recognition that analyzes a short video clip in terms of its spatial, temporal and color properties. The complementarity of these properties is preserved through all main steps of processing, including primitive feature extraction, coding and pooling. In the feature extraction step, spatial orientations capture static appearance, spatiotemporal oriented energies capture image dynamics and color statistics capture chromatic information. Subsequently, primitive features are encoded into a mid-level representation that has been learned for the task of dynamic scene recognition. Finally, a novel dynamic spacetime pyramid is introduced. This dynamic pooling approach can handle both global as well as local motion by adapting to the temporal structure, as guided by pooling energies. The resulting system provides online recognition of dynamic scenes that is thoroughly evaluated on the two current benchmark datasets and yields best results to date on both datasets. In-depth analysis reveals the benefits of explicitly modeling feature complementarity in combination with the dynamic spacetime pyramid, indicating that this unified approach should be well-suited to many areas of video analysis.

引用

页码：2389 / 2401

页数：13

共 50 条

[41] Local spatiotemporal features for dynamic texture synthesis
Rocio A Lizarraga-Morales
Yimo Guo
Guoying Zhao
Matti Pietikäinen
Raul E Sanchez-Yanez
EURASIP Journal on Image and Video Processing, 2014 (1)
[42] Local spatiotemporal features for dynamic texture synthesis
Lizarraga-Morales, Rocio A.
Guo, Yimo
Zhao, Guoying
Pietikainen, Matti
Sanchez-Yanez, Raul E.
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2014,
[43] Dynamic Features for Iris Recognition
da Costa, Ronaldo Martins
Gonzaga, Adilson
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (04): : 1072 - 1082
[44] Spatiotemporal features representation with dynamic mode decomposition for hand gesture recognition using deep neural networks
Sharma, Bhavana
Panda, Jeebananda
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 3745 - 3759
[45] SPEAKER RECOGNITION BY STATISTICAL FEATURES AND DYNAMIC FEATURES
FURUI, S
REVIEW OF THE ELECTRICAL COMMUNICATIONS LABORATORIES, 1982, 30 (03): : 467 - 482
[46] Spatiotemporal features representation with dynamic mode decomposition for hand gesture recognition using deep neural networks
Bhavana Sharma
Jeebananda Panda
Signal, Image and Video Processing, 2024, 18 : 3745 - 3759
[47] Touch Gesture Recognition Using Spatiotemporal Fusion Features
Li, Yun-Kai
Meng, Qing-Hao
Zhang, Hong-Wei
IEEE SENSORS JOURNAL, 2022, 22 (01) : 428 - 437
[48] Spatiotemporal Features for Action Recognition and Salient Event Detection
Rapantzikos, Konstantinos
Avrithis, Yannis
Kollias, Stefanos
COGNITIVE COMPUTATION, 2011, 3 (01) : 167 - 184
[49] Embedding Sequential Information into Spatiotemporal Features for Action Recognition
Ye, Yuancheng
Tian, Yingli
PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 1110 - 1118
[50] Video Recognition of Human Fall Based on Spatiotemporal Features
Wang, Kai
Zhao, Youjin
Xiong, Qingyu
Shen, Xiling
Fan, Min
Gao, Min
INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2016, 22 (02): : 303 - 309

← 1 2 3 4 5 →