Video Covariance Matrix Logarithm for Human Action Recognition in Videos

Cited by: 0

Authors:

Bilinski, Piotr [1]

Bremond, Francois [1]

Affiliations:

[1] INRIA Sophia Antipolis, STARS Team, 2004 Route Lucioles, BP93, F-06902 Sophia Antipolis, France

Source:

PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI) | 2015

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we propose a new local spatio-temporal descriptor for videos and a new approach to action recognition in videos based on this descriptor. The new descriptor is called the Video Covariance Matrix Logarithm (VCML). The VCML descriptor is based on a covariance matrix representation and models relationships between different low-level features, such as intensity and gradient. We apply the VCML descriptor to encode appearance information of local spatio-temporal video volumes, which are extracted by Dense Trajectories. We then present an extensive evaluation of the proposed VCML descriptor with Fisher vector encoding and Support Vector Machines on four challenging action recognition datasets. We show that the VCML descriptor achieves better results than state-of-the-art appearance descriptors. Moreover, we show that the VCML descriptor carries information complementary to the HOG descriptor and that their fusion gives a significant improvement in action recognition accuracy. Finally, we show that the VCML descriptor improves action recognition accuracy in comparison to the state-of-the-art Dense Trajectories, and that the proposed approach achieves superior performance to state-of-the-art methods.
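To make the idea of a covariance-matrix-logarithm descriptor concrete, the following is a minimal sketch, not the authors' implementation. It assumes a grayscale spatio-temporal volume and an illustrative feature set (intensity plus first-order gradients along x, y, and t); the function name `covariance_log_descriptor`, the regularization constant `eps`, and the upper-triangle vectorization are all assumptions introduced here, since the abstract does not specify them.

```python
import numpy as np


def covariance_log_descriptor(volume, eps=1e-6):
    """Sketch of a covariance-matrix-logarithm descriptor.

    volume: array of shape (T, H, W) holding grayscale intensities.
    The feature set and eps are illustrative assumptions, not taken
    from the paper.
    """
    volume = np.asarray(volume, dtype=np.float64)

    # First-order gradients along time, height (y), and width (x).
    grad_t, grad_y, grad_x = np.gradient(volume)

    # One row per low-level feature, one column per voxel.
    features = np.stack([volume, grad_x, grad_y, grad_t]).reshape(4, -1)

    # Covariance between features, regularized to stay positive definite.
    cov = np.cov(features) + eps * np.eye(features.shape[0])

    # Matrix logarithm of a symmetric positive definite matrix through
    # its eigendecomposition: log(C) = V diag(log(w)) V^T.
    eigvals, eigvecs = np.linalg.eigh(cov)
    log_cov = (eigvecs * np.log(np.maximum(eigvals, eps))) @ eigvecs.T

    # log(C) is symmetric, so the upper triangle carries all of it.
    return log_cov[np.triu_indices(log_cov.shape[0])]


# Example: a random 5-frame, 8x8 volume gives a 10-dimensional vector.
rng = np.random.default_rng(0)
descriptor = covariance_log_descriptor(rng.random((5, 8, 8)))
print(descriptor.shape)
```

Taking the logarithm maps the covariance matrix from the manifold of symmetric positive definite matrices into a vector space, which is one common reason such descriptors can then be fed to encodings like Fisher vectors and to linear classifiers.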


Pages: 2140-2147

Number of pages: 8

