Video Covariance Matrix Logarithm for Human Action Recognition in Videos

被引:0
|
作者
Bilinski, Piotr [1 ]
Bremond, Francois [1 ]
机构
[1] INRIA Sophia Antipolis, STARS Team, 2004 Route Lucioles,BP93, F-06902 Sophia Antipolis, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a new local spatio-temporal descriptor for videos and we propose a new approach for action recognition in videos based on the introduced descriptor. The new descriptor is called the Video Covariance Matrix Logarithm (VCML). The VCML descriptor is based on a covariance matrix representation, and it models relationships between different low-level features, such as intensity and gradient. We apply the VCML descriptor to encode appearance information of local spatio-temporal video volumes, which are extracted by the Dense Trajectories. Then, we present an extensive evaluation of the proposed VCML descriptor with the Fisher vector encoding and the Support Vector Machines on four challenging action recognition datasets. We show that the VCML descriptor achieves better results than the state-of-the-art appearance descriptors. Moreover, we present that the VCML descriptor carries complementary information to the HOG descriptor and their fusion gives a significant improvement in action recognition accuracy. Finally, we show that the VCML descriptor improves action recognition accuracy in comparison to the state-of-the-art Dense Trajectories, and that the proposed approach achieves superior performance to the state-of-the-art methods.
引用
收藏
页码:2140 / 2147
页数:8
相关论文
共 50 条
  • [31] Diving deep into human action recognition in aerial videos: A survey
    Kapoor, Surbhi
    Sharma, Akashdeep
    Verma, Amandeep
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
  • [32] Human Action Recognition in Unconstrained Videos by Explicit Motion Modeling
    Jiang, Yu-Gang
    Dai, Qi
    Liu, Wei
    Xue, Xiangyang
    Ngo, Chong-Wah
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) : 3781 - 3795
  • [33] Deep Learning-Based Human Action Recognition in Videos
    Li, Song
    Shi, Qian
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2025, 34 (01)
  • [34] Motion keypoint trajectory and covariance descriptor for human action recognition
    Yi, Yun
    Wang, Hanli
    VISUAL COMPUTER, 2018, 34 (03): : 391 - 403
  • [35] Motion keypoint trajectory and covariance descriptor for human action recognition
    Yun Yi
    Hanli Wang
    The Visual Computer, 2018, 34 : 391 - 403
  • [36] An Overview of Action Recognition in Videos
    Buric, M.
    Pobar, M.
    Kos, M. Ivasic
    2017 40TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2017, : 1098 - 1103
  • [37] Kernelized Covariance for Action Recognition
    Cavazza, Jacopo
    Zunino, Andrea
    Biagio, Marco San
    Murino, Vittorio
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 408 - 413
  • [38] High dimensional covariance matrix estimation by penalizing the matrix-logarithm transformed likelihood
    Yu, Philip L. H.
    Wang, Xiaohang
    Zhu, Yuanyuan
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 114 : 12 - 25
  • [39] Modeling Video Activity with Dynamic Phrases and Its Application to Action Recognition in Tennis Videos
    Vainstein, Jonathan
    Manera, Jose F.
    Negri, Pablo
    Delrieux, Claudio
    Maguitman, Ana
    PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827 : 909 - 916
  • [40] Video-Based Action Recognition Using Dimension Reduction of Deep Covariance Trajectories
    Dai, Mengyu
    Srivastava, Anuj
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 621 - 630