Video Covariance Matrix Logarithm for Human Action Recognition in Videos

被引:0
|
作者
Bilinski, Piotr [1 ]
Bremond, Francois [1 ]
机构
[1] INRIA Sophia Antipolis, STARS Team, 2004 Route Lucioles,BP93, F-06902 Sophia Antipolis, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a new local spatio-temporal descriptor for videos and we propose a new approach for action recognition in videos based on the introduced descriptor. The new descriptor is called the Video Covariance Matrix Logarithm (VCML). The VCML descriptor is based on a covariance matrix representation, and it models relationships between different low-level features, such as intensity and gradient. We apply the VCML descriptor to encode appearance information of local spatio-temporal video volumes, which are extracted by the Dense Trajectories. Then, we present an extensive evaluation of the proposed VCML descriptor with the Fisher vector encoding and the Support Vector Machines on four challenging action recognition datasets. We show that the VCML descriptor achieves better results than the state-of-the-art appearance descriptors. Moreover, we present that the VCML descriptor carries complementary information to the HOG descriptor and their fusion gives a significant improvement in action recognition accuracy. Finally, we show that the VCML descriptor improves action recognition accuracy in comparison to the state-of-the-art Dense Trajectories, and that the proposed approach achieves superior performance to the state-of-the-art methods.
引用
收藏
页码:2140 / 2147
页数:8
相关论文
共 50 条
  • [41] The Progress of Human Action Recognition in Videos Based on Deep Learning: A Review
    Luo H.-L.
    Tong K.
    Kong F.-S.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (05): : 1162 - 1173
  • [42] TSNet: Deep Ne work for Human Action Recognition in Hazy Videos
    Chaudhary, Sachin
    Murala, Subrahmanyam
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 3981 - 3986
  • [43] SCNN: SEQUENTIAL CONVOLUTIONAL NEURAL NETWORK FOR HUMAN ACTION RECOGNITION IN VIDEOS
    Yang, Hao
    Yuan, Chunfeng
    Xing, Junliang
    Hu, Weiming
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 355 - 359
  • [44] An efficient and sparse approach for large scale human action recognition in videos
    Cyrille Beaudry
    Renaud Péteri
    Laurent Mascarilla
    Machine Vision and Applications, 2016, 27 : 529 - 543
  • [45] Spatio-Temporal VLAD Encoding for Human Action Recognition in Videos
    Duta, Ionut C.
    Ionescu, Bogdan
    Aizawa, Kiyoharu
    Sebe, Nicu
    MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 : 365 - 378
  • [46] Human action recognition in videos with articulated pose information by deep networks
    M. Farrajota
    João M. F. Rodrigues
    J. M. H. du Buf
    Pattern Analysis and Applications, 2019, 22 : 1307 - 1318
  • [47] An efficient and sparse approach for large scale human action recognition in videos
    Beaudry, Cyrille
    Peteri, Renaud
    Mascarilla, Laurent
    MACHINE VISION AND APPLICATIONS, 2016, 27 (04) : 529 - 543
  • [48] Human Action Recognition in Unconstrained Videos Using Deep Learning Techniques
    Priya, G. G. Lakshmi
    Jain, Mrinal
    Perumal, R. Srinivasa
    Mouli, P. V. S. S. R. Chandra
    INTELLIGENT COMPUTING AND COMMUNICATION, ICICC 2019, 2020, 1034 : 737 - 744
  • [49] Pose primitive based human action recognition in videos or still images
    Thurau, Christian
    Hlavac, Vaclav
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 2955 - +
  • [50] A Generalized Pyramid Matching Kernel for Human Action Recognition in Realistic Videos
    Zhu, Jun
    Zhou, Quan
    Zou, Weijia
    Zhang, Rui
    Zhang, Wenjun
    SENSORS, 2013, 13 (11) : 14398 - 14416