HUMAN ACTION RECOGNITION WITH OPTIMIZED VIDEO DENSELY SAMPLING

被引:0
|
作者
Wang, Bin [1 ]
Liu, Yu [1 ]
Xiao, Wenhua [1 ]
Xiong, Zhihui [1 ]
Wang, Wei [1 ]
Zhang, Maojun [1 ]
机构
[1] Natl Univ Def Technol, Coll Informat Syst & Management, Changsha 410073, Hunan, Peoples R China
关键词
video representation; action recognition; spatiotemporal local features; dense sample; shift and scale invariant;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Dense sample video patches have been used for video representation in action recognition and achieve better performance than sparse spatiotemporal local features. However, two problems of this method must be considered. First one, many video patches are from background other than human body. Second one, the descriptor is not reliable, since it is neither shift nor scale invariant. To solve these two problems, we proposed an Optimized Video Dense Sampling (OVDS) method combing with dense sampling and spatiotemporal interest points detector. OVDS densely sampled video patches with optimizing the position and scale parameters to guarantee the features are shift and scale invariant. To omit the action unrelated features, we extracted video patches only from human body regions instead of the whole videos. Experimental results on KTH, Weizmann, UCF, Hoollywood2 datasets showed that the features detected by OVDS are informative and reliable for action recognition, and achieve better performance over the existing spatiotemporal local features.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Human Action Recognition in Video
    Singh, Dushyant Kumar
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2018, PT I, 2019, 955 : 54 - 66
  • [2] MGSampler: An Explainable Sampling Strategy for Video Action Recognition
    Zhi, Yuan
    Tong, Zhan
    Wang, Limin
    Wu, Gangshan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1493 - 1502
  • [3] A Closer Look at Video Sampling for Sequential Action Recognition
    Zhang, Yu
    Zhao, Junjie
    Chen, Zhengjie
    Mi, Siya
    Zhu, Hongyuan
    Geng, Xin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7503 - 7514
  • [4] Combining densely sampled form and motion for human action recognition
    Schindler, Konrad
    van Gool, Luc
    PATTERN RECOGNITION, 2008, 5096 : 122 - 131
  • [5] Video Analytics Framework for Human Action Recognition
    Khan, Muhammad Attique
    Alhaisoni, Majed
    Armghan, Ammar
    Alenezi, Fayadh
    Tariq, Usman
    Nam, Yunyoung
    Akram, Tallha
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (03): : 3841 - 3859
  • [6] Video and Image Complexity in Human Action Recognition
    Burgos-Madrigal, Andrea
    Altamirano-Robles, Leopoldo
    PROGRESS IN ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, 2021, 13055 : 349 - 359
  • [7] Combining Video Subsequences for Human Action Recognition
    Onofri, Leonardo
    Soda, Paolo
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 597 - 600
  • [8] Stereoscopic Video Description for Human Action Recognition
    Mademlis, Ioannis
    Iosifidis, Alexandros
    Tefas, Anastasios
    Nikolaidis, Nikos
    Pitas, Ioannis
    2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR MULTIMEDIA, SIGNAL AND VISION PROCESSING (CIMSIVP), 2014, : 1 - 6
  • [9] Automatic Video Descriptor for Human Action Recognition
    Perera, Minoli
    Farook, Cassim
    Madurapperuma, A. P.
    2017 NATIONAL INFORMATION TECHNOLOGY CONFERENCE (NITC), 2017, : 61 - 66
  • [10] Video-Based Human Action Recognition Using Spatial Pyramid Pooling and 3D Densely Convolutional Networks
    Yang, Wanli
    Chen, Yimin
    Huang, Chen
    Gao, Mingke
    FUTURE INTERNET, 2018, 10 (12):