Human activity recognition algorithm in video sequences based on the fusion of multiple features for realistic and multi-view environment

被引:0
|
作者
Arati Kushwaha
Ashish Khare
Om Prakash
机构
[1] GLA University,Department of Computer Engineering & Applications
[2] University of Allahabad,Department of Electronics and Communication
[3] H.N.B. Garhwal University,Department of Computer Science and Engineering
来源
关键词
Human Activity Recognition; Zernike Moment; Optical Flow; Local Ternary Pattern; Histogram of Oriented Gradients; Support Vector Machine;
D O I
暂无
中图分类号
学科分类号
摘要
Video-based human activity recognition (HAR) is an active and challenging research area in the field of computer vision. The presence of camera motion, irregular motion of humans, varying illumination conditions, complex backgrounds, and variations in the shape and size of human objects in video clips of the same activity category makes human activity recognition more difficult. Therefore, to overcome these challenges, we introduce a novel feature representation technique for human activity recognition based on the fusion of multiple features. This paper presents a robust and view-invariant feature descriptor based on the combination of motion information and the local appearance of human objects for video-based human activity recognition in realistic and multi-view environments. Firstly, we used a combination of Optical Flow (OF) and Histogram of Oriented Gradients (HOG) to compute the dynamic pattern of motion information. Then, we computed shape information by combining Local Ternary Pattern (LTP) and Zernike Moment (ZM) feature descriptors. Finally, a feature fusion strategy is used to integrate the motion information and shape information to construct the final feature vector. The experiments are performed on three different publically available video datasets– IXMAS, CASIA, and TV human interaction (TV-HI) and achieved classification accuracy values are 98.25%, 92.21%, 98.66%, and 96.48% respectively on IXMAS, CASIA Single Person, CASIA Interaction and TV-HI datasets. The results are evaluated in terms of seven different performance measures- accuracy, precision, recall, specificity, F-measure, Matthew's correlation coefficient (MCC) and computation time. The effectiveness of the proposed method is proven by comparing its results with other existing state-of-the-art methods. The obtained results have demonstrated the usefulness of the proposed method.
引用
收藏
页码:22727 / 22748
页数:21
相关论文
共 50 条
  • [31] MULTI-VIEW HUMAN ACTIVITY RECOGNITION USING MOTION FREQUENCY
    Koese, Neslihan
    Babaee, Mohammadreza
    Rigoll, Gerhard
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3963 - 3967
  • [32] An Efficient Multi-view Based Activity Recognition System for Video Surveillance Using Random Forest
    Arunnehru, J.
    Geetha, M. K.
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 2, 2015, 32 : 111 - 122
  • [33] Joint activity recognition and indoor localization with WiFi sensing based on multi-view fusion strategy
    Yan, BeiMing
    Cheng, Wei
    Li, Yong
    Gao, Xiang
    Liu, HuiMin
    DIGITAL SIGNAL PROCESSING, 2022, 129
  • [34] MULTI-VIEW FUSION BASED ON EXPECTATION MAXIMIZATION FOR SAR TARGET RECOGNITION
    Zhang, Yukun
    Guo, Xiansheng
    Ren, Haohao
    Wan, Qun
    Shen, Xiaofeng
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 778 - 781
  • [35] Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features
    Li, Bo
    Cai, Lijun
    Liao, Bo
    Fu, Xiangzheng
    Bing, Pingping
    Yang, Jialiang
    MOLECULES, 2019, 24 (05)
  • [36] Hierarchical multi-view aggregation network for sensor-based human activity recognition
    Zhang, Xiheng
    Wong, Yongkang
    Kankanhalli, Mohan S.
    Geng, Weidong
    PLOS ONE, 2019, 14 (09):
  • [37] Micro-network-based deep convolutional neural network for human activity recognition from realistic and multi-view visual data
    Kushwaha, Arati
    Khare, Ashish
    Prakash, Om
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (18): : 13321 - 13341
  • [38] Micro-network-based deep convolutional neural network for human activity recognition from realistic and multi-view visual data
    Arati Kushwaha
    Ashish Khare
    Om Prakash
    Neural Computing and Applications, 2023, 35 : 13321 - 13341
  • [39] A Color Correction Algorithm of Multi-view Video Based on Depth Segmentation
    Fei, Yue
    Yu, Mei
    Shao, Feng
    Jiang, Gangyi
    ISCSCT 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY, VOL 2, PROCEEDINGS, 2008, : 206 - 209
  • [40] Automatic Paper Recommendation Algorithm Based on Multi-View Fusion TextRCNN
    Yang, Xiuzhang
    Wu, Shuai
    Yang, Qi
    Xiang, Meiyu
    Li, Na
    Zhou, Jisong
    Zhao, Xiaoming
    Computer Engineering and Applications, 2024, 59 (02) : 110 - 119