Human behaviour recognition with mid-level representations for crowd understanding and analysis

被引:2
|
作者
Sun, Bangyong [1 ,2 ]
Yuan, Nianzeng [1 ]
Li, Shuying [4 ]
Wu, Siyuan [2 ]
Wang, Nan [2 ,3 ]
机构
[1] Xian Univ Technol, Coll Printing Packaging Engn & Digital Media, Xian 710048, Shaanxi, Peoples R China
[2] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Key Lab Spectral Imaging Technol CAS, Xian 710119, Shaanxi, Peoples R China
[3] Univ Chinese Acad Sci, 19A Yuquanlu, Beijing 100049, Peoples R China
[4] Xian Univ Posts & Telecommun, Sch Automat, Xian 710121, Shaanxi, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
VIDEOS;
D O I
10.1049/ipr2.12147
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowd understanding and analysis have received increasing attention for couples of decades, and development of human behaviour recognition strongly supports the application of crowd understanding and analysis. Human behaviour recognition usually seeks to automatically analyse ongoing movements and actions in different camera views by using various machine learning methodologies in unknown video clips or image sequences. Compared to other data modalities such as documents and images, processing video data demands much higher computational and storage resources. The idea of using middle level semantic concepts to represent human actions from videos is explored and it is argued that these semantic attributes enable the construction of more descriptive methods for human action recognition. The mid-level attributes, initialized by a cluster processing, are built upon low level features and fully utilize the discrepancies in different action classes, which can capture the importance of each attribute for each action class. In this way, the representation is constructed to be semantically rich and capable of highly discriminative performance even paired with simple linear classifiers. The method is verified on three challenging datasets (KTH, UCF50 and HMDB51), and the experimental results demonstrate that our method achieves better results than the baseline methods on human action recognition.
引用
收藏
页码:3414 / 3424
页数:11
相关论文
共 50 条
  • [41] A mid-level visual concept generation framework for sports analysis
    Tong, XF
    Duan, LY
    Lu, HQ
    Xu, CS
    Tian, Q
    Jin, JS
    2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 646 - 649
  • [42] Practice Analysis for mid-level Pharmacy Workers in South Africa
    Boschmans, Shirley-Anne
    Fogarty, Teri-Lynne
    Schafermeyer, Kenneth W.
    Mallinson, R. Kevin
    PHARMACY EDUCATION, 2015, 15 (01): : 31 - 38
  • [43] MID-LEVEL CHORD TRANSITION FEATURES FOR MUSICAL STYLE ANALYSIS
    Weiss, Christof
    Brand, Fabian
    Mueller, Meinard
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 341 - 345
  • [44] From Mid-Level Policy Analysis to Macro-Level Political Economy
    Labonte, Ronald
    INTERNATIONAL JOURNAL OF HEALTH POLICY AND MANAGEMENT, 2018, 7 (07): : 656 - 658
  • [45] Understanding job satisfaction amongst mid-level cadres in Malawi: the contribution of organisational justice
    McAuliffe, Eilish
    Manafa, Ogenna
    Maseko, Fresier
    Bowie, Cameron
    White, Emma
    REPRODUCTIVE HEALTH MATTERS, 2009, 17 (33) : 80 - 90
  • [46] Mid-level image representations for real-time heart view plane classification of echocardiograms
    Penatti, Otavio A. B.
    Werneck, Rafael de O.
    de Almeida, Waldir R.
    Stein, Bernardo V.
    Pazinato, Daniel V.
    Mendes Junior, Pedro R.
    Torres, Ricardo da S.
    Rocha, Anderson
    COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 66 : 66 - 81
  • [47] A novel mid-level distinctive feature learning for action recognition via diffusion map
    Xu, Wanru
    Miao, Zhenjiang
    Tian, Yi
    NEUROCOMPUTING, 2016, 218 : 185 - 196
  • [48] Revisiting Mid-Level Patterns for Cross-Domain Few-Shot Recognition
    Zou, Yixiong
    Zhang, Shanghang
    Yu, Jianpeng
    Tian, Yonghong
    Moura, Jose M. F.
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 741 - 749
  • [49] Interaction Part Mining: A Mid-Level Approach for Fine-Grained Action Recognition
    Zhou, Yang
    Ni, Bingbing
    Hong, Richang
    Wang, Meng
    Tian, Qi
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3323 - 3331
  • [50] Does embodied training improve the recognition of mid-level expressive movement qualities sonification?
    Radoslaw Niewiadomski
    Maurizio Mancini
    Andrea Cera
    Stefano Piana
    Corrado Canepa
    Antonio Camurri
    Journal on Multimodal User Interfaces, 2019, 13 : 191 - 203