MMA: a multi-view and multi-modality benchmark dataset for human action recognition

Cited by: 5
Authors
Gao, Zan [1 ,2 ]
Han, Tao-tao [1 ,2 ]
Zhang, Hua [1 ,2 ]
Xue, Yan-bing [1 ,2 ]
Xu, Guang-ping [1 ,2 ]
Affiliations
[1] Tianjin Univ Technol, Key Lab Comp Vis & Syst, Minist Educ, Tianjin 300384, Peoples R China
[2] Tianjin Univ Technol, Tianjin Key Lab Intelligence Comp & Novel Softwar, Tianjin 300384, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Action recognition; Benchmark dataset; Multi-view; Multi-modality; Cross-view; Multi-task; Cross-domain; FEATURE-SELECTION
DOI
10.1007/s11042-018-5833-8
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Human action recognition is an active research topic in both the computer vision and machine learning communities, with broad applications including surveillance, biometrics, and human-computer interaction. Although several well-known action datasets have been released over the past decades, they still have limitations, including a limited number of action categories and samples, few camera views, and little variety of scenarios. Moreover, most of them are designed for only a subset of the learning problems, such as the single-view, cross-view, or multi-task learning problem. In this paper, we introduce a multi-view, multi-modality benchmark dataset for human action recognition (abbreviated to MMA). MMA consists of 7080 action samples from 25 action categories, including 15 single-subject actions and 10 double-subject interactive actions, in three views of two different scenarios. Further, we systematically benchmark state-of-the-art approaches on MMA with respect to all three learning problems using different temporal-spatial feature representations. Experimental results demonstrate that MMA is challenging on all three learning problems due to significant intra-class variations, occlusion issues, view and scene variations, and multiple similar action categories. We also provide baselines for the evaluation of existing state-of-the-art algorithms.
Pages: 29383-29404
Number of pages: 22