Video benchmarks of human action datasets: a review

被引:48
|
作者
Singh, Tej [1 ]
Vishwakarma, Dinesh Kumar [2 ]
机构
[1] Delhi Technol Univ, Dept Elect & Commun Engn, New Delhi, India
[2] Delhi Technol Univ, Dept Informat Technol, New Delhi, India
关键词
Human action and activity recognition; Survey; RGB dataset; RGB-depth (RGB-D) dataset; HUMAN ACTIVITY RECOGNITION; UNIFIED FRAMEWORK; SILHOUETTE; MOTION; PATTERNS; FEATURES; 3D;
D O I
10.1007/s10462-018-9651-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision-based Human activity recognition is becoming a trendy area of research due to its wide application such as security and surveillance, human-computer interactions, patients monitoring system, and robotics. In the past two decades, there are several publically available human action, and activity datasets are reported based on modalities, view, actors, actions, and applications. The objective of this survey paper is to outline the different types of video datasets and highlights their merits and demerits under practical considerations. Based on the available information inside the dataset we can categorise these datasets into RGB (Red, Green, and Blue) and RGB-D(depth). The most prominent challenges involved in these datasets are occlusions, illumination variation, view variation, annotation, and fusion of modalities. The key specification of these datasets is discussed such as resolutions, frame rate, actions/actors, background, and application domain. We have also presented the state-of-the-art algorithms in a tabular form that give the best performance on such datasets. In comparison with earlier surveys, our works give a better presentation of datasets on the well-organised comparison, challenges, and latest evaluation technique on existing datasets.
引用
收藏
页码:1107 / 1154
页数:48
相关论文
共 50 条
  • [1] Video benchmarks of human action datasets: a review
    Tej Singh
    Dinesh Kumar Vishwakarma
    Artificial Intelligence Review, 2019, 52 : 1107 - 1154
  • [2] A survey of video datasets for human action and activity recognition
    Chaquet, Jose M.
    Carmona, Enrique J.
    Fernandez-Caballero, Antonio
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (06) : 633 - 659
  • [3] Human action recognition approaches with video datasets-A survey
    Ozyer, Tansel
    Ak, Duygu Selin
    Alhajj, Reda
    KNOWLEDGE-BASED SYSTEMS, 2021, 222
  • [4] MetaVD: A Meta Video Dataset for enhancing human action recognition datasets
    Yoshikawa, Yuya
    Shigeto, Yutaro
    Takeuchi, Akikazu
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 212
  • [5] A Critical Review of Action Recognition Benchmarks
    Hassner, Tal
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 245 - 250
  • [6] Exploring Action Recognition in Endoscopy Video Datasets
    Tian, Yuchen
    Paheding, Sidike
    Azimi, Ehsan
    Lee, Eung-Joo
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2024, 2024, 13034
  • [7] A survey on video-based Human Action Recognition: recent updates, datasets, challenges, and applications
    Preksha Pareek
    Ankit Thakkar
    Artificial Intelligence Review, 2021, 54 : 2259 - 2322
  • [8] A survey on video-based Human Action Recognition: recent updates, datasets, challenges, and applications
    Pareek, Preksha
    Thakkar, Ankit
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (03) : 2259 - 2322
  • [9] Review on Recent Advances in Human Action Recognition in Video Data
    Baisware, Akshita
    Sayankar, Bharati
    Hood, Saurabh
    2019 9TH INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY: SIGNAL AND INFORMATION PROCESSING (ICETET-SIP-19), 2019,
  • [10] A Review of Deep Learning-based Human Activity Recognition on Benchmark Video Datasets
    Sharma, Vijeta
    Gupta, Manjari
    Pandey, Anil Kumar
    Mishra, Deepti
    Kumar, Ajai
    APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)