Incremental human action recognition with dual memory

被引:3
|
作者
Gutoski, Matheus [1 ]
Lazzaretti, Andre Eugenio [1 ]
Lopes, Heitor Silverio [1 ]
机构
[1] Univ Tecnol Fed Parana, Av Sete Setembro 3165, BR-80230901 Curitiba, Parana, Brazil
关键词
Incremental learning; Human Action Recognition; Metric Learning; Triplet Networks; Dual-memory Extreme Value Machine;
D O I
10.1016/j.imavis.2021.104313
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Incremental learning is a topic of great interest in the current state of machine learning research. Real-world problems often require a classifier to incorporate new knowledge while preserving what was learned before. One of the most challenging problems in computer vision is Human Action Recognition (HAR) in videos. How-ever, most of the existing works approach HAR from a non-incremental point of view. This work proposes a framework for performing HAR in the incremental learning scenario called Incremental Human Action Recogni-tion with Dual Memory (IHAR-DM). IHAR-DM contains three main components: a 3D convolutional neural net-work for capturing Spatio-temporal features; a Triplet Network to perform metric learning; and the dual -memory Extreme Value Machine, which is introduced in this work. The proposed method is compared with 10 other state-of-the-art incremental learning models. We propose five experimental settings containing different numbers of tasks and classes using two widely known HAR datasets: UCF-101 and HMDB51. Our results show superior performance in terms of Normalized Mutual Information (NMI) and Inter-task Intransigence (ITI), which is a new metric proposed in this work. Overall results show the feasibility of the proposal for real HAR problems, which mostly present the requirements imposed by incremental learning. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Incremental discriminant-analysis of canonical correlations for action recognition
    Wu, Xinxiao
    Jia, Yunde
    Liang, Wei
    PATTERN RECOGNITION, 2010, 43 (12) : 4190 - 4197
  • [22] Incremental Dual-memory LSTM in Land Cover Prediction
    Jia, Xiaowei
    Khandelwal, Ankush
    Nayak, Guruprasad
    Gerber, James
    Carlson, Kimberly
    West, Paul
    Kumar, Vipin
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 867 - 876
  • [23] Human Action Recognition Without Human
    He, Yun
    Shirakabe, Soma
    Satoh, Yutaka
    Kataoka, Hirokatsu
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 11 - 17
  • [24] A dual fast and slow feature interaction in biologically inspired visual recognition of human action
    Yousefi, Bardia
    Loo, Chu Kiong
    APPLIED SOFT COMPUTING, 2018, 62 : 57 - 72
  • [25] The Study of Human Action Recognition in Videos with Long Short-Term Memory Model
    Khan, Hussan
    Habib, Sammra
    Qasim, Amna
    Hussain, Nisar
    Usman, Muhammad
    Mahmood, Ahmad
    Shaukat, Zainab
    Afzal, Asghar
    Zain, Muhammad
    INFORMATION MANAGEMENT AND BIG DATA, SIMBIG 2023, 2024, 2142 : 217 - 230
  • [26] Semi-supervised long short-term memory for human action recognition
    Liu, Hong
    Liu, Chang
    Ding, Runwei
    JOURNAL OF ENGINEERING-JOE, 2020, 2020 (13): : 373 - 378
  • [27] Distinctive action sketch for human action recognition
    Zheng, Ying
    Yao, Hongxun
    Sun, Xiaoshuai
    Zhao, Sicheng
    Porikli, Fatih
    SIGNAL PROCESSING, 2018, 144 : 323 - 332
  • [28] Action Capsules: Human skeleton action recognition
    Bavil, Ali Farajzadeh
    Damirchi, Hamed
    Taghirad, Hamid D.
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 233
  • [29] Human Action Recognition with Transformers
    Mazzeo, Pier Luigi
    Spagnolo, Paolo
    Fasano, Matteo
    Distante, Cosimo
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 230 - 241
  • [30] A method for human action recognition
    Masoud, O
    Papanikolopoulos, N
    IMAGE AND VISION COMPUTING, 2003, 21 (08) : 729 - 743