Deep appearance and motion learning for egocentric activity recognition

被引:35
|
作者
Wang, Xuanhan [1 ]
Gao, Lianli [1 ]
Song, Jingkuan [2 ]
Zhen, Xiantong [3 ]
Sebe, Nicu [4 ]
Shen, Heng Tao [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Sichuan, Peoples R China
[2] Columbia Univ, Sch Engn & Appl Sci, New York, NY 10027 USA
[3] Univ Western Ontario, Digital Imaging Grp, London, ON N6A 4V2, Canada
[4] Univ Trento, Dept Informat Engn & Comp Sci, I-38100 Trento, Italy
基金
中国国家自然科学基金;
关键词
Multiple feature learning; Deep learning; Autoencoder; Egocentric video; Activity recognition;
D O I
10.1016/j.neucom.2017.08.063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Egocentric activity recognition has recently generated great popularity in computer vision due to its widespread applications in egocentric video analysis. However, it poses new challenges comparing to the conventional third-person activity recognition tasks, which are caused by significant body shaking, varied lengths, and poor recoding quality, etc. To handle these challenges, in this paper, we propose deep appearance and motion learning (DAML) for egocentric activity recognition, which leverages the great strength of deep learning networks in feature learning. In contrast to hand- crafted visual features or pre-trained convolutional neural network (CNN) features with limited generality to new egocentric videos, the proposed DAML is built on the deep autoencoder (DAE), and directly extracts appearance and motion feature, the main cue of activities, from egocentric videos. The DAML takes advantages of the great effectiveness and efficiency of the DAE in unsupervised feature learning, which provides a new representation learning framework of egocentric videos. The learned appearance and motion features by the DAML are seamlessly fused to accomplish a rich informative egocentric activity representation which can be readily fed into any supervised learning models for activity recognition. Experimental results on two challenging benchmark datasets show that the DAML achieves high performance on both short- and long-term egocentric activity recognition tasks, which is comparable to or even better than the state-of-the-art counterparts. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:438 / 447
页数:10
相关论文
共 50 条
  • [41] A Survey of Group Activity Recognition Based on Deep Learning
    Yan, Rui
    Ge, Xiao-Jing
    Huang, Peng
    Shu, Xiang-Bo
    Tang, Jin-Hui
    Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (11): : 2552 - 2578
  • [42] Hybrid deep learning framework for human activity recognition
    Pushpalatha, S.
    Math, Shrishail
    INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2022, 13 (01): : 1225 - 1237
  • [43] Human Activity Recognition in Videos Using Deep Learning
    Kumar, Mohit
    Rana, Adarsh
    Ankita
    Yadav, Arun Kumar
    Yadav, Divakar
    SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, ICSOFTCOMP 2022, 2023, 1788 : 288 - 299
  • [44] Activity Recognition Based on Deep Learning and Android Software
    Wang, Chao
    Lin, Chuang
    Yang, Mo
    2018 IEEE INTERNATIONAL CONFERENCE ON CYBORG AND BIONIC SYSTEMS (CBS), 2018, : 31 - 35
  • [45] Deep Learning for Human Activity Recognition in Mobile Computing
    Plotz, Thomas
    Guan, Yu
    COMPUTER, 2018, 51 (05) : 50 - 59
  • [46] A Multimodal Deep Learning Network for Group Activity Recognition
    Rossi, Silvia
    Capasso, Roberto
    Acampora, Giovanni
    Staffa, Mariacarla
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [47] Abnormal activity recognition based on deep learning in crowd
    Fu, Yan
    Liu, Tao
    Ye, Ou
    2019 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2019), VOL 1, 2019, : 301 - 304
  • [48] Deep learning in the classification and recognition of cardiac activity patterns
    Jelen, Lukasz
    Ciskowski, Piotr
    Kluwak, Konrad
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2024, 70 (01) : 79 - 85
  • [49] Human Activity Recognition Based on Deep Learning Method
    Shi, Xiaoran
    Li, Yaxin
    Zhou, Feng
    Liu, Lei
    2018 INTERNATIONAL CONFERENCE ON RADAR (RADAR), 2018,
  • [50] Evaluation of deep learning model for human activity recognition
    Bhat, Owais
    Khan, Dawood A.
    EVOLVING SYSTEMS, 2022, 13 (01) : 159 - 168