Activity Recognition from Multi-modal Sensor Data Using a Deep Convolutional Neural Network

Cited by: 5
Authors
Taherkhani, Aboozar [1]
Cosma, Georgina [1]
Alani, Ali A. [2]
McGinnity, T. M. [1]
Affiliations
[1] Nottingham Trent Univ, Sch Sci & Technol, Nottingham, England
[2] Univ Diyala, Coll Sci, Comp Sci Dept, Diyala, Iraq
Keywords
Deep learning; Convolutional neural networks; Multi-modal data; Sensor data; Deep belief network; Smart home; Data fusion; Algorithm
DOI
10.1007/978-3-030-01177-2_15
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multi-modal data extracted from different sensors in a smart home can be fused to build models that recognize the daily living activities of residents. This paper proposes a Deep Convolutional Neural Network to perform the activity recognition task using the multi-modal data collected from a smart residential home. The dataset contains accelerometer data (composed of three perpendicular components of acceleration and the strength of the accelerometer signal received by four receivers), video data (15 time-series related to 2D and 3D center of mass and bounding box extracted from an RGB-D camera), and Passive Infra-Red sensor data. The performance of the Deep Convolutional Neural Network is compared to the Deep Belief Network. Experimental results revealed that the Deep Convolutional Neural Network with two pairs of convolutional and max pooling layers achieved better classification accuracy than the Deep Belief Network. The Deep Belief Network uses Restricted Boltzmann Machines for pre-training the network. When training deep learning models using classes with a high number of training samples, the DBN achieved 65.97% classification accuracy, whereas the CNN achieved 75.33% accuracy. The experimental results demonstrate the challenges of dealing with multi-modal data and highlight the importance of having a suitable number of samples within each class for sufficiently training and testing deep learning models.
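The pipeline described in the abstract (channel-wise fusion of the sensor streams, then two pairs of convolutional and max pooling layers) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The 7 accelerometer channels (3 acceleration components plus 4 receiver signal strengths) and the 15 video series follow the abstract; the single PIR channel, the 16-step window, the filter counts and widths, the constant weights, and all function names are assumptions chosen only to make the data flow visible.

```python
# Illustrative sketch only. Accelerometer (7) and video (15) channel counts
# follow the abstract; the PIR channel count, window length, filter sizes,
# and weights are hypothetical placeholders, not values from the paper.

def fuse(*modalities):
    """Stack time-aligned modalities channel-wise into one multi-channel window."""
    fused = []
    for channels in modalities:
        fused.extend(channels)
    return fused

def conv1d_relu(x, filters):
    """'Valid' 1-D convolution over time, followed by ReLU.

    x: list of C channels, each a list of T floats.
    filters: list of F filters, each a list of C weight lists of width K.
    Returns F channels of length T - K + 1.
    """
    steps = len(x[0])
    out = []
    for filt in filters:
        width = len(filt[0])
        row = []
        for t in range(steps - width + 1):
            total = sum(filt[c][k] * x[c][t + k]
                        for c in range(len(x)) for k in range(width))
            row.append(max(0.0, total))
        out.append(row)
    return out

def max_pool1d(x, size):
    """Non-overlapping max pooling over time; a trailing partial window is dropped."""
    return [[max(ch[i:i + size]) for i in range(0, len(ch) - size + 1, size)]
            for ch in x]

def make_window(n_channels, steps):
    """Deterministic stand-in for real sensor readings."""
    return [[((c + t) % 5) * 0.1 for t in range(steps)] for c in range(n_channels)]

def constant_filters(n_filters, n_channels, width, value):
    """Placeholder weights; a trained network would learn these."""
    return [[[value] * width for _ in range(n_channels)] for _ in range(n_filters)]

STEPS = 16                      # hypothetical window length
accel = make_window(7, STEPS)   # 3 acceleration axes + 4 receiver signal strengths
video = make_window(15, STEPS)  # 15 series from the RGB-D camera
pir = make_window(1, STEPS)     # hypothetical PIR channel count

fused = fuse(accel, video, pir)                                    # 23 channels x 16 steps
h1 = conv1d_relu(fused, constant_filters(4, len(fused), 3, 0.01))  # 4 x 14
p1 = max_pool1d(h1, 2)                                             # 4 x 7
h2 = conv1d_relu(p1, constant_filters(2, len(p1), 3, 0.01))        # 2 x 5
p2 = max_pool1d(h2, 2)                                             # 2 x 2
features = [v for ch in p2 for v in ch]  # flattened vector for a classifier head
```

In practice a deep learning framework would replace these loops and the weights would be learned from labelled activity windows; the sketch only shows how the fused channel dimension and the time dimension change across the two convolution and pooling pairs.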
Pages: 203-218
Number of pages: 16