Improving accuracy of temporal action detection by deep hybrid convolutional network

被引:1
|
作者
Gan, Ming-Gang [1 ]
Zhang, Yan [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, State Key Lab Intelligent Control & Decis Complex, Beijing 100081, Peoples R China
基金
国家重点研发计划;
关键词
Boundary regression; Proposal classification; Temporal action detection; Temporal action location; Video analysis; ACTION RECOGNITION; VIDEO; CORPUS;
D O I
10.1007/s11042-022-13962-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Temporal action detection, a fundamental yet challenging task in understanding human actions, is usually divided into two stages: temporal action proposal generation and proposal classification. Classifying action proposals is always considered an action recognition task and receives little attention. However, compared with action classification, classifying action proposals has more large intra-class variations and subtle inter-class differences, making it more difficult to classify accurately. In this paper, we propose a novel end-to-end framework called Deep Hybrid Convolutional Network (DHCNet) to classify action proposals and achieve high-performance temporal action detection. DHCNet improves temporal action detection performance from three aspects. First, DHCNet utilizes Subnet I to effectively model the temporal structure of proposals and generate discriminative proposal features. Second, the Subnet II of DHCNet exploits Graph Convolution (GConv) to acquire information from other proposals and obtains much semantic information to enhance the proposal feature. Third, DHCNet adopts a coarse-to-fine cascaded classification, where the influence of large intra-class variations and subtle inter-class differences are reduced significantly at different granularities. Besides, we design an iterative boundary regression method based on closed-loop feedback to refine the temporal boundaries of proposals. Extensive experiments demonstrate the effectiveness of our approach. Furthermore, DHCNet achieves the state-of-the-art performance on the THUMOS'14 dataset(59.9% on mAP@0.5).
引用
收藏
页码:16127 / 16149
页数:23
相关论文
共 50 条
  • [41] Effective android malware detection with a hybrid model based on deep autoencoder and convolutional neural network
    Wang, Wei
    Zhao, Mengxue
    Wang, Jigang
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 10 (08) : 3035 - 3043
  • [42] Prediction of mutation effects using a deep temporal convolutional network
    Kim, Ha Young
    Kim, Dongsup
    BIOINFORMATICS, 2020, 36 (07) : 2047 - 2052
  • [43] Temporal action proposal for online driver action monitoring using Dilated Convolutional Temporal Prediction Network
    Wen, Boge
    Chen, Siyuan
    Shao, Chenhui
    COMPUTERS IN INDUSTRY, 2020, 121 (121)
  • [44] A Deep Convolutional Neural Network Model for Improving WRF Simulations
    Sayeed, Alqamah
    Choi, Yunsoo
    Jung, Jia
    Lops, Yannic
    Eslami, Ebrahim
    Salman, Ahmed Khan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (02) : 750 - 760
  • [45] Two-stream graph convolutional neural network fusion for weakly supervised temporal action detection
    Mengyao Zhao
    Zhengping Hu
    Shufang Li
    Shuai Bi
    Zhe Sun
    Signal, Image and Video Processing, 2022, 16 : 947 - 954
  • [46] Two-stream graph convolutional neural network fusion for weakly supervised temporal action detection
    Zhao, Mengyao
    Hu, Zhengping
    Li, Shufang
    Bi, Shuai
    Sun, Zhe
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (04) : 947 - 954
  • [47] Action Recognition Based on a Hybrid Deep Network
    Zou Y.
    Zhou X.
    Ren X.
    SN Computer Science, 2021, 2 (6)
  • [48] Improving CBIR Accuracy using Convolutional Neural Network for Feature Extraction
    Shah, Amjad
    Naseem, Rashid
    Sadia
    Iqbal, Shahid
    Shah, Muhammad Arif
    2017 13TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET 2017), 2017,
  • [49] Detection of Aerobics Action Based on Convolutional Neural Network
    Zhang, Siyu
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [50] Character action recognition based on deep convolutional neural network and action sequence
    Li, Jiaen
    Ren, Fuji
    Nishide, Shun
    Kang, Xin
    PROCEEDINGS OF 2019 6TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2019, : 149 - 153