Improving accuracy of temporal action detection by deep hybrid convolutional network

被引:1
|
作者
Gan, Ming-Gang [1 ]
Zhang, Yan [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, State Key Lab Intelligent Control & Decis Complex, Beijing 100081, Peoples R China
基金
国家重点研发计划;
关键词
Boundary regression; Proposal classification; Temporal action detection; Temporal action location; Video analysis; ACTION RECOGNITION; VIDEO; CORPUS;
D O I
10.1007/s11042-022-13962-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Temporal action detection, a fundamental yet challenging task in understanding human actions, is usually divided into two stages: temporal action proposal generation and proposal classification. Classifying action proposals is always considered an action recognition task and receives little attention. However, compared with action classification, classifying action proposals has more large intra-class variations and subtle inter-class differences, making it more difficult to classify accurately. In this paper, we propose a novel end-to-end framework called Deep Hybrid Convolutional Network (DHCNet) to classify action proposals and achieve high-performance temporal action detection. DHCNet improves temporal action detection performance from three aspects. First, DHCNet utilizes Subnet I to effectively model the temporal structure of proposals and generate discriminative proposal features. Second, the Subnet II of DHCNet exploits Graph Convolution (GConv) to acquire information from other proposals and obtains much semantic information to enhance the proposal feature. Third, DHCNet adopts a coarse-to-fine cascaded classification, where the influence of large intra-class variations and subtle inter-class differences are reduced significantly at different granularities. Besides, we design an iterative boundary regression method based on closed-loop feedback to refine the temporal boundaries of proposals. Extensive experiments demonstrate the effectiveness of our approach. Furthermore, DHCNet achieves the state-of-the-art performance on the THUMOS'14 dataset(59.9% on mAP@0.5).
引用
收藏
页码:16127 / 16149
页数:23
相关论文
共 50 条
  • [21] Deep Convolutional Generative Adversarial Network and Convolutional Neural Network for Smoke Detection
    Yin, Hang
    Wei, Yurong
    Liu, Hedan
    Liu, Shuangyin
    Liu, Chuanyun
    Gao, Yacui
    COMPLEXITY, 2020, 2020
  • [22] Deep Convolutional Generative Adversarial Network and Convolutional Neural Network for Smoke Detection
    Yin, Hang
    Wei, Yurong
    Liu, Hedan
    Liu, Shuangyin
    Liu, Chuanyun
    Gao, Yacui
    Liu, Shuangyin (hdlsyxlq@126.com), 1600, Hindawi Limited (2020):
  • [23] Accuracy of a deep convolutional neural network in detection of retinitis pigmentosa on ultrawide-field images
    Masumoto, Hiroki
    Tabuchi, Hitoshi
    Nakakura, Shunsuke
    Ohsugi, Hideharu
    Enno, Hiroki
    Ishitobi, Naofumi
    Ohsugi, Eiko
    Mitamura, Yoshinori
    PEERJ, 2019, 7
  • [24] SMC: Single-Stage Multi-location Convolutional Network for Temporal Action Detection
    Liu, Zhikang
    Wang, Zilei
    Zhao, Yan
    Tian, Ye
    COMPUTER VISION - ACCV 2018, PT II, 2019, 11362 : 179 - 195
  • [25] Multistream Temporal Convolutional Network for Correct/Incorrect Patient Transfer Action Detection Using Body Sensor Network
    Zhong, Zhihang
    Lin, Chingszu
    Kanai-Pak, Masako
    Maeda, Jukai
    Kitajima, Yasuko
    Nakamura, Mitsuhiro
    Kuwahara, Noriaki
    Ogata, Taiki
    Ota, Jun
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (23) : 17000 - 17013
  • [26] Capsule Boundary Network With 3D Convolutional Dynamic Routing for Temporal Action Detection
    Chen, Yaosen
    Guo, Bing
    Shen, Yan
    Wang, Wei
    Lu, Weichen
    Suo, Xinhua
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 2962 - 2975
  • [27] Container Code Detection by Deep Convolutional Network
    Wang Zhi-Ming
    Ma Shu
    PROCEEDINGS OF THE 2017 GLOBAL CONFERENCE ON MECHANICS AND CIVIL ENGINEERING (GCMCE 2017), 2017, 132 : 82 - 87
  • [28] Obstacle Detection with Deep Convolutional Neural Network
    Yu, Hong
    Hong, Ruxia
    Huang, XiaoLei
    Wang, Zhengyou
    2013 SIXTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2013, : 265 - 268
  • [29] Deep Convolutional Neural Network for Fog Detection
    Zhang, Jun
    Lu, Hui
    Xia, Yi
    Han, Ting-Ting
    Miao, Kai-Chao
    Yao, Ye-Qing
    Liu, Cheng-Xiao
    Zhou, Jian-Ping
    Chen, Peng
    Wang, Bing
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT II, 2018, 10955 : 1 - 10
  • [30] Deep Convolutional Neural Network for Fire Detection
    Gotthans, Jakub
    Gotthans, Tomas
    Marsalek, Roman
    PROCEEDINGS OF THE 2020 30TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2020, : 128 - 133