Improving accuracy of temporal action detection by deep hybrid convolutional network

被引:1
|
作者
Gan, Ming-Gang [1 ]
Zhang, Yan [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, State Key Lab Intelligent Control & Decis Complex, Beijing 100081, Peoples R China
基金
国家重点研发计划;
关键词
Boundary regression; Proposal classification; Temporal action detection; Temporal action location; Video analysis; ACTION RECOGNITION; VIDEO; CORPUS;
D O I
10.1007/s11042-022-13962-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Temporal action detection, a fundamental yet challenging task in understanding human actions, is usually divided into two stages: temporal action proposal generation and proposal classification. Classifying action proposals is always considered an action recognition task and receives little attention. However, compared with action classification, classifying action proposals has more large intra-class variations and subtle inter-class differences, making it more difficult to classify accurately. In this paper, we propose a novel end-to-end framework called Deep Hybrid Convolutional Network (DHCNet) to classify action proposals and achieve high-performance temporal action detection. DHCNet improves temporal action detection performance from three aspects. First, DHCNet utilizes Subnet I to effectively model the temporal structure of proposals and generate discriminative proposal features. Second, the Subnet II of DHCNet exploits Graph Convolution (GConv) to acquire information from other proposals and obtains much semantic information to enhance the proposal feature. Third, DHCNet adopts a coarse-to-fine cascaded classification, where the influence of large intra-class variations and subtle inter-class differences are reduced significantly at different granularities. Besides, we design an iterative boundary regression method based on closed-loop feedback to refine the temporal boundaries of proposals. Extensive experiments demonstrate the effectiveness of our approach. Furthermore, DHCNet achieves the state-of-the-art performance on the THUMOS'14 dataset(59.9% on mAP@0.5).
引用
收藏
页码:16127 / 16149
页数:23
相关论文
共 50 条
  • [1] Improving accuracy of temporal action detection by deep hybrid convolutional network
    Ming-Gang Gan
    Yan Zhang
    Multimedia Tools and Applications, 2023, 82 : 16127 - 16149
  • [2] Improving generalisation and accuracy of on-line milling chatter detection via a novel hybrid deep convolutional neural network
    Zhang, Pengfei
    Gao, Dong
    Hong, Dongbo
    Lu, Yong
    Wu, Qian
    Zan, Shusong
    Liao, Zhirong
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2023, 193
  • [3] Boundary graph convolutional network for temporal action detection
    Chen, Yaosen
    Guo, Bing
    Shen, Yan
    Wang, Wei
    Lu, Weichen
    Suo, Xinhua
    IMAGE AND VISION COMPUTING, 2021, 109
  • [4] Improving accuracy and robustness of deep convolutional neural network based thoracic OAR segmentation
    Feng, Xue
    Bernard, Mark E.
    Hunter, Thomas
    Chen, Quan
    PHYSICS IN MEDICINE AND BIOLOGY, 2020, 65 (07):
  • [5] Human and object detection using Hybrid Deep Convolutional Neural Network
    Mukilan, P.
    Semunigus, Wogderess
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (07) : 1913 - 1923
  • [6] Human and object detection using Hybrid Deep Convolutional Neural Network
    P. Mukilan
    Wogderess Semunigus
    Signal, Image and Video Processing, 2022, 16 : 1913 - 1923
  • [7] A Deep Convolutional Network for Saliency Object Detection with Balanced Accuracy and High Efficiency
    Zhang Wenming
    Yao Zhenfei
    Gao Kun
    Li Haibin
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (05) : 1201 - 1208
  • [8] Improving the Accuracy of Community Detection in Social Network Through a Hybrid Method
    Nooribakhsh, Mahsa
    Fernandez-Diego, Marta
    Gonzalez-Ladron-De-Guevara, Fernando
    Mollamotalebi, Mahdi
    SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2024, PT II, 2025, 15212 : 117 - 126
  • [9] Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training
    Liu, Sheng
    Li, Xiao
    Zhai, Yuexiang
    You, Chong
    Zhu, Zhihui
    Fernandez-Granda, Carlos
    Qu, Qing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] A Hybrid Convolutional and Graph Neural Network for Human Action Detection in Static Images
    Lu, Xinbiao
    Xing, Hao
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (12) : 7820 - 7842