Graph Convolutional Networks for Temporal Action Localization

被引:376
|
作者
Zeng, Runhao [1 ,2 ]
Huang, Wenbing [2 ,5 ]
Tan, Mingkui [1 ,4 ]
Rong, Yu [2 ]
Zhao, Peilin [2 ]
Huang, Junzhou [2 ]
Gan, Chuang [3 ]
机构
[1] South China Univ Technol, Sch Software Engn, Guangzhou, Peoples R China
[2] Tencent AI Lab, Shenzhen, Peoples R China
[3] MIT, IBM Watson AI Lab, Cambridge, MA 02139 USA
[4] Peng Cheng Lab, Shenzhen, Peoples R China
[5] Tsinghua Univ, State Key Lab Intelligent Technol & Syst, Tsinghua Natl Lab Informat Sci & Technol TNList, Dept Comp Sci & Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV.2019.00719
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most state-of-the-art action localization systems process each action proposal individually, without explicitly exploiting their relations during learning. However, the relations between proposals actually play an important role in action localization, since a meaningful action always consists of multiple proposals in a video. In this paper, we propose to exploit the proposal-proposal relations using Graph Convolutional Networks (GCNs). First, we construct an action proposal graph, where each proposal is represented as a node and their relations between two proposals as an edge. Here, we use two types of relations, one for capturing the context information for each proposal and the other one for characterizing the correlations between distinct actions. Then we apply the GCNs over the graph to model the relations among different proposals and learn powerful representations for the action classification and localization. Experimental results show that our approach significantly outperforms the state-of-the-art on THUMOS14 (49.1% versus 42.8%). Moreover, augmentation experiments on ActivityNet also verify the efficacy of modeling action proposal relationships.
引用
收藏
页码:7093 / 7102
页数:10
相关论文
共 50 条
  • [41] On the spatial attention in spatio-temporal graph convolutional networks for skeleton-based human action recognition
    Heidari, Negar
    Iosifidis, Alexandros
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [42] Position-aware spatio-temporal graph convolutional networks for skeleton-based action recognition
    Yang, Ping
    Wang, Qin
    Chen, Hao
    Wu, Zizhao
    IET COMPUTER VISION, 2023, 17 (07) : 844 - 854
  • [43] Spatial-Temporal Self-Attention Enhanced Graph Convolutional Networks for Fitness Yoga Action Recognition
    Wei, Guixiang
    Zhou, Huijian
    Zhang, Liping
    Wang, Jianji
    SENSORS, 2023, 23 (10)
  • [44] Weakly supervised image classification and pointwise localization with graph convolutional networks
    Liu, Yongsheng
    Chen, Wenyu
    Qu, Hong
    Mahmud, S. M. Hasan
    Miao, Kebin
    PATTERN RECOGNITION, 2021, 109
  • [45] Exploring frame segmentation networks for temporal action localization
    Yang, Ke
    Shen, Xiaolong
    Qiao, Peng
    Li, Shijie
    Li, Dongsheng
    Dou, Yong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 61 : 296 - 302
  • [46] Implementating Spatio-Temporal Graph Convolutional Networks on Graphcore IPUs
    Moe, Johannes
    Pogorelov, Konstantin
    Schroeder, Daniel Thilo
    Langguth, Johannes
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 45 - 54
  • [47] Graph autoencoder with mirror temporal convolutional networks for traffic anomaly detection
    Ren, Zhiyu
    Li, Xiaojie
    Peng, Jing
    Chen, Ken
    Tan, Qushan
    Wu, Xi
    Shi, Canghong
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [48] Railway Delay Prediction with Spatial-Temporal Graph Convolutional Networks
    Heglund, Jacob S. W.
    Taleongpong, Panukorn
    Hu, Simon
    Tran, Huy T.
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [49] Traffic Prediction with Peak-Aware Temporal Graph Convolutional Networks
    Acun, Fatih
    Kalkan, Sinan
    Gol, Ebru Aydin
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [50] Temporal Inception Architecture for Action Recognition with Convolutional Neural Networks
    Zhang, Wei
    Cen, Jiepeng
    Zheng, Huicheng
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3216 - 3221