Graph Convolutional Networks for Temporal Action Localization

被引:376
|
作者
Zeng, Runhao [1 ,2 ]
Huang, Wenbing [2 ,5 ]
Tan, Mingkui [1 ,4 ]
Rong, Yu [2 ]
Zhao, Peilin [2 ]
Huang, Junzhou [2 ]
Gan, Chuang [3 ]
机构
[1] South China Univ Technol, Sch Software Engn, Guangzhou, Peoples R China
[2] Tencent AI Lab, Shenzhen, Peoples R China
[3] MIT, IBM Watson AI Lab, Cambridge, MA 02139 USA
[4] Peng Cheng Lab, Shenzhen, Peoples R China
[5] Tsinghua Univ, State Key Lab Intelligent Technol & Syst, Tsinghua Natl Lab Informat Sci & Technol TNList, Dept Comp Sci & Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV.2019.00719
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most state-of-the-art action localization systems process each action proposal individually, without explicitly exploiting their relations during learning. However, the relations between proposals actually play an important role in action localization, since a meaningful action always consists of multiple proposals in a video. In this paper, we propose to exploit the proposal-proposal relations using Graph Convolutional Networks (GCNs). First, we construct an action proposal graph, where each proposal is represented as a node and their relations between two proposals as an edge. Here, we use two types of relations, one for capturing the context information for each proposal and the other one for characterizing the correlations between distinct actions. Then we apply the GCNs over the graph to model the relations among different proposals and learn powerful representations for the action classification and localization. Experimental results show that our approach significantly outperforms the state-of-the-art on THUMOS14 (49.1% versus 42.8%). Moreover, augmentation experiments on ActivityNet also verify the efficacy of modeling action proposal relationships.
引用
收藏
页码:7093 / 7102
页数:10
相关论文
共 50 条
  • [31] Frame Segmentation Networks for Temporal Action Localization
    Yang, Ke
    Qiao, Peng
    Wang, Qiang
    Li, Shijie
    Niu, Xin
    Li, Dongsheng
    Dou, Yong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 242 - 252
  • [32] Temporal Convolutional Networks: A Unified Approach to Action Segmentation
    Lea, Colin
    Vidal, Rene
    Reiter, Austin
    Hager, Gregory D.
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 47 - 54
  • [33] Predicting Team Performance with Spatial Temporal Graph Convolutional Networks
    Hu, Shengnan
    Sukthankar, Gita
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2342 - 2348
  • [34] Urban Overtourism Detection Based on Graph Temporal Convolutional Networks
    Kong, Xiangjie
    Huang, Zhiqiang
    Shen, Guojiang
    Lin, Hang
    Lv, Mingjie
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 442 - 454
  • [35] Higher-Order Graph Convolutional Embedding for Temporal Networks
    Mo, Xian
    Pang, Jun
    Liu, Zhiming
    WEB INFORMATION SYSTEMS ENGINEERING, WISE 2020, PT I, 2020, 12342 : 3 - 15
  • [36] Exploring Temporal Preservation Networks for Precise Temporal Action Localization
    Yang, Ke
    Qiao, Peng
    Li, Dongsheng
    Lv, Shaohe
    Dou, Yong
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7477 - 7484
  • [37] Generative Adversarial Graph Convolutional Networks for Human Action Synthesis
    Degardin, Bruno
    Neves, Joao
    Lopes, Vasco
    Brito, Joao
    Yaghoubi, Ehsan
    Proenca, Hugo
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2753 - 2762
  • [38] Weakly Supervised Graph Convolutional Neural Network for Human Action Localization
    Miki, Daisuke
    Chen, Shi
    Demachi, Kazuyuki
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 642 - 650
  • [39] Timestamp-Supervised Action Segmentation with Graph Convolutional Networks
    Khan, Hamza
    Haresh, Sanjay
    Ahmed, Awais
    Siddiqui, Shakeeb
    Konin, Andrey
    Zia, M. Zeeshan
    Quoc-Huy Tran
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10619 - 10626
  • [40] On the spatial attention in spatio-temporal graph convolutional networks for skeleton-based human action recognition
    Heidari, Negar
    Iosifidis, Alexandros
    Proceedings of the International Joint Conference on Neural Networks, 2021, 2021-July