Weakly Supervised Action Localization by Sparse Temporal Pooling Network

Cited by: 249
Authors
Nguyen, Phuc [1]
Liu, Ting [2]
Prasad, Gautam [2]
Han, Bohyung [3]
Affiliations
[1] Univ Calif Irvine, Irvine, CA 92697 USA
[2] Google, Venice, CA USA
[3] Seoul Natl Univ, Seoul, South Korea
DOI
10.1109/CVPR.2018.00706
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose a weakly supervised temporal action localization algorithm on untrimmed videos using convolutional neural networks. Our algorithm learns from video-level class labels and predicts temporal intervals of human actions with no requirement of temporal localization annotations. We design our network to identify a sparse subset of key segments associated with target actions in a video using an attention module and fuse the key segments through adaptive temporal pooling. Our loss function is comprised of two terms that minimize the video-level action classification error and enforce the sparsity of the segment selection. At inference time, we extract and score temporal proposals using temporal class activations and class-agnostic attentions to estimate the time intervals that correspond to target actions. The proposed algorithm attains state-of-the-art results on the THUMOS14 dataset and outstanding performance on ActivityNet1.3 even with its weak supervision.
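The training objective described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify the layers, so the linear attention weights `attn_w`, the linear classifier `cls_w`, the sigmoid activations, the L1 form of the sparsity term, and the weight `beta` are all assumptions made here for illustration.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sparse_pooling_loss(features, attn_w, cls_w, labels, beta=1e-4):
    """Two-term loss: video-level classification error plus attention sparsity.

    features: (T, D) array, one feature vector per temporal segment
    attn_w:   (D,) hypothetical linear attention weights (assumption)
    cls_w:    (D, C) hypothetical linear classifier weights (assumption)
    labels:   (C,) multi-hot video-level class labels
    beta:     weight of the sparsity term (assumed value)
    """
    # Class-agnostic attention in (0, 1), one value per segment.
    attention = sigmoid(features @ attn_w)                      # (T,)
    # Temporal pooling: attention-weighted sum of segment features.
    pooled = (attention[:, None] * features).sum(axis=0)        # (D,)
    # Video-level class probabilities from the pooled feature.
    probs = sigmoid(pooled @ cls_w)                             # (C,)
    eps = 1e-8
    cls_loss = -np.mean(labels * np.log(probs + eps)
                        + (1.0 - labels) * np.log(1.0 - probs + eps))
    # L1 penalty pushes most attention values toward zero,
    # so only a sparse subset of segments contributes to the pooled feature.
    sparsity_loss = np.abs(attention).sum()
    return cls_loss + beta * sparsity_loss, attention
```

Only video-level labels enter the loss; the per-segment attention is learned as a by-product, which is what makes the supervision weak.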
Pages: 6752-6761
Page count: 10
Related papers
50 in total
  • [1] Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization
    Dou, Peng
    Zeng, Ying
    Wang, Zhuoqun
    Hu, Haifeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (03)
  • [2] ACTION COHERENCE NETWORK FOR WEAKLY SUPERVISED TEMPORAL ACTION LOCALIZATION
    Zhai, Yuanhao
    Wang, Le
    Liu, Ziyi
    Zhang, Qilin
    Hua, Gang
    Zheng, Nanning
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3696 - 3700
  • [3] Weakly Supervised Temporal Action Detection with Shot-Based Temporal Pooling Network
    Su, Haisheng
    Zhao, Xu
    Lin, Tianwei
    Fei, Haiping
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 426 - 436
  • [4] Action Coherence Network for Weakly-Supervised Temporal Action Localization
    Zhai, Yuanhao
    Wang, Le
    Tang, Wei
    Zhang, Qilin
    Zheng, Nanning
    Hua, Gang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1857 - 1870
  • [5] Action Unit Memory Network for Weakly Supervised Temporal Action Localization
    Luo, Wang
    Zhang, Tianzhu
    Yang, Wenfei
    Liu, Jingen
    Mei, Tao
    Wu, Feng
    Zhang, Yongdong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9964 - 9974
  • [6] Complementary Attention Network for Weakly Supervised Temporal Action Localization
    Dou, Peng
    Hu, Haifeng
    NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6713 - 6732
  • [7] Ensemble Prototype Network For Weakly Supervised Temporal Action Localization
    Wu, Kewei
    Luo, Wenjie
    Xie, Zhao
    Guo, Dan
    Zhang, Zhao
    Hong, Richang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [8] Relational Prototypical Network for Weakly Supervised Temporal Action Localization
    Huang, Linjiang
    Huang, Yan
    Ouyang, Wanli
    Wang, Liang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11053 - 11060
  • [9] Complementary Attention Network for Weakly Supervised Temporal Action Localization
    Dou, Peng
    Hu, Haifeng
    Neural Processing Letters, 2023, 55 : 6713 - 6732
  • [10] Ensemble Prototype Network For Weakly Supervised Temporal Action Localization
    Wu, Kewei
    Luo, Wenjie
    Xie, Zhao
    Guo, Dan
    Zhang, Zhao
    Hong, Richang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (03) : 4560 - 4574