Multi-granularity Generator for Temporal Action Proposal

被引:10
|
作者
Liu, Yuan [2 ]
Ma, Lin [1 ]
Zhang, Yifeng [2 ]
Liu, Wei [1 ]
Chang, Shih-Fu [3 ]
机构
[1] Tencent AI Lab, Bellevue, WA 98004 USA
[2] Southeast Univ, Nanjing, Jiangsu, Peoples R China
[3] Columbia Univ, New York, NY 10027 USA
关键词
D O I
10.1109/CVPR.2019.00372
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal action proposal generation is an important task, aiming to localize the video segments containing human actions in an untrimmed video. In this paper, we propose a multi-granularity generator (MGG) to perform the temporal action proposal from different granularity perspectives, relying on the video visual features equipped with the position embedding information. First, we propose to use a bilinear matching model to exploit the rich local information within the video sequence. Afterwards, two components, namely segment proposal producer (SPP) and frame actionness producer (FAP), are combined to perform the task of temporal action proposal at two distinct granularities. SPP considers the whole video in the form of feature pyramid and generates segment proposals from one coarse perspective, while FAP carries out a finer actionness evaluation for each video frame. Our proposed MGG can be trained in an end-to-end fashion. Through temporally adjusting the segment proposals with fine-grained information based on frame actionness, MGG achieves the superior performance over state-of-the-art methods on the public THUMOS-14 and ActivityNet-1.3 datasets. Moreover, we employ existing action classifiers to perform the classification of the proposals generated by MGG, leading to significant improvements compared against the competing methods for the video detection task.
引用
收藏
页码:3599 / 3608
页数:10
相关论文
共 50 条
  • [21] Learning multi-granularity features from multi-granularity regions for person re-identification
    Yang, Kaiwen
    Yang, Jiwei
    Tian, Xinmei
    NEUROCOMPUTING, 2021, 432 : 206 - 215
  • [22] MgMViT: Multi-Granularity and Multi-Scale Vision Transformer for Efficient Action Recognition
    Huo, Hua
    Li, Bingjie
    ELECTRONICS, 2024, 13 (05)
  • [23] Language-Guided Multi-Granularity Context Aggregation for Temporal Sentence Grounding
    Gong, Guoqiang
    Zhu, Linchao
    Mu, Yadong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7402 - 7414
  • [24] Multi-granularity spatial-temporal access control model for web GIS
    Zhang, Ai-juan
    Gao, Jing-xiang
    Ji, Cheng
    Sun, Jiu-yun
    Bao, Yu
    TRANSACTIONS OF NONFERROUS METALS SOCIETY OF CHINA, 2014, 24 (09) : 2946 - 2953
  • [25] Research on Expression of Multi-granularity Spatio-temporal Object Composition Structure
    Li R.
    Shi J.
    Dong G.
    Liu Z.
    Journal of Geo-Information Science, 2021, 23 (01) : 113 - 123
  • [26] Text-enhanced Multi-Granularity Temporal Graph Learning for Event Prediction
    Han, Xiaoxue
    Ning, Yue
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 171 - 180
  • [27] Multi-Granularity Detector for Vulnerability Fixes
    Nguyen, Truong Giang
    Le-Cong, Thanh
    Kang, Hong Jin
    Widyasari, Ratnadira
    Yang, Chengran
    Zhao, Zhipeng
    Xu, Bowen
    Zhou, Jiayuan
    Xia, Xin
    Hassan, Ahmed E.
    Le, Xuan-Bach D.
    Lo, David
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (08) : 4035 - 4057
  • [28] Multi-Granularity Graph Model (MGGM)
    Ghobril, P
    Tohmé, S
    2005 Conference on Optical Network Design and Modelling, Proceedings: TOWARDS THE BROADBAND-FOR-ALL ERA, 2005, : 383 - 392
  • [29] MULTI-GRANULARITY KNOWLEDGE MINING ON THE WEB
    Xie, Ming
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2012, 22 (01) : 1 - 16
  • [30] On Multi-granularity Soft Rough Sets
    Wang, Xiaomin
    Liu, Ying
    Li, Piyu
    Liu, Jianbo
    PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 6657 - 6662