Temporal Feature Enhancement Dilated Convolution Network for Weakly-supervised Temporal Action Localization

Cited by: 8

Authors:

Zhou, Jianxiong [1]

Wu, Ying [1]

Affiliations:

[1] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA

Source:

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2023

Keywords:

DOI:

10.1109/WACV56688.2023.00597

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Weakly-supervised Temporal Action Localization (WTAL) aims to classify and localize action instances in untrimmed videos using only video-level labels. Existing methods typically use snippet-level RGB and optical flow features taken directly from pre-trained extractors. These methods face two limitations, the short temporal span of snippets and initial features that are ill-suited to the task, so they make poor use of temporal information and achieve limited performance. In this paper, we propose the Temporal Feature Enhancement Dilated Convolution Network (TFE-DCN) to address these two limitations. The proposed TFE-DCN has an enlarged receptive field that covers a long temporal span and can observe the full dynamics of action instances, which lets it capture temporal dependencies between snippets. Furthermore, we propose the Modality Enhancement Module, which enhances RGB features with the help of enhanced optical flow features, making the overall features appropriate for the WTAL task. Experiments conducted on the THUMOS'14 and ActivityNet v1.3 datasets show that our proposed approach far outperforms state-of-the-art WTAL methods.
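
The abstract's point about an enlarged receptive field can be illustrated with a small calculation. The sketch below is not the TFE-DCN architecture; the function name, the kernel size of 3, and the dilation rates (1, 2, 4) are assumptions chosen for illustration only. It shows how stacking stride-1 dilated 1D convolutions widens the span of snippets each output position can see, compared with the same number of undilated layers.

```python
def receptive_field(kernel_size, dilations):
    """Receptive field (in snippets) of stacked stride-1 dilated 1D convolutions.

    Each layer with dilation d extends the field by (kernel_size - 1) * d.
    """
    field = 1
    for d in dilations:
        field += (kernel_size - 1) * d
    return field


# Hypothetical settings, not taken from the paper.
plain = receptive_field(3, [1, 1, 1])    # three ordinary convolutions
dilated = receptive_field(3, [1, 2, 4])  # three dilated convolutions
print(plain, dilated)  # 7 15
```

With the same number of layers and parameters, the dilated stack in this example covers roughly twice as many snippets as the undilated one.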

Pages: 6017-6026

Page count: 10

50 items in total

[31] Weakly-Supervised Temporal Action Localization with Regional Similarity Consistency
Ren, Haoran
Ren, Hao
Lu, Hong
Jin, Cheng
MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 69 - 81
[32] A Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization
Zhao, Yibo
Zhang, Hua
Gao, Zan
Gao, Wenjie
Wang, Meng
Chen, Shengyong
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8253 - 8266
[33] Context Sensitive Network for weakly-supervised fine-grained temporal action localization
Dong, Cerui
Liu, Qinying
Wang, Zilei
Zhang, Yixin
Zhao, Feng
NEURAL NETWORKS, 2025, 185
[34] Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization
Zhai, Yuanhao
Wang, Le
Tang, Wei
Zhang, Qilin
Zheng, Nanning
Doermann, David
Yuan, Junsong
Hua, Gang
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4136 - 4151
[35] Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization
Gao, Junyu
Chen, Mengyuan
Xu, Changsheng
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022 : 19967 - 19977
[36] Deep feature enhancing and selecting network for weakly supervised temporal action localization
Yu, Jiaruo
Ge, Yongxin
Qin, Xiaolei
Li, Ziqiang
Huang, Sheng
Chen, Feiyu
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 80
[37] Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization
Lee, Pilhyeon
Byun, Hyeran
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021 : 13628 - 13637
[38] Progressive enhancement network with pseudo labels for weakly supervised temporal action localization
Wang, Qingyun
Song, Yan
Zou, Rong
Shu, Xiangbo
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
[39] CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning
Zhang, Can
Cao, Meng
Yang, Dongming
Chen, Jie
Zou, Yuexian
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021 : 16005 - 16014
[40] Diffusion-based framework for weakly-supervised temporal action localization
Zou, Yuanbing
Zhao, Qingjie
Sarker, Prodip Kumar
Li, Shanshan
Wang, Lei
Liu, Wangwang
Pattern Recognition, 2025, 160