Multi-granularity hierarchical contrastive learning between foreground and background for semi-supervised video action detection

被引：0

作者：

Zhang, Qiming ^{[1
]}

Hu, Zhengping ^{[1
,2
,3
]}

Wang, Yulu ^{[1
]}

Zhang, Hehao ^{[1
]}

Di, Jirui ^{[1
]}

机构：

[1] Yanshan Univ, Sch Informat & Engn, Qinhuangdao 066004, Hebei, Peoples R China

[2] Yanshan Univ, Qinhuangdao 066004, Hebei, Peoples R China

[3] Hebei Key Lab Informat Transmiss & Signal Proc, Qinhuangdao 066004, Hebei, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2025年 / 309卷

关键词：

Semi-supervised learning; Video action detection; Multi-granularity; Contrastive learning; NETWORK;

D O I：

10.1016/j.knosys.2024.112853

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semi-supervised video action detection has received increasing attention due to its lower data annotation cost and performance comparable to fully supervised methods. However, due to the presence of dynamic background regions in the video, existing methods may encounter biases when interpreting the foreground and background of the video. This bias causes the model to mistakenly identify dynamic background areas as action foregrounds or to overlook background information, leading to misjudgment of the foreground. In response to this issue, this paper proposes a multi-granularity hierarchical contrastive learning between foreground and background for semi-supervised video action detection method termed as Multi-FB. Specifically, this paper proposes a multi- granularity encoding network based on foreground and background. This network uses a unified encoder to represent and learn foreground and background regions in videos at different granularities, thereby improving the model's understanding of action foreground and related background. Secondly, this paper proposes an Intramodel multi-granularity hierarchical contrastive strategy, which aims to minimize the representation discrepancies of foreground-to-foreground and background-to-background at different granularities within the same video, while maximizing the representation differences between the foreground and background at various granularities within the video. Furthermore, this paper proposes a Cross-model multi-granularity hierarchical contrastive strategy, which aims to enhance the consistency of the model's representations of foregrounds and backgrounds between the original data and the augmented data. A large number of experimental results on JHMDB-21 and UCF101-24 show that the proposed method can significantly distinguish feature representations between different categories, achieving performance comparable to state-of-the-art methods under semi- supervised conditions.

引用

页数：16

共 50 条

[21] Semi-Supervised Multi-Granularity CNNs for Text Classification: An Application in Human-Car Interaction
Zhao, Fen
Li, Yinguo
Bai, Ling
Tian, Zhen
Wang, Xinheng
IEEE ACCESS, 2020, 8 : 68000 - 68012
[22] SMGCL: Semi-supervised Multi-view Graph Contrastive Learning
Zhou, Hui
Gong, Maoguo
Wang, Shanfeng
Gao, Yuan
Zhao, Zhongying
KNOWLEDGE-BASED SYSTEMS, 2023, 260
[23] Trusted Semi-Supervised Multi-View Classification With Contrastive Learning
Wang, Xiaoli
Wang, Yongli
Wang, Yupeng
Huang, Anqi
Liu, Jun
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8268 - 8278
[24] Multi-Granularity approach for Enhancing the Performance of Network Intrusion Detection with Supervised Learning
Saraswathy, V. R.
Kasthuri, N.
Ramyadevi, I. P.
PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16), 2016,
[25] A SEMI-SUPERVISED LEARNING APPROACH TO ONLINE AUDIO BACKGROUND DETECTION
Chu, Selina
Narayanan, Shrikanth
Kuo, C-C Jay
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1629 - +
[26] Neighbor-Guided Consistent and Contrastive Learning for Semi-Supervised Action Recognition
Wu, Jianlong
Sun, Wei
Gan, Tian
Ding, Ning
Jiang, Feijun
Shen, Jialie
Nie, Liqiang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2215 - 2227
[27] Audio-Visual Contrastive and Consistency Learning for Semi-Supervised Action Recognition
Assefa, Maregu
Jiang, Wei
Zhan, Jinyu
Gedamu, Kumie
Yilma, Getinet
Ayalew, Melese
Adhikari, Deepak
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3491 - 3504
[28] Multi granularity based label propagation with active learning for semi-supervised classification
Hu, Shengdan
Miao, Duoqian
Pedrycz, Witold
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 192
[29] Ego-Vehicle Action Recognition based on Semi-Supervised Contrastive Learning
Noguchi, Chihiro
Tanizawa, Toshihiro
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5977 - 5987
[30] M-GWNN: Multi-granularity graph wavelet neural networks for semi-supervised node classification
Zheng, Wenjie
Qian, Fulan
Zhao, Shu
Zhang, Yanping
NEUROCOMPUTING, 2021, 453 : 524 - 537

← 1 2 3 4 5 →