Progressive expansion for semi-supervised bi-modal salient object detection

被引：2

作者：

Wang, Jie ^{[1
]}

Zhang, Zihao ^{[1
]}

Yu, Nana ^{[1
]}

Han, Yahong ^{[1
]}

机构：

[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China

来源：

PATTERN RECOGNITION | 2025年 / 157卷

关键词：

Bi-modal salient object detection; Cross-model fusion; Semi-supervised learning; NETWORK; CONTEXT;

D O I：

10.1016/j.patcog.2024.110868

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Existing bi-modal salient object detection (SOD) methods primarily rely on fully supervised training strategies that require extensive manual annotation. Undoubtedly, extensive manual annotation is time-consuming and laborious, and the fully supervised strategy is also prone to overfitting on the training set. Therefore, we introduce a semi-supervised learning architecture (SSLA) to alleviate these problems while ensuring detection performance. Considering that the inherent training mode and concise architecture of basic SSLA will limit its ability to effectively explore the learning potential of the model, we further propose two optimization strategies, dynamic adjustment and active expansion. Specifically, we dynamically adjust the supervision scheme for unlabeled samples during training so that the model can continuously utilize the model's gains (pseudo labels) to supervise and guide the model to further explore the unlabeled samples. Furthermore, the active expansion strategy enables the model to acquire more beneficial supervised information and focuses its attention on difficult-to-segment samples. In summary, an effective progressive expansion network (PENet) architecture for semi-supervised bi-modal SOD is proposed. Extensive experiments indicate that our PENet architecture, while effectively alleviating 90% of annotation burdens, has achieved highly competitive results in RGB-T and RGB-D tasks compared to fully supervised methods. The performance is even more pronounced during cross-dataset testing.

引用

页数：15

共 50 条

[41] Scale-Equivalent Distillation for Semi-Supervised Object Detection
Guo, Qiushan
Mu, Yao
Chen, Jianyu
Wang, Tianqi
Yu, Yizhou
Luo, Ping
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14502 - 14511
[42] Consistency-based Semi-supervised Learning for Object Detection
Jeong, Jisoo
Lee, Seungeui
Kim, Jeesoo
Kwak, Nojun
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[43] Active and semi-supervised learning for object detection with imperfect data
Rhee, Phill Kyu
Erdenee, Enkhbayar
Kyun, Shin Dong
Ahmed, Minhaz Uddin
Jin, Songguo
COGNITIVE SYSTEMS RESEARCH, 2017, 45 : 109 - 123
[44] Interpolation-based Semi-supervised Learning for Object Detection
Jeong, Jisoo
Verma, Vikas
Hyun, Minsung
Kannala, Juho
Kwak, Nojun
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11597 - 11606
[45] Semi-supervised learning based object detection in aerial imagery
Yao, J
Zhang, ZF
2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol 1, Proceedings, 2005, : 1011 - 1016
[46] Dense Information Learning Based Semi-Supervised Object Detection
Yang, Xi
Li, Penghui
Zhou, Qiubai
Wang, Nannan
Gao, Xinbo
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1022 - 1035
[47] Global Focal Learning for Semi-Supervised Oriented Object Detection
Wang, Kai
Xiao, Zhifeng
Wan, Qiao
Xia, Fanfan
Chen, Pin
Li, Deren
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[48] Adapting Object Size Variance and Class Imbalance for Semi-supervised Object Detection
Nie, Yuxiang
Fang, Chaowei
Cheng, Lechao
Lin, Liang
Li, Guanbin
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1966 - 1974
[49] S 3 Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection
Zhu, Lei
Wang, Xiaoqiang
Li, Ping
Yang, Xin
Zhang, Qing
Wang, Weiming
Schonlieb, Carola-Bibiane
Chen, C. L. Philip
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 676 - 689
[50] ProUDA: Progressive unsupervised data augmentation for semi-Supervised 3D object detection on point cloud
An, Pei
Liang, Junxiong
Ma, Tao
Chen, Yanfei
Wang, Liheng
Ma, Jie
PATTERN RECOGNITION LETTERS, 2023, 170 : 64 - 69

← 1 2 3 4 5 →