Progressive expansion for semi-supervised bi-modal salient object detection

被引：2

作者：

Wang, Jie ^{[1
]}

Zhang, Zihao ^{[1
]}

Yu, Nana ^{[1
]}

Han, Yahong ^{[1
]}

机构：

[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China

来源：

PATTERN RECOGNITION | 2025年 / 157卷

关键词：

Bi-modal salient object detection; Cross-model fusion; Semi-supervised learning; NETWORK; CONTEXT;

D O I：

10.1016/j.patcog.2024.110868

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Existing bi-modal salient object detection (SOD) methods primarily rely on fully supervised training strategies that require extensive manual annotation. Undoubtedly, extensive manual annotation is time-consuming and laborious, and the fully supervised strategy is also prone to overfitting on the training set. Therefore, we introduce a semi-supervised learning architecture (SSLA) to alleviate these problems while ensuring detection performance. Considering that the inherent training mode and concise architecture of basic SSLA will limit its ability to effectively explore the learning potential of the model, we further propose two optimization strategies, dynamic adjustment and active expansion. Specifically, we dynamically adjust the supervision scheme for unlabeled samples during training so that the model can continuously utilize the model's gains (pseudo labels) to supervise and guide the model to further explore the unlabeled samples. Furthermore, the active expansion strategy enables the model to acquire more beneficial supervised information and focuses its attention on difficult-to-segment samples. In summary, an effective progressive expansion network (PENet) architecture for semi-supervised bi-modal SOD is proposed. Extensive experiments indicate that our PENet architecture, while effectively alleviating 90% of annotation burdens, has achieved highly competitive results in RGB-T and RGB-D tasks compared to fully supervised methods. The performance is even more pronounced during cross-dataset testing.

引用

页数：15

共 50 条

[21] Rethinking Pseudo Labels for Semi-supervised Object Detection
Li, Hengduo
Wu, Zuxuan
Shrivastava, Abhinav
Davis, Larry S.
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1314 - 1322
[22] Lesion Localization in OCT by Semi-Supervised Object Detection
Wu, Yue
Zhou, Yang
Zhao, Jianchun
Yang, Jingyuan
Yu, Weihong
Chen, Youxin
Li, Xirong
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 639 - 646
[23] WEAKLY SEMI-SUPERVISED ORIENTED OBJECT DETECTION WITH POINTS
Zhang, Ziming
Wang, Yucheng
He, Chu
Zhang, Qingyi
Chen, Xi
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3080 - 3084
[24] Semi-supervised Object Detection via VC Learning
Chen, Changrui
Debattista, Kurt
Han, Jungong
COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 169 - 185
[25] SOOD: Towards Semi-Supervised Oriented Object Detection
Hua, Wei
Liang, Dingkang
Li, Jingyu
Liu, Xiaolong
Zou, Zhikang
Ye, Xiaoqing
Bai, Xiang
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15558 - 15567
[26] CrossRectify: Leveraging disagreement for semi-supervised object detection
Ma, Chengcheng
Pan, Xingjia
Ye, Qixiang
Tang, Fan
Dong, Weiming
Xu, Changsheng
PATTERN RECOGNITION, 2023, 137
[27] SEMI-SUPERVISED OBJECT DETECTION WITH SPARSELY ANNOTATED DATASET
Yoon, Jihun
Hong, Seungbum
Choi, Min-Kook
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 719 - 723
[28] STOD: toward semi-supervised tiny object detection
Guo Y.
Feng Y.
Du K.
Cao L.
Neural Computing and Applications, 2024, 36 (27) : 17107 - 17123
[29] Dense Learning based Semi-Supervised Object Detection
Chen, Binghui
Li, Pengyu
Chen, Xiang
Wang, Biao
Zhang, Lei
Hua, Xian-Sheng
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4805 - 4814
[30] Semi-supervised Learning of Feature Hierarchies for Object Detection in a Video
Yang, Yang
Shu, Guang
Shah, Mubarak
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1650 - 1657

← 1 2 3 4 5 →