Progressive expansion for semi-supervised bi-modal salient object detection

被引:2
|
作者
Wang, Jie [1 ]
Zhang, Zihao [1 ]
Yu, Nana [1 ]
Han, Yahong [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
关键词
Bi-modal salient object detection; Cross-model fusion; Semi-supervised learning; NETWORK; CONTEXT;
D O I
10.1016/j.patcog.2024.110868
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing bi-modal salient object detection (SOD) methods primarily rely on fully supervised training strategies that require extensive manual annotation. Undoubtedly, extensive manual annotation is time-consuming and laborious, and the fully supervised strategy is also prone to overfitting on the training set. Therefore, we introduce a semi-supervised learning architecture (SSLA) to alleviate these problems while ensuring detection performance. Considering that the inherent training mode and concise architecture of basic SSLA will limit its ability to effectively explore the learning potential of the model, we further propose two optimization strategies, dynamic adjustment and active expansion. Specifically, we dynamically adjust the supervision scheme for unlabeled samples during training so that the model can continuously utilize the model's gains (pseudo labels) to supervise and guide the model to further explore the unlabeled samples. Furthermore, the active expansion strategy enables the model to acquire more beneficial supervised information and focuses its attention on difficult-to-segment samples. In summary, an effective progressive expansion network (PENet) architecture for semi-supervised bi-modal SOD is proposed. Extensive experiments indicate that our PENet architecture, while effectively alleviating 90% of annotation burdens, has achieved highly competitive results in RGB-T and RGB-D tasks compared to fully supervised methods. The performance is even more pronounced during cross-dataset testing.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Rethinking Pseudo Labels for Semi-supervised Object Detection
    Li, Hengduo
    Wu, Zuxuan
    Shrivastava, Abhinav
    Davis, Larry S.
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1314 - 1322
  • [22] Lesion Localization in OCT by Semi-Supervised Object Detection
    Wu, Yue
    Zhou, Yang
    Zhao, Jianchun
    Yang, Jingyuan
    Yu, Weihong
    Chen, Youxin
    Li, Xirong
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 639 - 646
  • [23] WEAKLY SEMI-SUPERVISED ORIENTED OBJECT DETECTION WITH POINTS
    Zhang, Ziming
    Wang, Yucheng
    He, Chu
    Zhang, Qingyi
    Chen, Xi
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3080 - 3084
  • [24] Semi-supervised Object Detection via VC Learning
    Chen, Changrui
    Debattista, Kurt
    Han, Jungong
    COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 169 - 185
  • [25] SOOD: Towards Semi-Supervised Oriented Object Detection
    Hua, Wei
    Liang, Dingkang
    Li, Jingyu
    Liu, Xiaolong
    Zou, Zhikang
    Ye, Xiaoqing
    Bai, Xiang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15558 - 15567
  • [26] CrossRectify: Leveraging disagreement for semi-supervised object detection
    Ma, Chengcheng
    Pan, Xingjia
    Ye, Qixiang
    Tang, Fan
    Dong, Weiming
    Xu, Changsheng
    PATTERN RECOGNITION, 2023, 137
  • [27] SEMI-SUPERVISED OBJECT DETECTION WITH SPARSELY ANNOTATED DATASET
    Yoon, Jihun
    Hong, Seungbum
    Choi, Min-Kook
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 719 - 723
  • [28] STOD: toward semi-supervised tiny object detection
    Guo Y.
    Feng Y.
    Du K.
    Cao L.
    Neural Computing and Applications, 2024, 36 (27) : 17107 - 17123
  • [29] Dense Learning based Semi-Supervised Object Detection
    Chen, Binghui
    Li, Pengyu
    Chen, Xiang
    Wang, Biao
    Zhang, Lei
    Hua, Xian-Sheng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4805 - 4814
  • [30] Semi-supervised Learning of Feature Hierarchies for Object Detection in a Video
    Yang, Yang
    Shu, Guang
    Shah, Mubarak
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1650 - 1657