Progressive expansion for semi-supervised bi-modal salient object detection

被引:2
|
作者
Wang, Jie [1 ]
Zhang, Zihao [1 ]
Yu, Nana [1 ]
Han, Yahong [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
关键词
Bi-modal salient object detection; Cross-model fusion; Semi-supervised learning; NETWORK; CONTEXT;
D O I
10.1016/j.patcog.2024.110868
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing bi-modal salient object detection (SOD) methods primarily rely on fully supervised training strategies that require extensive manual annotation. Undoubtedly, extensive manual annotation is time-consuming and laborious, and the fully supervised strategy is also prone to overfitting on the training set. Therefore, we introduce a semi-supervised learning architecture (SSLA) to alleviate these problems while ensuring detection performance. Considering that the inherent training mode and concise architecture of basic SSLA will limit its ability to effectively explore the learning potential of the model, we further propose two optimization strategies, dynamic adjustment and active expansion. Specifically, we dynamically adjust the supervision scheme for unlabeled samples during training so that the model can continuously utilize the model's gains (pseudo labels) to supervise and guide the model to further explore the unlabeled samples. Furthermore, the active expansion strategy enables the model to acquire more beneficial supervised information and focuses its attention on difficult-to-segment samples. In summary, an effective progressive expansion network (PENet) architecture for semi-supervised bi-modal SOD is proposed. Extensive experiments indicate that our PENet architecture, while effectively alleviating 90% of annotation burdens, has achieved highly competitive results in RGB-T and RGB-D tasks compared to fully supervised methods. The performance is even more pronounced during cross-dataset testing.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Scale-Equivalent Distillation for Semi-Supervised Object Detection
    Guo, Qiushan
    Mu, Yao
    Chen, Jianyu
    Wang, Tianqi
    Yu, Yizhou
    Luo, Ping
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14502 - 14511
  • [42] Consistency-based Semi-supervised Learning for Object Detection
    Jeong, Jisoo
    Lee, Seungeui
    Kim, Jeesoo
    Kwak, Nojun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [43] Active and semi-supervised learning for object detection with imperfect data
    Rhee, Phill Kyu
    Erdenee, Enkhbayar
    Kyun, Shin Dong
    Ahmed, Minhaz Uddin
    Jin, Songguo
    COGNITIVE SYSTEMS RESEARCH, 2017, 45 : 109 - 123
  • [44] Interpolation-based Semi-supervised Learning for Object Detection
    Jeong, Jisoo
    Verma, Vikas
    Hyun, Minsung
    Kannala, Juho
    Kwak, Nojun
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11597 - 11606
  • [45] Semi-supervised learning based object detection in aerial imagery
    Yao, J
    Zhang, ZF
    2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol 1, Proceedings, 2005, : 1011 - 1016
  • [46] Dense Information Learning Based Semi-Supervised Object Detection
    Yang, Xi
    Li, Penghui
    Zhou, Qiubai
    Wang, Nannan
    Gao, Xinbo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1022 - 1035
  • [47] Global Focal Learning for Semi-Supervised Oriented Object Detection
    Wang, Kai
    Xiao, Zhifeng
    Wan, Qiao
    Xia, Fanfan
    Chen, Pin
    Li, Deren
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [48] Adapting Object Size Variance and Class Imbalance for Semi-supervised Object Detection
    Nie, Yuxiang
    Fang, Chaowei
    Cheng, Lechao
    Lin, Liang
    Li, Guanbin
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1966 - 1974
  • [49] S 3 Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection
    Zhu, Lei
    Wang, Xiaoqiang
    Li, Ping
    Yang, Xin
    Zhang, Qing
    Wang, Weiming
    Schonlieb, Carola-Bibiane
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 676 - 689
  • [50] ProUDA: Progressive unsupervised data augmentation for semi-Supervised 3D object detection on point cloud
    An, Pei
    Liang, Junxiong
    Ma, Tao
    Chen, Yanfei
    Wang, Liheng
    Ma, Jie
    PATTERN RECOGNITION LETTERS, 2023, 170 : 64 - 69