Category-Aware Saliency Enhance Learning Based on CLIP for Weakly Supervised Salient Object Detection

被引:0
|
作者
Yunde Zhang
Zhili Zhang
Tianshan Liu
Jun Kong
机构
[1] Jiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education)
[2] Anhui University,School of Computer Science and Technology
[3] The Hong Kong Polytechnic University,Department of Electronic and Information Engineering
来源
关键词
Weakly supervised; Salient object detection; Category-aware Saliency Enhance Learning; CLIP;
D O I
暂无
中图分类号
学科分类号
摘要
Weakly supervised salient object detection (SOD) using image-level category labels has been proposed to reduce the annotation cost of pixel-level labels. However, existing methods mostly train a classification network to generate a class activation map, which suffers from coarse localization and difficult pseudo-label updating. To address these issues, we propose a novel Category-aware Saliency Enhance Learning (CSEL) method based on contrastive vision-language pre-training (CLIP), which can perform image-text classification and pseudo-label updating simultaneously. Our proposed method transforms image-text classification into pixel-text matching and generates a category-aware saliency map, which is evaluated by the classification accuracy. Moreover, CSEL assesses the quality of the category-aware saliency map and the pseudo saliency map, and uses the quality confidence scores as weights to update the pseudo labels. The two maps mutually enhance each other to guide the pseudo saliency map in the correct direction. Our SOD network can be trained jointly under the supervision of the updated pseudo saliency maps. We test our model on various well-known RGB-D and RGB SOD datasets. Our model achieves an S-measure of 87.6%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} on the RGB-D NLPR dataset and 84.3%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} on the RGB ECSSD dataset. Additionally, we obtain satisfactory performance on the weakly supervised E-measure, F-measure, and mean absolute error metrics for other datasets. These results demonstrate the effectiveness of our model.
引用
收藏
相关论文
共 50 条
  • [41] CA-CLIP: category-aware adaptation of CLIP model for few-shot class-incremental learning
    Xu, Yuqiao
    Huang, Shucheng
    Zhou, Haoliang
    MULTIMEDIA SYSTEMS, 2024, 30 (03)
  • [42] Weakly Supervised Video Salient Object Detection via Point Supervision
    Gao, Shuyong
    Xing, Haozhe
    Zhang, Wei
    Wang, Yan
    Guo, Qianyu
    Zhang, Wenqiang
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3656 - 3665
  • [43] Weakly-Supervised Salient Object Detection Using Point Supervison
    Gao, Shuyong
    Zhang, Wei
    Wang, Yan
    Guo, Qianyu
    Zhang, Chenglong
    He, Yangji
    Zhang, Wenqiang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 670 - 678
  • [44] Complementary characteristics fusion network for weakly supervised salient object detection
    Liu, Yan
    Zhang, Yunzhou
    Wang, Zhenyu
    Yang, Fei
    Qin, Cao
    Qiu, Feng
    Coleman, Sonya
    Kerr, Dermot
    IMAGE AND VISION COMPUTING, 2022, 126
  • [45] Weakly Supervised Salient Object Detection With Spatiotemporal Cascade Neural Networks
    Tang, Yi
    Zou, Wenbin
    Jin, Zhi
    Chen, Yuhuan
    Hua, Yang
    Li, Xia
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (07) : 1973 - 1984
  • [46] Zigzag Learning for Weakly Supervised Object Detection
    Zhang, Xiaopeng
    Feng, Jiashi
    Xiong, Hongkai
    Tian, Qi
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4262 - 4270
  • [47] Collaborative Learning for Weakly Supervised Object Detection
    Wang, Jiajie
    Yao, Jiangchao
    Zhang, Ya
    Zhang, Rui
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 971 - 977
  • [48] Synthesize Boundaries: A Boundary-Aware Self-Consistent Framework for Weakly Supervised Salient Object Detection
    Xu, Binwei
    Liang, Haoran
    Liang, Ronghua
    Chen, Peng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4194 - 4205
  • [49] Webly-supervised learning for salient object detection
    Luo, Ao
    Li, Xin
    Yang, Fan
    Jiao, Zhicheng
    Cheng, Hong
    PATTERN RECOGNITION, 2020, 103
  • [50] Co-saliency Detection via Weakly Supervised Learning
    Kompella, Aditya
    Kulkarni, Raghavendra V.
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,