Learning graph structures with transformer for weakly supervised semantic segmentation

Cited by: 1
Authors
Sun, Wanchun [1]
Feng, Xin [1, 2]
Ma, Hui [3]
Liu, Jingyao [1, 4]
Affiliations
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun 130022, Peoples R China
[2] Changchun Univ Sci & Technol, Chongqing Res Inst, Chongqing 401122, Peoples R China
[3] Anhui Vocat Coll Police Officers, Comp Basic Teaching & Res Dept, Hefei 232001, Peoples R China
[4] Chuzhou Univ, Sch Comp & Informat Engn, Chuzhou 239000, Peoples R China
Keywords
Weakly supervised; Transformer; Graph convolutional network; Semantic segmentation
DOI
10.1007/s40747-023-01152-x
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Weakly supervised semantic segmentation (WSSS) is a challenging computer vision task. State-of-the-art semantic segmentation methods are usually based on convolutional neural networks (CNNs), which have two main drawbacks: they cannot capture global information correctly, and they fail to activate potential object regions. To avoid these drawbacks, transformers have been explored for the WSSS task, but a transformer does not by itself establish an effective semantic association between different patch tokens. To address this issue, and inspired by the graph convolutional network (GCN), this paper proposes a graph structure that learns the semantic category relationships between different blocks in the vector sequence. To verify the effectiveness of the proposed method, extensive experiments were conducted on the publicly available PASCAL VOC2012 dataset. The experimental results show that the proposed method achieves a significant performance improvement on the WSSS task and outperforms other state-of-the-art transformer-based methods.
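The abstract does not specify how the graph over patch tokens is built, so the following is only a minimal sketch of the general idea of running one graph convolution over a sequence of patch tokens. It is not the authors' implementation. The cosine-similarity affinity, the symmetric normalization, the ReLU, and all names (`cosine_affinity`, `normalize`, `gcn_layer`, the toy `tokens` and `weight`) are assumptions made for illustration.

```python
import math

def cosine_affinity(tokens):
    """Dense affinity between tokens: cosine similarity clipped at 0, plus self-loops.
    Assumption: the paper may build its graph differently."""
    norms = [math.sqrt(sum(x * x for x in t)) or 1.0 for t in tokens]
    n = len(tokens)
    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            dot = sum(a * b for a, b in zip(tokens[i], tokens[j]))
            adj[i][j] = max(dot / (norms[i] * norms[j]), 0.0)
        adj[i][i] = 1.0  # self-loop keeps each token's own feature
    return adj

def normalize(adj):
    """Symmetric normalization D^{-1/2} A D^{-1/2}, as in a standard GCN layer."""
    deg = [sum(row) for row in adj]
    return [[a / math.sqrt(deg[i] * deg[j]) for j, a in enumerate(row)]
            for i, row in enumerate(adj)]

def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def gcn_layer(tokens, weight):
    """One graph convolution: ReLU(A_norm @ X @ W)."""
    adj = normalize(cosine_affinity(tokens))
    hidden = matmul(matmul(adj, tokens), weight)
    return [[max(v, 0.0) for v in row] for row in hidden]

# Toy input: three 2-dimensional "patch tokens"; the first two are similar.
tokens = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
weight = [[1.0, 0.0], [0.0, 1.0]]  # identity weight, for readability only
out = gcn_layer(tokens, weight)
```

In this toy setup the first two tokens exchange most of their features with each other, while the third stays close to its own feature, which is the kind of semantic grouping between patches that a graph layer is meant to encourage.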
Pages: 7511-7521
Page count: 11
Related papers
50 in total
  • [31] Hierarchical Semantic Contrast for Weakly Supervised Semantic Segmentation
    Wu, Yuanchen
    Li, Xiaoqiang
    Dai, Songmin
    Li, Jide
    Liu, Tong
    Xie, Shaorong
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1542 - 1550
  • [32] Looking Beyond Single Images for Weakly Supervised Semantic Segmentation Learning
    Wang, Wenguan
    Sun, Guolei
    Van Gool, Luc
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1635 - 1649
  • [33] Weakly Supervised Semantic Segmentation via Adversarial Learning of Classifier and Reconstructor
    Kweon, Hyeokjun
    Yoon, Sung-Hoon
    Yoon, Kuk-Jin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11329 - 11339
  • [34] An optic disk semantic segmentation method based on weakly supervised learning
    Pan, Feng
    Lu, Zheng
    Chen, Dali
    Xue, Dingyu
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 4791 - 4794
  • [35] Learning Semantic Segmentation Score in Weakly Supervised Convolutional Neural Network
    Ikhwantri, Fariz
    Habibie, Novian
    Syulistyo, Arie Rachmad
    Aprinaldi
    Jatmiko, Wisnu
    2015 INTERNATIONAL CONFERENCE ON COMPUTERS, COMMUNICATIONS, AND SYSTEMS (ICCCS), 2015, : 19 - 25
  • [36] Leveraging Auxiliary Tasks with Affinity Learning for Weakly Supervised Semantic Segmentation
    Xu, Lian
    Ouyang, Wanli
    Bennamoun, Mohammed
    Boussaid, Farid
    Sohel, Ferdous
    Xu, Dan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6964 - 6973
  • [37] Credible Dual-Expert Learning for Weakly Supervised Semantic Segmentation
    Zhang, Bingfeng
    Xiao, Jimin
    Wei, Yunchao
    Zhao, Yao
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (08) : 1892 - 1908
  • [38] Learning pseudo labels for semi-and-weakly supervised semantic segmentation
    Wang, Yude
    Zhang, Jie
    Kan, Meina
    Shan, Shiguang
    PATTERN RECOGNITION, 2022, 132
  • [39] Multi-representation fusion learning for weakly supervised semantic segmentation
    Li, Yongqiang
    Hu, Chuanping
    Ren, Kai
    Xi, Hao
    Fan, Jinhao
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 277
  • [40] eX-ViT: A Novel explainable vision transformer for weakly supervised semantic segmentation
    Yu, Lu
    Xiang, Wei
    Fang, Juan
    Chen, Yi-Ping Phoebe
    Chi, Lianhua
    PATTERN RECOGNITION, 2023, 142