Learning graph structures with transformer for weakly supervised semantic segmentation

Cited by: 1
Authors
Sun, Wanchun [1]
Feng, Xin [1, 2]
Ma, Hui [3]
Liu, Jingyao [1, 4]
Affiliations
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun 130022, Peoples R China
[2] Changchun Univ Sci & Technol, Chongqing Res Inst, Chongqing 401122, Peoples R China
[3] Anhui Vocat Coll Police Officers, Comp Basic Teaching & Res Dept, Hefei 232001, Peoples R China
[4] Chuzhou Univ, Sch Comp & Informat Engn, Chuzhou 239000, Peoples R China
Keywords
Weakly supervised; Transformer; Graph convolutional network; Semantic segmentation
DOI
10.1007/s40747-023-01152-x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Weakly supervised semantic segmentation (WSSS) is a challenging task in computer vision. State-of-the-art semantic segmentation methods are usually based on convolutional neural networks (CNNs), which struggle to capture global information correctly and fail to activate potential object regions. To avoid these drawbacks, the transformer has been explored for the WSSS task, but a transformer alone cannot establish effective semantic associations between different patch tokens. To address this issue, inspired by the graph convolutional network (GCN), this paper proposes a graph structure that learns the semantic category relationships between different blocks in the vector sequence. To verify the effectiveness of the proposed method, a large number of experiments were conducted on the publicly available PASCAL VOC 2012 dataset. The experimental results show that the proposed method achieves a significant performance improvement on the WSSS task and outperforms other state-of-the-art transformer-based methods.
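The abstract does not say how the graph over patch tokens is built, so the following is only a minimal illustrative sketch of the general idea (a graph convolution applied to a sequence of token vectors), not the authors' method. The cosine-similarity threshold used to create edges, the function names, and the toy data are all assumptions introduced here; the propagation rule is the standard symmetric-normalized GCN step ReLU(Â H W).

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def build_adjacency(tokens, threshold=0.5):
    """Assumed edge rule: link tokens whose cosine similarity exceeds
    `threshold`; the diagonal is set to 1 so every node has a self-loop."""
    n = len(tokens)
    return [[1.0 if i == j or cosine(tokens[i], tokens[j]) > threshold else 0.0
             for j in range(n)] for i in range(n)]

def normalize(adj):
    """Symmetric normalization D^{-1/2} A D^{-1/2}.
    Self-loops guarantee every degree is at least 1."""
    deg = [sum(row) for row in adj]
    n = len(adj)
    return [[adj[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]

def matmul(a, b):
    """Plain matrix product of two nested-list matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def gcn_layer(tokens, weight, threshold=0.5):
    """One graph-convolution step: ReLU(A_hat @ H @ W)."""
    a_hat = normalize(build_adjacency(tokens, threshold))
    out = matmul(matmul(a_hat, tokens), weight)
    return [[max(0.0, x) for x in row] for row in out]

# Toy input: four 2-D "patch tokens"; the first two and the last two are similar.
tokens = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
identity = [[1.0, 0.0], [0.0, 1.0]]  # identity weight, so only propagation acts
updated = gcn_layer(tokens, identity)
```

With the identity weight, each token is replaced by the average of itself and its similar neighbor, which shows how graph propagation pulls semantically related tokens toward a shared representation. In a trained model the weight matrix would be learned and the tokens would come from a transformer encoder.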
Pages: 7511-7521
Page count: 11
Related papers
50 in total
  • [41] Learning to Exploit the Prior Network Knowledge for Weakly Supervised Semantic Segmentation
    Redondo-Cabrera, Carolina
    Baptista-Rios, Marcos
    Lopez-Sastre, Roberto J.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (07) : 3649 - 3661
  • [42] Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation
    Chen, Hongjun
    Wang, Jinbao
    Chen, Hong Cai
    Zhen, Xiantong
    Zheng, Feng
    Ji, Rongrong
    Shao, Ling
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6900 - 6909
  • [43] Credible Dual-Expert Learning for Weakly Supervised Semantic Segmentation
    Bingfeng Zhang
    Jimin Xiao
    Yunchao Wei
    Yao Zhao
    International Journal of Computer Vision, 2023, 131 : 1892 - 1908
  • [44] Weakly Supervised Learning for Point Cloud Semantic Segmentation With Dual Teacher
    Yao, Baochen
    Xiao, Hui
    Zhuang, Jiayan
    Peng, Chengbin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6347 - 6354
  • [45] Exclusive Constrained Discriminative Learning for Weakly-Supervised Semantic Segmentation
    Ying, Peng
    Liu, Jing
    Lu, Hanqing
    Ma, Songde
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1251 - 1254
  • [46] Weakly-supervised Incremental learning for Semantic segmentation with Class Hierarchy
    Kim, Hyoseo
    Choe, Junsuk
    PATTERN RECOGNITION LETTERS, 2024, 182 : 31 - 38
  • [47] Effects of Network Depths on Semantic Image Segmentation By Weakly Supervised Learning
    Bircanoglu, Cenk
    Arica, Nafiz
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [48] TransCAM: Transformer attention-based CAM refinement for Weakly supervised semantic segmentation
    Li, Ruiwen
    Mai, Zheda
    Zhang, Zhibo
    Jang, Jongseong
    Sanner, Scott
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 92
  • [49] MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation
    Xu, Lian
    Bennamoun, Mohammed
    Boussaid, Farid
    Laga, Hamid
    Ouyang, Wanli
    Xu, Dan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 8380 - 8395
  • [50] Prototypical Transformer for Weakly Supervised Action Segmentation
    Lin, Tao
    Chang, Xiaobin
    Sun, Wei
    Zheng, Weishi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 195 - 206