Full-Scale Selective Transformer for Semantic Segmentation

被引:0
|
作者
Lin, Fangjian [1 ,2 ,3 ]
Wu, Sitong [2 ]
Ma, Yizhe [1 ]
Tian, Shengwei [1 ]
机构
[1] Xinjiang Univ, Sch Software, Urumqi, Peoples R China
[2] Baidu VIS, Beijing, Peoples R China
[3] Baidu Res, Inst Deep Learning, Beijing, Peoples R China
来源
关键词
Semantic segmentation; Transformer; Full-scale feature fusion;
D O I
10.1007/978-3-031-26293-7_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we rethink the multi-scale feature fusion from two perspectives (scale-level and spatial-level) and propose a full-scale selective fusion strategy for semantic segmentation. Based on such strategy, we design a novel segmentation network, named Full-scale Selective Transformer (FSFormer). Specifically, our FSFormer adaptively selects partial tokens from all tokens at all scales to construct a token subset of interest for each scale. Therefore, each token only interacts with the tokens within its corresponding token subset of interest. The proposed full-scale selective fusion strategy can not only filter out the noisy information propagation but also reduce the computational costs to some extent. We evaluate our FSFormer on four challenging semantic segmentation benchmarks, including PASCAL Context, ADE20K, COCO-Stuff 10K, and Cityscapes, outperforming the state-of-the-art methods. We evaluate our FSFormer on four challenging semantic segmentation benchmarks, including PASCAL Context, ADE20K, COCO-Stuff 10K, and Cityscapes, outperforming the state-of-the-art methods.
引用
收藏
页码:310 / 326
页数:17
相关论文
共 50 条
  • [41] Scene sketch semantic segmentation with hierarchical Transformer
    Yang, Jie
    Ke, Aihua
    Yu, Yaoxiang
    Cai, Bo
    KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [42] Graph Structure Guided Transformer for Semantic Segmentation
    Qian, Luyang
    Zhang, Canlong
    Li, Zhixin
    Wang, Zhiwen
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 915 - 922
  • [43] A Unified Efficient Pyramid Transformer for Semantic Segmentation
    Zhu, Fangrui
    Zhu, Yi
    Zhang, Li
    Wu, Chongruo
    Fu, Yanwei
    Li, Mu
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2667 - 2677
  • [44] MMSFormer: Multimodal Transformer for Material and Semantic Segmentation
    Reza, Md Kaykobad
    Prater-Bennette, Ashley
    Asif, M. Salman
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 599 - 610
  • [45] CoT: Contourlet Transformer for Hierarchical Semantic Segmentation
    Shao, Yilin
    Sun, Long
    Jiao, Licheng
    Liu, Xu
    Liu, Fang
    Li, Lingling
    Yang, Shuyuan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 132 - 146
  • [46] Semantic segmentation using tag label and transformer
    Jeong S.-W.
    Kim E.-C.
    Yoo J.
    Journal of Institute of Control, Robotics and Systems, 2021, 27 (12) : 1029 - 1037
  • [47] CoT: Contourlet Transformer for Hierarchical Semantic Segmentation
    Shao, Yilin
    Sun, Long
    Jiao, Licheng
    Liu, Xu
    Liu, Fang
    Li, Lingling
    Yang, Shuyuan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 132 - 146
  • [48] MarsFormer: Martian Rock Semantic Segmentation With Transformer
    Xiong, Yonggang
    Xiao, Xueming
    Yao, Meibao
    Liu, Haiqiang
    Yang, Hong
    Fu, Yuegang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [49] STATISTICS OF FULL-SCALE SURFACE PRESSURES
    MILFORD, RV
    WALDECK, JL
    JOURNAL OF WIND ENGINEERING AND INDUSTRIAL AERODYNAMICS, 1988, 30 (1-3) : 35 - 44
  • [50] FULL-SCALE HYDRAULIC MINE PLANNED
    JACKSON, D
    COAL AGE, 1983, 88 (09): : 69 - &