RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer

被引:0
|
作者
Wang, Jian [1 ]
Gou, Chenhui [2 ]
Wu, Qiman [1 ]
Feng, Haocheng [1 ]
Han, Junyu [1 ]
Ding, Errui [1 ]
Wang, Jingdong [1 ]
机构
[1] Baidu VIS, Sunnyvale, CA 94089 USA
[2] Australian Natl Univ, Canberra, ACT, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, transformer-based networks have shown impressive results in semantic segmentation. Yet for real-time semantic segmentation, pure CNN-based approaches still dominate in this field, due to the time-consuming computation mechanism of transformer. We propose RTFormer, an efficient dual-resolution transformer for real-time semantic segmenation, which achieves better trade-off between performance and efficiency than CNN-based models. To achieve high inference efficiency on GPU-like devices, our RTFormer leverages GPU-Friendly Attention with linear complexity and discards the multi-head mechanism. Besides, we find that cross-resolution attention is more efficient to gather global context information for high-resolution branch by spreading the high level knowledge learned from low-resolution branch. Extensive experiments on mainstream benchmarks demonstrate the effectiveness of our proposed RTFormer, it achieves state-of-the-art on Cityscapes, CamVid and COCOStuff, and shows promising results on ADE20K. Code is available at PaddleSeg[24]: https://github.com/PaddlePaddle/PaddleSeg.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] EffSegmentNet: Efficient Design for Real-time Semantic Segmentation
    Wang, Cyun-Bo
    Ding, Jian-Jiun
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 2104 - 2111
  • [2] Lightweight Real-Time Semantic Segmentation Network With Efficient Transformer and CNN
    Xu, Guoan
    Li, Juncheng
    Gao, Guangwei
    Lu, Huimin
    Yang, Jian
    Yue, Dong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (12) : 15897 - 15906
  • [3] EMSFomer: Efficient Multi-Scale Transformer for Real-Time Semantic Segmentation
    Xia, Zhengyu
    Kim, Joohee
    IEEE ACCESS, 2025, 13 : 18239 - 18252
  • [4] A Novel Mamba Architecture with a Semantic Transformer for Efficient Real-Time Remote Sensing Semantic Segmentation
    Ding, Hao
    Xia, Bo
    Liu, Weilin
    Zhang, Zekai
    Zhang, Jinglin
    Wang, Xing
    Xu, Sen
    REMOTE SENSING, 2024, 16 (14)
  • [5] Efficient ConvNet for Real-time Semantic Segmentation
    Romera, Eduardo
    Alvarez, Jose M.
    Bergasa, Luis M.
    Arroyo, Roberto
    2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017), 2017, : 1789 - 1794
  • [6] Lightweight and efficient asymmetric network design for real-time semantic segmentation
    Zhang, Xiu-Ling
    Du, Bing-Ce
    Luo, Zhao-Ci
    Ma, Kai
    APPLIED INTELLIGENCE, 2022, 52 (01) : 564 - 579
  • [7] Lightweight and efficient asymmetric network design for real-time semantic segmentation
    Xiu-Ling Zhang
    Bing-Ce Du
    Zhao-Ci Luo
    Kai Ma
    Applied Intelligence, 2022, 52 : 564 - 579
  • [8] A Real-Time Semantic Segmentation Method Based on Transformer for Autonomous Driving
    Hao, Weiyu
    Wang, Jingyi
    Lu, Huimin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (03): : 4419 - 4433
  • [9] Efficient Adaptive Upsampling Module for Real-Time Semantic Segmentation
    Yang, Xinneng
    Wu, Yan
    Zhao, Junqiao
    Liu, Feilin
    Liao, Yujun
    Mo, Yujian
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (14)
  • [10] Efficient use of recent progresses for Real-time Semantic segmentation
    El Houfi, Safae
    Majda, Aicha
    MACHINE VISION AND APPLICATIONS, 2020, 31 (06)