Efficient Attention Pyramid Network for Semantic Segmentation

Cited by: 8
Authors
Yang, Qirui [1 ,2 ,3 ]
Ku, Tao [1 ,2 ]
Hu, Kunyuan [1 ,2 ]
Affiliations
[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
[2] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Peoples R China
[3] Univ Chinese Acad Sci, Sch Comp & Control, Beijing 100049, Peoples R China
Keywords
Semantics; Convolution; Feature extraction; Task analysis; Image segmentation; Decoding; Computer vision; Semantic segmentation; attention mechanism; spatial pyramid; PASCAL VOC 2012; Cityscapes;
DOI
10.1109/ACCESS.2021.3053316
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Semantic segmentation is a task that covers most of the perception needs of intelligent vehicles in a unified way. Recent studies have shown that attention mechanisms achieve impressive performance in computer vision tasks. Current attention-based segmentation methods differ from each other in the position and form of the attention mechanism, and perform differently in practice. This paper first examines the effectiveness of multi-scale context features and attention mechanisms in segmentation tasks. We find that multi-scale features and channel attention can play a vital role in constructing effective context features. Based on this analysis, this paper proposes an efficient attention pyramid network (EAPNet) for semantic segmentation. Specifically, to efficiently handle the problem of segmenting objects at multiple scales, we design an efficient channel attention pyramid (ECAP), which employs atrous convolution with channel attention in cascade or in parallel to capture multi-scale context by using multiple atrous rates. Furthermore, we propose a residual attention fusion block (RAFB), whose purpose is to simultaneously focus on meaningful low-level feature maps and spatial location information. We also explore different channel attention modules and spatial attention modules, and describe their impact on network performance. We empirically evaluate EAPNet on two semantic segmentation datasets, PASCAL VOC 2012 and Cityscapes. Experimental results show that, without MS COCO pre-training or any post-processing, EAPNet achieves 81.7% mIoU on the PASCAL VOC 2012 validation set. With DeepLabv3+ as the baseline, EAPNet improves model performance by more than 1.50% mIoU.
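The mIoU figures quoted in the abstract are mean intersection-over-union scores. As a minimal sketch of how such a score is commonly computed (this is not the authors' evaluation code; the function name `mean_iou` and the toy label lists are hypothetical), per-class IoU is TP / (TP + FP + FN), averaged over the classes that occur in the prediction or the ground truth:

```python
def mean_iou(pred, target, num_classes):
    """Mean IoU over classes that appear in pred or target (flat label lists)."""
    # confusion[t][p] counts pixels whose true class is t and predicted class is p
    confusion = [[0] * num_classes for _ in range(num_classes)]
    for p, t in zip(pred, target):
        confusion[t][p] += 1

    ious = []
    for c in range(num_classes):
        tp = confusion[c][c]
        fp = sum(confusion[t][c] for t in range(num_classes)) - tp  # column sum minus TP
        fn = sum(confusion[c]) - tp                                 # row sum minus TP
        denom = tp + fp + fn
        if denom > 0:  # skip classes absent from both pred and target
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with three classes and six pixels (illustrative values only)
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
```

In this toy case the per-class IoUs are 1/3, 2/3 and 1/2, so `score` is 0.5. Benchmark toolkits may differ in details such as ignored labels or accumulating the confusion matrix over the whole dataset before averaging.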
Pages: 18867-18875
Number of pages: 9
Related papers
50 in total
  • [31] POINT SET ATTENTION NETWORK FOR SEMANTIC SEGMENTATION
    Jiang, Jie
    Liu, Jing
    Fu, Jun
    Zhu, Xinxin
    Lu, Hanqing
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2186 - 2190
  • [32] Grouped Double Attention Network for Semantic Segmentation
    Chen Xiaolong
    Zhao Ji
    Chen Siyi
    Du Xinhao
    Liu Xin
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (22)
  • [33] Semantic Segmentation Network Based on Integral Attention
    Xiong, Siqi
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON COMPUTER AND MULTIMEDIA TECHNOLOGY, ICCMT 2024, 2024, : 285 - 288
  • [34] RANet: Region Attention Network for Semantic Segmentation
    Shen, Dingguo
    Ji, Yuanfeng
    Li, Ping
    Wang, Yi
    Lin, Di
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [35] Realtime Global Attention Network for Semantic Segmentation
    Mo, Xi
    Chen, Xiangyu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 1574 - 1580
  • [36] SPARSE SPATIAL ATTENTION NETWORK FOR SEMANTIC SEGMENTATION
    Liu, Mengyu
    Yin, Hujun
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 644 - 648
  • [37] FCPFNet: Feature Complementation Network with Pyramid Fusion for Semantic Segmentation
    Lei, Jingsheng
    Shu, Chente
    Xu, Qiang
    Yu, Yunxiang
    Yang, Shengying
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [38] FCPFNet: Feature Complementation Network with Pyramid Fusion for Semantic Segmentation
    Jingsheng Lei
    Chente Shu
    Qiang Xu
    Yunxiang Yu
    Shengying Yang
    Neural Processing Letters, 56
  • [39] SCARF: A Semantic Constrained Attention Refinement Network for Semantic Segmentation
    Ding, Xiaofeng
    Shen, Chaomin
    Che, Zhengping
    Zeng, Tieyong
    Peng, Yaxin
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3002 - 3011
  • [40] Knowledge and Spatial Pyramid Distance-Based Gated Graph Attention Network for Remote Sensing Semantic Segmentation
    Cui, Wei
    He, Xin
    Yao, Meng
    Wang, Ziwei
    Hao, Yuanjie
    Li, Jie
    Wu, Weijie
    Zhao, Huilin
    Xia, Cong
    Li, Jin
    Cui, Wenqi
    REMOTE SENSING, 2021, 13 (07)