Semantic segmentation using stride spatial pyramid pooling and dual attention decoder

Cited by: 62
Authors
Peng, Chengli [1]
Ma, Jiayi [1]
Affiliations
[1] Wuhan Univ, Elect Informat Sch, Wuhan 430072, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Semantic segmentation; Convolutional neural networks; Pyramid pooling; Attention mechanism; NETWORKS; FORCE;
DOI
10.1016/j.patcog.2020.107498
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Semantic segmentation is an end-to-end task that requires both semantic and spatial accuracy. Deep learning-based segmentation methods therefore need to make effective use of both the high-level feature map, which is rich in semantic information, and the low-level feature map, which carries accurate spatial information. However, existing segmentation networks typically cannot take full advantage of these two kinds of feature maps, leading to inferior performance. This paper attempts to overcome this challenge by introducing two novel structures. On the one hand, we propose a structure called stride spatial pyramid pooling (SSPP) to capture multiscale semantic information from the high-level feature map. Compared with existing pyramid pooling methods based on atrous convolution, the SSPP structure gathers more information from the high-level feature map at a faster inference speed, which significantly improves how efficiently the high-level feature map is used. On the other hand, we propose a dual attention decoder consisting of a channel attention branch and a spatial attention branch to make full use of the high- and low-level feature maps simultaneously. The dual attention decoder yields a more "semantic" low-level feature map and a high-level feature map with more accurate spatial information, which bridges the gap between these two kinds of feature maps and benefits their fusion. We evaluate the proposed model on several publicly available semantic image segmentation benchmarks, including PASCAL VOC 2012, Cityscapes and COCO-Stuff. The qualitative and quantitative results demonstrate that our method achieves state-of-the-art performance. © 2020 Elsevier Ltd. All rights reserved.
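The abstract does not give the internals of the dual attention decoder, so the following is only a minimal illustrative sketch of the general idea it describes: channel weights derived from the high-level map rescale the low-level map, and spatial weights derived from the low-level map rescale the high-level map, before the two are fused. Everything here is an assumption made for illustration and is not the authors' implementation: the function names, the parameter-free pooling-plus-sigmoid attention, the additive fusion, and the premise that both maps already share the same [C][H][W] shape. A real decoder would use learned convolutions.

```python
import math


def sigmoid(x):
    """Logistic function, used to squash attention scores into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_weights(feat):
    """Hypothetical channel attention: global-average-pool each channel
    of a [C][H][W] map and squash the result into (0, 1)."""
    weights = []
    for channel in feat:
        total = sum(sum(row) for row in channel)
        count = len(channel) * len(channel[0])
        weights.append(sigmoid(total / count))
    return weights


def spatial_weights(feat):
    """Hypothetical spatial attention: average over channels at each
    pixel of a [C][H][W] map and squash the result into (0, 1)."""
    c, h, w = len(feat), len(feat[0]), len(feat[0][0])
    return [
        [sigmoid(sum(feat[k][i][j] for k in range(c)) / c) for j in range(w)]
        for i in range(h)
    ]


def dual_attention_fuse(high, low):
    """Fuse two same-shaped [C][H][W] maps (assumed already aligned).

    Channel weights from `high` rescale `low` (injecting semantics);
    spatial weights from `low` rescale `high` (injecting spatial detail).
    The two reweighted maps are then added element-wise.
    """
    cw = channel_weights(high)
    sw = spatial_weights(low)
    c, h, w = len(high), len(high[0]), len(high[0][0])
    return [
        [
            [cw[k] * low[k][i][j] + sw[i][j] * high[k][i][j] for j in range(w)]
            for i in range(h)
        ]
        for k in range(c)
    ]


# Toy 2-channel, 2x2 example.
high = [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]
low = [[[2.0, 0.0], [0.0, 0.0]], [[-2.0, 0.0], [0.0, 0.0]]]
fused = dual_attention_fuse(high, low)
```

In this toy case every attention weight is 0.5 (the sigmoid of a zero average), so `fused` is simply half of `low`. The sketch omits learned parameters and upsampling of the high-level map.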
Pages: 15
Related papers
50 in total
  • [1] Mixed spatial pyramid pooling for semantic segmentation
    Xia, Zhengyu
    Kim, Joohee
    APPLIED SOFT COMPUTING, 2020, 91
  • [2] Large Kernel Spatial Pyramid Pooling for Semantic Segmentation
    Yang, Jiayi
    Hu, Tianshi
    Yang, Junli
    Zhang, Zhaoxing
    Pan, Yue
    IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 595 - 605
  • [3] Deformable Spatial Pyramid Pooling for Road Scene Semantic Segmentation
    Lee, Jaeyoung
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021 : 2903 - 2908
  • [4] Encoder-decoder with double spatial pyramid for semantic segmentation
    Kong, Huifang
    Hu, Jie
    Fan, Lei
    Zhang, Xiaoxue
    Fang, Yao
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (06)
  • [5] Cascaded hierarchical atrous spatial pyramid pooling module for semantic segmentation
    Lian, Xuhang
    Pang, Yanwei
    Han, Jungong
    Pan, Jing
    PATTERN RECOGNITION, 2021, 110
  • [6] Semantic Segmentation Based on Spatial Pyramid Pooling and Multilayer Feature Fusion
    Ji, Jian
    Li, Sitong
    Liao, Xianfu
    Zhang, Fangrong
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (03) : 1524 - 1535
  • [7] Pooling Attention-based Encoder-Decoder Network for semantic segmentation
    Xu, Haixia
    Huang, Yunjia
    Hancock, Edwin R.
    Wang, Shuailong
    Xuan, Qijun
    Zhou, Wei
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 93
  • [8] PPEDNet: Pyramid Pooling Encoder-Decoder Network for Real-Time Semantic Segmentation
    Tan, Zhentao
    Liu, Bin
    Yu, Nenghai
    IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 328 - 339
  • [9] Semantic segmentation with hybrid pyramid pooling and stacked pyramid structure
    Lian, Xuhang
    Pang, Yanwei
    Han, Jungong
    Pan, Jing
    NEUROCOMPUTING, 2020, 410 : 454 - 467
  • [10] Analysis of Spatial Pyramid Pooling Variations in Semantic Segmentation for Satellite Image Applications
    Abdani, Siti Raihanah
    Zulkifley, Mohd Asyraf
    Zulkifley, Nuraisyah Hani
    2021 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATION (DASA), 2021