Semantic segmentation using stride spatial pyramid pooling and dual attention decoder

被引:62
|
作者
Peng, Chengli [1 ]
Ma, Jiayi [1 ]
机构
[1] Wuhan Univ, Elect Informat Sch, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic segmentation; Convolutional neural networks; Pyramid pooling; Attention mechanism; NETWORKS; FORCE;
D O I
10.1016/j.patcog.2020.107498
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation is an end-to-end task that requires both semantic and spatial accuracy. It is important for deep learning-based segmentation methods to effectively utilize the high-level feature map whose semantic information is abundant and the low-level feature map whose spatial information is accurate. However, existing segmentation networks typically cannot take full advantage of these two kinds of feature maps, leading to inferior performance. This paper attempts to overcome this challenge by introducing two novel structures. On the one hand, we propose a structure called stride spatial pyramid pooling (SSPP) to capture multiscale semantic information from the high-level feature map. Compared with existing pyramid pooling methods based on the atrous convolution, the SSPP structure is able to gather more information from the high-level feature map with faster inference speed, which improves the utilization efficiency of the high-level feature map significantly. On the other hand, we propose a dual attention decoder consisting of a channel attention branch and a spatial attention branch to make full use of the high- and low-level feature maps simultaneously. The dual attention decoder can result in a more "semantic" low-level feature map and a high-level feature map with more accurate spatial information, which bridges the gap between these two kinds of feature maps and benefits their fusion. We evaluate the proposed model on several publicly available semantic image segmentation benchmarks including PASCAL VOC 2012, Cityscapes and COCO-Stuff. The qualitative and quantitative results demonstrate that our method can achieve the state-of-the-art performance. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Utilizing the Clique Atrous Spatial Pyramid Pooling for Pancreas Segmentation
    Yang, M.
    Qi, X.
    Tan, S.
    MEDICAL PHYSICS, 2019, 46 (06) : E448 - E448
  • [32] PCANet: Pyramid convolutional attention network for semantic segmentation
    Sang, Haiwei
    Zhou, Qiuhao
    Zhao, Yong
    IMAGE AND VISION COMPUTING, 2020, 103
  • [33] An Efficient Semantic Segmentation Method using Pyramid ShuffleNet V2 with Vortex Pooling
    Dong, Jiansheng
    Yuan, Jingling
    Li, Lin
    Zhong, Xian
    Liu, Weiru
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1214 - 1220
  • [34] BMSeNet: Multiscale Context Pyramid Pooling and Spatial Detail Enhancement Network for Real-Time Semantic Segmentation
    Zhao, Shan
    Zhao, Xin
    Huo, Zhanqiang
    Zhang, Fukai
    SENSORS, 2024, 24 (16)
  • [35] Real-time semantic segmentation network with an enhanced backbone based on Atrous spatial pyramid pooling module
    Song, Xingguo
    Fang, Xiaojie
    Meng, Xiangyin
    Fang, Xu
    Lv, Maoting
    Zhuo, Yue
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [36] Enhanced global attention upsample decoder based on enhanced spatial attention and feature aggregation module for semantic segmentation
    Yin, Lianglu
    Hu, Haifeng
    ELECTRONICS LETTERS, 2020, 56 (13) : 659 - 661
  • [37] Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation
    Peng, Chengli
    Tian, Tian
    Chen, Chen
    Guo, Xiaojie
    Ma, Jiayi
    NEURAL NETWORKS, 2021, 137 : 188 - 199
  • [38] Stripe Pooling Attention for Real-Time Semantic Segmentation
    Lyu J.
    Sun Y.
    Xu P.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (09): : 1395 - 1404
  • [39] Lidar Point Semantic Segmentation Using Dual Attention Mechanism
    Wang, Haosen
    Zhou, Yuan
    Chen, Tiankai
    Qian, Feng
    Ma, Yue
    Wang, Shifeng
    Lu, Bo
    JOURNAL OF RUSSIAN LASER RESEARCH, 2023, 44 (02) : 224 - 234
  • [40] Spatial Pyramid Based Graph Reasoning for Semantic Segmentation
    Li, Xia
    Yang, Yibo
    Zhao, Qijie
    Shen, Tiancheng
    Lin, Zhouchen
    Liu, Hong
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 8947 - 8956