Multi-scale Global Reasoning Unit for Semantic Segmentation

被引:0
|
作者
Domae, Yukihiro [1 ]
Aizawa, Hiroaki [1 ]
Kato, Kunihito [1 ]
机构
[1] Gifu Univ, 1-1 Yanagido, Gifu 5011193, Japan
来源
FRONTIERS OF COMPUTER VISION, IW-FCV 2021 | 2021年 / 1405卷
关键词
Semantic segmentation; Graph convolution; Global reasoning;
D O I
10.1007/978-3-030-81638-4_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Obtaining context information in a scene is an essential ability for semantic segmentation. GloRe [1] learns to infer the context from a graph-based feature constructed by the GlobalReasoning unit. The graph nodes are features that are segmented into regions in image space, and the edges are relationships between nodes. Therefore, a failure to construct the graph results in poor performance. In this study, to resolve this problem, we propose a novel unit to construct the graph using multi-scale information. We call it Multi-scale Global Reasoning Unit. It considers the relationship between each region that retains detailed multi-scale spatial information. Specifically, the proposed unit consists of a Feature Aggregation Module and a Global Reasoning Module. The former selects the features required to construct the graph using the multi-scale features. The latter uses GloRe to infer the relationship from the features. The unit is trained in an end-to-end manner. In experiments, we evaluate the effectiveness of the proposed method on Cityscapes and Pascal-context datasets. As a result, we confirmed that the proposed method outperforms the original GloRe.
引用
收藏
页码:46 / 56
页数:11
相关论文
共 50 条
  • [21] Multi-Scale Recursive Context Aggregation Network for Semantic Segmentation
    Yalcin, Abdullah
    Keskinoz, Mehmet
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [22] Multi-scale sequential network for semantic text segmentation and localization
    Villamizar, Michael
    Canevet, Olivier
    Odobez, Jean-Marc
    PATTERN RECOGNITION LETTERS, 2020, 129 : 63 - 69
  • [23] DNS: A multi-scale deconvolution semantic segmentation network for joint detection and segmentation
    Feng, Ning
    Dong, Le
    Zhang, Qianni
    Zhang, Ning
    Wu, Xi
    Chen, Jianwen
    2018 INTERNATIONAL JOINT CONFERENCE ON METALLURGICAL AND MATERIALS ENGINEERING (JCMME 2018), 2019, 277
  • [24] Multi-Scale Semantic Segmentation for Fire Smoke Image Based on Global Information and U-Net
    Zheng, Yuanpan
    Wang, Zhenyu
    Xu, Boyang
    Niu, Yiqing
    ELECTRONICS, 2022, 11 (17)
  • [25] Mix-layers semantic extraction and multi-scale aggregation transformer for semantic segmentation
    Li, Tianping
    Yang, Xiaolong
    Zhang, Zhenyi
    Cui, Zhaotong
    Maoxia, Zhou
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
  • [26] Efficient Parallel Multi-Scale Detail and Semantic Encoding Network for Lightweight Semantic Segmentation
    Liu, Xiao
    Shi, Xiuya
    Chen, Lufei
    Qing, Linbo
    Ren, Chao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2544 - 2552
  • [27] Multi-scale Fusion and Global Semantic Encoding for Affordance Detection
    Zhang, Yang
    Li, Huiyong
    Ren, Tao
    Dou, Yuanbo
    Li, Qingfeng
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [28] Multi-scale fusion for RGB-D indoor semantic segmentation
    Jiang, Shiyi
    Xu, Yang
    Li, Danyang
    Fan, Runze
    SCIENTIFIC REPORTS, 2022, 12 (01):
  • [29] Adaptive multi-scale feature fusion with spatial translation for semantic segmentation
    Wang, Hongru
    Wang, Haoyu
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 8337 - 8348
  • [30] Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation
    Gu, Jiaqi
    Kwon, Hyoukjun
    Wang, Dilin
    Ye, Wei
    Li, Meng
    Chen, Yu-Hsin
    Lai, Liangzhen
    Chandra, Vikas
    Pan, David Z.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12084 - 12093