Multi-scale Global Reasoning Unit for Semantic Segmentation

Cited by: 0

Authors:

Domae, Yukihiro [1]

Aizawa, Hiroaki [1]

Kato, Kunihito [1]

Affiliations:

[1] Gifu Univ, 1-1 Yanagido, Gifu 5011193, Japan

Source:

FRONTIERS OF COMPUTER VISION, IW-FCV 2021 | 2021 / Vol. 1405

Keywords:

Semantic segmentation; Graph convolution; Global reasoning

DOI:

10.1007/978-3-030-81638-4_4

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Capturing contextual information in a scene is essential for semantic segmentation. GloRe [1] learns to infer context from a graph-based feature constructed by its Global Reasoning unit. The graph nodes are features grouped into regions of the image space, and the edges are the relationships between nodes. Because the inference depends on this graph, a failure to construct it results in poor performance. To address this problem, we propose a novel unit, the Multi-scale Global Reasoning Unit, which constructs the graph using multi-scale information. It considers the relationships between regions that retain detailed multi-scale spatial information. Specifically, the proposed unit consists of a Feature Aggregation Module and a Global Reasoning Module. The former selects the features required to construct the graph from multi-scale features. The latter uses GloRe to infer the relationships from those features. The unit is trained end to end. In experiments, we evaluate the proposed method on the Cityscapes and PASCAL-Context datasets and confirm that it outperforms the original GloRe.
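
The graph-based reasoning step described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the function name `global_reasoning`, the softmax assignment of pixels to nodes, the residual connection, and all weight shapes are assumptions made for illustration, and the Feature Aggregation Module is omitted because the abstract does not specify how it selects features. The sketch only shows the general pattern of pooling pixel features into region nodes, exchanging information along graph edges, and projecting the result back to the pixels.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Swap rows and columns of a matrix."""
    return [list(col) for col in zip(*m)]


def softmax(row):
    """Numerically stable softmax over one row."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def global_reasoning(x, w_proj, adj, w_node):
    """Illustrative graph reasoning over pixel features (all shapes assumed).

    x:      L x C pixel features (L pixels, C channels)
    w_proj: C x N weights that score each pixel against N nodes (regions)
    adj:    N x N edge weights between nodes
    w_node: C x C channel transform applied to each node
    """
    # Soft-assign every pixel to the N graph nodes (assumed softmax).
    assign = [softmax(r) for r in matmul(x, w_proj)]      # L x N
    # Each node is the assignment-weighted sum of pixel features.
    nodes = matmul(transpose(assign), x)                  # N x C
    # One graph convolution step: mix along edges, then transform channels.
    nodes = matmul(matmul(adj, nodes), w_node)            # N x C
    # Project node features back to pixels and add them to the input.
    back = matmul(assign, nodes)                          # L x C
    return [[xi + bi for xi, bi in zip(xr, br)] for xr, br in zip(x, back)]
```

Calling `global_reasoning` with `L` pixel rows of `C` channels returns a feature list of the same shape, so in this sketch the step could be placed between existing layers of a segmentation network without changing the feature dimensions.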


Pages: 46-56

Page count: 11

