Channel2DTransformer: A Multi-level Features Self-attention Fusion Module for Semantic Segmentation

Authors
Liu, Weitao [1]
Wu, Junjun [1]
Affiliations
[1] Foshan Univ, Guangdong Prov Key Lab Ind Intelligent Inspection, Foshan, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
Semantic segmentation; Channel2DTransformer; Self-attention; Deep learning;
DOI
10.1007/s44196-024-00630-5
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Semantic segmentation is a crucial technology for intelligent vehicles, enabling scene understanding in complex driving environments. However, real-world scenes often contain diverse objects at multiple scales, which makes accurate semantic segmentation challenging. To address this challenge, we propose a multi-level feature self-attention fusion module called Channel2DTransformer. The module uses self-attention to dynamically fuse multi-level features by computing self-attention weights between their channels, yielding a consistent and comprehensive representation of scene features. We evaluate the module on the Cityscapes and NYUDepthV2 datasets, both of which contain a large number of multi-scale objects. The experimental results show that the module improves segmentation accuracy for multi-scale objects and improves semantic segmentation performance in complex scenes.
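The abstract describes the fusion step only at a high level, so the following is a minimal illustrative sketch of channel-wise self-attention across feature levels, not the authors' Channel2DTransformer. It assumes that all levels have already been resized to the same spatial size and flattened, and it uses the raw channels as queries, keys, and values with no learned projections. The function name `channel_attention_fuse` and the toy inputs are hypothetical.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def channel_attention_fuse(levels):
    """Fuse feature levels with attention computed between channels.

    levels: list of feature maps; each feature map is a list of channels,
    and each channel is a flat list of N floats (H*W positions).
    Assumption: every channel in every level has the same length N.
    Returns (fused_channels, attention_weights).
    """
    # Stack the channels of all levels into one list of C channels.
    channels = [ch for level in levels for ch in level]
    n = len(channels[0])
    if any(len(ch) != n for ch in channels):
        raise ValueError("all channels must share the same spatial size")

    scale = math.sqrt(n)
    weights = []
    for query in channels:
        # Scaled dot-product similarity between this channel and every channel.
        scores = [sum(a * b for a, b in zip(query, key)) / scale
                  for key in channels]
        weights.append(softmax(scores))

    # Each fused channel is an attention-weighted sum of all channels.
    fused = [
        [sum(w * ch[p] for w, ch in zip(row, channels)) for p in range(n)]
        for row in weights
    ]
    return fused, weights


# Toy input: a 2-channel "low-level" map and a 1-channel "high-level" map,
# each channel being a flattened 2x2 grid.
low = [[1.0, 0.0, 0.0, 1.0], [0.0, 1.0, 1.0, 0.0]]
high = [[0.5, 0.5, 0.5, 0.5]]
fused, weights = channel_attention_fuse([low, high])
```

Here `weights` is a C x C matrix (C = 3 in the toy input) whose rows sum to 1, and `fused` has the same shape as the stacked input channels. A trained module would typically add learned query, key, and value projections and an output layer, which this sketch omits.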
Pages: 11
Related papers
50 in total
  • [41] Sequence recommendation using multi-level self-attention network with gated spiking neural P systems
    Bai, Xinzhu
    Huang, Yanping
    Peng, Hong
    Wang, Jun
    Yang, Qian
    Orellana-Martin, David
    Ramirez-de-Arellano, Antonio
    Perez-Jimenez, Mario J.
    INFORMATION SCIENCES, 2024, 656
  • [42] Semantic segmentation-assisted instance feature fusion for multi-level 3D part instance segmentation
    Sun, Chun-Yu
    Tong, Xin
    Liu, Yang
    COMPUTATIONAL VISUAL MEDIA, 2023, 9 (04) : 699 - 715
  • [43] Semantic segmentation-assisted instance feature fusion for multi-level 3D part instance segmentation
    Chun-Yu Sun
    Xin Tong
    Yang Liu
    Computational Visual Media, 2023, 9 : 699 - 715
  • [44] Multi-scale Self-attention Based Semi-supervised Remote Sensing Image Semantic Segmentation
    Sun, Deyan
    Liu, Hai
    Chen, Wei
    Zhu, Pengcheng
    Chen, Dufeng
    Liu, Jueting
    Wang, Jiaqi
    Wu, Yuliang
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 430 - 439
  • [45] Weakly supervised semantic segmentation for point cloud based on view-based adversarial training and self-attention fusion
    Miao, Yongwei
    Ren, Guoxiang
    Wang, Jinrong
    Liu, Fuchang
    COMPUTERS & GRAPHICS-UK, 2023, 116 : 46 - 54
  • [46] Small-scale Image Semantic Segmentation Method Based on Multi-level Superposition and Enhancement Fusion
    Su, Xiaodong
    Liang, Hongyu
    Yao, Guilin
    Li, Hui
    Li, Shizhou
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 1502 - 1507
  • [47] SEMANTICS-GUIDED MULTI-LEVEL RGB-D FEATURE FUSION FOR INDOOR SEMANTIC SEGMENTATION
    Li, Yabei
    Zhang, Junge
    Cheng, Yanhua
    Huang, Kaiqi
    Tan, Tieniu
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1262 - 1266
  • [48] Multi-level features fusion via cross-layer guided attention for hyperspectral pansharpening
    Hou, Shaoxiong
    Xiao, Song
    Dong, Wenqian
    Qu, Jiahui
    NEUROCOMPUTING, 2022, 506 : 380 - 392
  • [49] Multi-level multi-type self-generated knowledge fusion for cardiac ultrasound segmentation
    Yu, Chengjin
    Li, Shuang
    Ghista, Dhanjoo
    Gao, Zhifan
    Zhang, Heye
    Del Ser, Javier
    Xu, Lin
    INFORMATION FUSION, 2023, 92 : 1 - 12
  • [50] ISSMF: Integrated semantic and spatial information of multi-level features for automatic segmentation in prenatal ultrasound images
    Sun, Yihao
    Yang, Hongjian
    Zhou, Jiliu
    Wang, Yan
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 125