Attention-Guided Multi-Scale Fusion Network for Similar Objects Semantic Segmentation

被引:0
|
作者
Yao, Fengqin [1 ]
Wang, Shengke [1 ]
Ding, Laihui [2 ]
Zhong, Guoqiang [1 ]
Li, Shu [1 ]
Xu, Zhiwei [2 ]
机构
[1] Ocean Univ China, Qingdao 266100, Peoples R China
[2] Shandong Willand Intelligent Technol Co Ltd, Qingdao 266100, Peoples R China
关键词
Semantic segmentation; Attention-guided; Multi-scale fusion; High inter-class similarity;
D O I
10.1007/s12559-023-10206-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image segmentation accuracy is critical in marine ecological detection utilizing unmanned aerial vehicles (UAVs). By flying a drone around, we can swiftly determine the location of a variety of species. However, remote sensing photos, particularly those of inter-class items, are remarkably similar, and there are a significant number of little objects. The universal segmentation network is ineffective. This research constructs attentional networks that imitate the human cognitive system, inspired by camouflaged object detection and the management of human attentional mechanisms in the recognition of diverse things. This research proposes TriseNet, an attention-guided multi-scale fusion semantic segmentation network that solves the challenges of high item similarity and poor segmentation accuracy in UAV settings. To begin, we employ a bidirectional feature extraction network to extract low-level spatial and high-level semantic information. Second, we leverage the attention-induced cross-level fusion module (ACFM) to create a new multi-scale fusion branch that performs cross-level learning and enhances the representation of inter-class comparable objects. Finally, the receptive field block (RFB) module is used to increase the receptive field, resulting in richer characteristics in specific layers. The inter-class similarity increases the difficulty of segmentation accuracy greatly, whereas the three modules improve feature expression and segmentation results. Experiments are conducted using our UAV dataset, UAV-OUC-SEG (55.61% MIoU), and the public dataset, Cityscapes (76.10% MIoU), to demonstrate the efficacy of our strategy. In two datasets, the TriseNet delivers the best results when compared to other prominent segmentation algorithms.
引用
收藏
页码:366 / 376
页数:11
相关论文
共 50 条
  • [21] Multi-Scale Context Aggregation Network with Attention-Guided for Crowd Counting
    Wang, Xin
    Lv, Rongrong
    Zhao, Yang
    Yang, Tangwen
    Ruan, Qiuqi
    PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 240 - 245
  • [22] Attention-Guided Multi-Scale Segmentation Neural Network for Interactive Extraction of Region Objects from High-Resolution Satellite Imagery
    Li, Kun
    Hu, Xiangyun
    Jiang, Huiwei
    Shu, Zhen
    Zhang, Mi
    REMOTE SENSING, 2020, 12 (05)
  • [23] Collaborative Attention Guided Multi-Scale Feature Fusion Network for Medical Image Segmentation
    Xu, Zhenghua
    Tian, Biao
    Liu, Shijie
    Wang, Xiangtao
    Yuan, Di
    Gu, Junhua
    Chen, Junyang
    Lukasiewicz, Thomas
    Leung, Victor C. M.
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02): : 1857 - 1871
  • [24] Attention-Guided Multi-modal and Multi-scale Fusion for Multispectral Pedestrian Detection
    Bao, Wei
    Huang, Meiyu
    Hu, Jingjing
    Xiang, Xueshuang
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 382 - 393
  • [25] Deformable image registration with attention-guided fusion of multi-scale deformation fields
    Zhiquan He
    Yupeng He
    Wenming Cao
    Applied Intelligence, 2023, 53 : 2936 - 2950
  • [26] Adaptive multi-scale dual attention network for semantic segmentation
    Wang, Weizhen
    Wang, Suyu
    Li, Yue
    Jin, Yishu
    NEUROCOMPUTING, 2021, 460 : 39 - 49
  • [27] Deformable image registration with attention-guided fusion of multi-scale deformation fields
    He, Zhiquan
    He, Yupeng
    Cao, Wenming
    APPLIED INTELLIGENCE, 2023, 53 (03) : 2936 - 2950
  • [28] Dual Attention Based Multi-scale Feature Fusion Network for Indoor RGBD Semantic Segmentation
    Hua, Zhongwei
    Qi, Lizhe
    Du, Daming
    Jiang, Wenxuan
    Sun, Yunquan
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3639 - 3644
  • [29] A Multi-Scale Attention Fusion Network for Retinal Vessel Segmentation
    Wang, Shubin
    Chen, Yuanyuan
    Yi, Zhang
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [30] LQCANet: Learnable-Query-Guided Multi-Scale Fusion Network Based on Cross-Attention for Radar Semantic Segmentation
    Zhuang, Long
    Jiang, Tiezhen
    Jiang, Hao
    Wang, Anqi
    Huang, Zhixiang
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (02): : 3330 - 3344