Semantic Segmentation of Aerial Imagery Using U-Net with Self-Attention and Separable Convolutions

被引:1
|
作者
Khan, Bakht Alam [1 ]
Jung, Jin-Woo [1 ]
机构
[1] Dongguk Univ, Dept Comp Sci & Engn, Seoul 04620, South Korea
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 09期
关键词
semantic segmentation; U-Net; self-attention; separable convolutions; aerial imagery; remote sensing; RESOLUTION; SATELLITE; NETWORK;
D O I
10.3390/app14093712
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This research addresses the crucial task of improving accuracy in the semantic segmentation of aerial imagery, essential for applications such as urban planning and environmental monitoring. This study emphasizes the significance of maintaining the Intersection over Union (IOU) score as a metric and employs data augmentation with the Patchify library, using a patch size of 256, to effectively augment the dataset, which is subsequently split into training and testing sets. The core of this investigation lies in a novel architecture that combines a U-Net framework with self-attention mechanisms and separable convolutions. The introduction of self-attention mechanisms enhances the model's understanding of image context, while separable convolutions expedite the training process, contributing to overall efficiency. The proposed model demonstrates a substantial accuracy improvement, surpassing the previous state-of-the-art Dense Plus U-Net, achieving an accuracy of 91% compared to the former's 86%. Visual representations, including original patch images, original masked patches, and predicted patch masks, showcase the model's proficiency in semantic segmentation, marking a significant advancement in aerial image analysis and underscoring the importance of innovative architectural elements for enhanced accuracy and efficiency in such tasks.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Automated seismic semantic segmentation using attention U-Net
    Alsalmi, Haifa
    Elsheikh, Ahmed H.
    GEOPHYSICS, 2024, 89 (01) : WA247 - WA263
  • [2] Modernized Training of U-Net for Aerial Semantic Segmentation
    Straka, Jakub
    Gruber, Ivan
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 785 - 793
  • [3] Self-Attention in Reconstruction Bias U-Net for Semantic Segmentation of Building Rooftops in Optical Remote Sensing Images
    Chen, Ziyi
    Li, Dilong
    Fan, Wentao
    Guan, Haiyan
    Wang, Cheng
    Li, Jonathan
    REMOTE SENSING, 2021, 13 (13)
  • [4] Semantic Segmentation of Tumors in Kidneys using Attention U-Net Models
    Geethanjali, T. M.
    Minavathi
    Dinesh, M. S.
    2021 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER TECHNOLOGIES AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2021, : 286 - 290
  • [5] SAU-Net: Medical Image Segmentation Method Based on U-Net and Self-Attention
    Zhang S.-J.
    Peng Z.
    Li H.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (10): : 2433 - 2442
  • [6] Bilateral U-Net semantic segmentation with spatial attention mechanism
    Zhao Guangzhe
    Zhang Yimeng
    Maoning Ge
    Yu Min
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (02) : 297 - 307
  • [7] Attention-augmented U-Net (AA-U-Net) for semantic segmentation
    Kumar T. Rajamani
    Priya Rani
    Hanna Siebert
    Rajkumar ElagiriRamalingam
    Mattias P. Heinrich
    Signal, Image and Video Processing, 2023, 17 : 981 - 989
  • [8] Attention-augmented U-Net (AA-U-Net) for semantic segmentation
    Rajamani, Kumar T.
    Rani, Priya
    Siebert, Hanna
    ElagiriRamalingam, Rajkumar
    Heinrich, Mattias P.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 981 - 989
  • [9] CHARACTERIZING SPEECH ADVERSARIAL EXAMPLES USING SELF-ATTENTION U-NET ENHANCEMENT
    Yang, Chao-Han
    Qi, Jun
    Chen, Pin-Yu
    Ma, Xiaoli
    Lee, Chin-Hui
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3107 - 3111
  • [10] Improved organs at risk segmentation based on modified U-Net with self-attention and consistency regularisation
    Manko, Maksym
    Popov, Anton
    Gorriz, Juan Manuel
    Ramirez, Javier
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (04) : 850 - 865