Semantic segmentation of underwater images based on the improved SegFormer

被引:0
|
作者
Chen, Bowei [1 ,2 ]
Zhao, Wei [1 ,2 ]
Zhang, Qiusheng [3 ]
Li, Mingliang [3 ]
Qi, Mingyang [3 ]
Tang, You [3 ,4 ,5 ]
机构
[1] Qingdao Innovat & Dev Base, Harbin, Peoples R China
[2] Harbin Engn Univ, Lab Underwater Intelligence, Qingdao, Peoples R China
[3] Jilin Agr Sci & Technol Univ, Elect & Informat Engn Coll, Jilin, Peoples R China
[4] Jilin Agr Univ, Coll Informat Technol, Changchun, Peoples R China
[5] Yanbian Univ, Coll Agr, Yanji, Peoples R China
关键词
underwater images; semantic segmentation; attention mechanism; feature fusion; SegFormer;
D O I
10.3389/fmars.2025.1522160
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Underwater images segmentation is essential for tasks such as underwater exploration, marine environmental monitoring, and resource development. Nevertheless, given the complexity and variability of the underwater environment, improving model accuracy remains a key challenge in underwater image segmentation tasks. To address these issues, this study presents a high-performance semantic segmentation approach for underwater images based on the standard SegFormer model. First, the Mix Transformer backbone in SegFormer is replaced with a Swin Transformer to enhance feature extraction and facilitate efficient acquisition of global context information. Next, the Efficient Multi-scale Attention (EMA) mechanism is introduced in the backbone's downsampling stages and the decoder to better capture multi-scale features, further improving segmentation accuracy. Furthermore, a Feature Pyramid Network (FPN) structure is incorporated into the decoder to combine feature maps at multiple resolutions, allowing the model to integrate contextual information effectively, enhancing robustness in complex underwater environments. Testing on the SUIM underwater image dataset shows that the proposed model achieves high performance across multiple metrics: mean Intersection over Union (MIoU) of 77.00%, mean Recall (mRecall) of 85.04%, mean Precision (mPrecision) of 89.03%, and mean F1score (mF1score) of 86.63%. Compared to the standard SegFormer, it demonstrates improvements of 3.73% in MIoU, 1.98% in mRecall, 3.38% in mPrecision, and 2.44% in mF1score, with an increase of 9.89M parameters. The results demonstrate that the proposed method achieves superior segmentation accuracy with minimal additional computation, showcasing high performance in underwater image segmentation.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Improved SegFormer Network Based Method for Semantic Segmentation of Remote Sensing Images
    Tian, Xuewei
    Wang, Jiali
    Chen, Ming
    Du, Shouqing
    Computer Engineering and Applications, 2023, 59 (08): : 217 - 226
  • [2] Semantic Segmentation of Underwater Images Based on Improved Deeplab
    Liu, Fangfang
    Fang, Ming
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2020, 8 (03)
  • [3] Efficient Semantic Segmentation of Nuclei in Histopathology Images Using Segformer
    Khaled, Marwan
    Hammouda, Mostafa A.
    Ali, Hesham
    Elattar, Mustafa
    Selim, Sahar
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2023, 2024, 14122 : 81 - 95
  • [4] Semantic segmentation of microbial alterations based on SegFormer
    Elmessery, Wael M.
    Maklakov, Danil V.
    El-Messery, Tamer M.
    Baranenko, Denis A.
    Gutierrez, Joaquin
    Shams, Mahmoud Y.
    El-Hafeez, Tarek Abd
    Elsayed, Salah
    Alhag, Sadeq K.
    Moghanm, Farahat S.
    Mulyukin, Maksim A.
    Petrova, Yuliya Yu.
    Elwakeel, Abdallah E.
    FRONTIERS IN PLANT SCIENCE, 2024, 15
  • [5] Improved SegFormer Remote Sensing Image Semantic Segmentation Network
    Zhang, Hao
    He, Lingmin
    Pan, Chen
    Computer Engineering and Applications, 2023, 59 (24) : 248 - 258
  • [6] Improving Semantic Segmentation Performance in Underwater Images
    Nunes, Alexandra
    Matos, Anibal
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (12)
  • [7] Solar cell defect segmentation model based on improved SegFormer
    Luo, Wei
    Yan, Zuotao
    Guan, Jiahao
    Han, Jian
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (12): : 2459 - 2468
  • [8] DYNAMICALLY PRUNING SEGFORMER FOR EFFICIENT SEMANTIC SEGMENTATION
    Bai, Haoli
    Mao, Hongda
    Nair, Dinesh
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3298 - 3302
  • [9] Semantic Segmentation of Cucumber Leaf Disease Spots Based on ECA-SegFormer
    Yang, Ruotong
    Guo, Yaojiang
    Hu, Zhiwei
    Gao, Ruibo
    Yang, Hua
    AGRICULTURE-BASEL, 2023, 13 (08):
  • [10] Semantic segmentation method of underwater images based on encoder-decoder architecture
    Wang, Jinkang
    He, Xiaohui
    Shao, Faming
    Lu, Guanlin
    Hu, Ruizhe
    Jiang, Qunyan
    PLOS ONE, 2022, 17 (08):