A Swin Transformer, YOLO, and Weighted Boxes Fusion-Based Approach for Tree Detection in Satellite Images

被引:3
|
作者
Durgut, Ozan [1 ]
Unsalan, Cem [2 ]
机构
[1] Marmara Univ, Elekt Elekt Muhendisligi, Istanbul, Turkiye
[2] Yeditepe Univ, Elekt Elekt Muhendisligi, Istanbul, Turkiye
关键词
Swin Transformers; YOLO; Weighted Box Fusion; Tree Detection; BENEFITS; FORESTS;
D O I
10.1109/SIU61531.2024.10601134
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Forests are terrestrial ecosystems that provide ecological balance and environmental benefits. Due to uncontrolled logging, increasing population, and commercial use, forest reserves are rapidly decreasing worldwide. This situation carries risks that will lead to climate crisis and biodiversity loss. For this reason, trees need to be protected and kept under control. Since human-powered control efforts are insufficient, the use of satellite images and artificial intelligence-based systems has come to the fore to prevent forest destruction. Nowadays, there are studies on tree detection applications from satellite images using different deep learning models. This article aims to combine Swin transformers and YOLO models with a post-processing technique known as weighted boxes fusion to enhance object detection capabilities. The object detection approaches of the two models are different. When the wide variation detection ability of the Swin transformers method is combined with the low variation but sharp detection ability of the YOLO model, the results combine the advantages of both models in tree detection. In this way, the proposed method detected trees with %3.7 higher precision than Swin transformers and %13.1 higher precision than YOLO.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Multi-model tree detection in satellite images with weighted boxes fusion
    Durgut, Ozan
    Unsalan, Cem
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [2] Underwater Target Detection Algorithm Based on YOLO and Swin Transformer for Sonar Images
    Chen, Ruoyu
    Zhan, Shuyue
    Chen, Ying
    2022 OCEANS HAMPTON ROADS, 2022,
  • [3] HYPERSPECTRAL AND MULTISPECTRAL IMAGES FUSION BASED ON PYRAMID SWIN TRANSFORMER
    Lang, Han
    Bao, Wenxing
    Feng, Wei
    Sun, Shasha
    Ma, Xuan
    Zhang, Xiaowu
    IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 3125 - 3128
  • [4] Hyperspectral and multispectral images fusion based on pyramid swin transformer
    Lang, Han
    Bao, Wenxing
    Feng, Wei
    Qu, Kewen
    Ma, Xuan
    Zhang, Xiaowu
    INFRARED PHYSICS & TECHNOLOGY, 2024, 143
  • [5] Combining Swin Transformer and Attention-Weighted Fusion for Scene Text Detection
    Xianguo Li
    Xingchen Yao
    Yi Liu
    Neural Processing Letters, 56
  • [6] Combining Swin Transformer and Attention-Weighted Fusion for Scene Text Detection
    Li, Xianguo
    Yao, Xingchen
    Liu, Yi
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [7] Multimodal Fusion-based Swin Transformer for Facial Recognition Micro-Expression Recognition
    Zhao, Xinhua
    Lv, Yongjia
    Huang, Zheng
    PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022), 2022, : 780 - 785
  • [8] Ship Detection in SAR Images Based on Feature Enhancement Swin Transformer and Adjacent Feature Fusion
    Li, Kuoyang
    Zhang, Min
    Xu, Maiping
    Tang, Rui
    Wang, Liang
    Wang, Hai
    REMOTE SENSING, 2022, 14 (13)
  • [9] Weighted Fusion-Based Representation Classifiers for Marine Floating Raft Detection of SAR Images
    Geng, Jie
    Fan, Jianchao
    Wang, Hongyu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (03) : 444 - 448
  • [10] OFGST-Swin: Swin Transformer Utilizing Overlap Fusion-Based Generalized S-Transform for Respiratory Cycle Classification
    Wang, Fan
    Yuan, Xiaochen
    Bao, Junqi
    Lam, Chan-Tong
    Huang, Guoheng
    Chen, Hai
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73