small object detection;
deep learning;
HRNet;
Mamba;
YOLO;
feature fusion;
NEURAL-NETWORK;
D O I:
10.3390/s24154966
中图分类号:
O65 [分析化学];
学科分类号:
070302 ;
081704 ;
摘要:
(1) Background: Small objects in Unmanned Aerial Vehicle (UAV) images are often scattered throughout various regions of the image, such as the corners, and may be blocked by larger objects, as well as susceptible to image noise. Moreover, due to their small size, these objects occupy a limited area in the image, resulting in a scarcity of effective features for detection. (2) Methods: To address the detection of small objects in UAV imagery, we introduce a novel algorithm called High-Resolution Feature Pyramid Network Mamba-Based YOLO (HRMamba-YOLO). This algorithm leverages the strengths of a High-Resolution Network (HRNet), EfficientVMamba, and YOLOv8, integrating a Double Spatial Pyramid Pooling (Double SPP) module, an Efficient Mamba Module (EMM), and a Fusion Mamba Module (FMM) to enhance feature extraction and capture contextual information. Additionally, a new Multi-Scale Feature Fusion Network, High-Resolution Feature Pyramid Network (HRFPN), and FMM improved feature interactions and enhanced the performance of small object detection. (3) Results: For the VisDroneDET dataset, the proposed algorithm achieved a 4.4% higher Mean Average Precision (mAP) compared to YOLOv8-m. The experimental results showed that HRMamba achieved a mAP of 37.1%, surpassing YOLOv8-m by 3.8% (Dota1.5 dataset). For the UCAS_AOD dataset and the DIOR dataset, our model had a mAP 1.5% and 0.3% higher than the YOLOv8-m model, respectively. To be fair, all the models were trained without a pre-trained model. (4) Conclusions: This study not only highlights the exceptional performance and efficiency of HRMamba-YOLO in small object detection tasks but also provides innovative solutions and valuable insights for future research.
机构:
Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai,201210, China
School of Information Science and Technology, ShanghaiTech University, Shanghai,201210, ChinaShanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai,201210, China
Xie, Chunhui
Wu, Jinming
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai,201210, ChinaShanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai,201210, China
Wu, Jinming
Xu, Huaiyu
论文数: 0引用数: 0
h-index: 0
机构:
School of Information Science and Technology, ShanghaiTech University, Shanghai,201210, ChinaShanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai,201210, China
机构:
Harbin Inst Technol, Sch Astronaut, Ctr Control Theory & Guidance Technol Res, Harbin 150001, Peoples R ChinaHarbin Inst Technol, Sch Astronaut, Ctr Control Theory & Guidance Technol Res, Harbin 150001, Peoples R China
Yang, Yahu
Gao, Xiangzhou
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, Sch Astronaut, Ctr Control Theory & Guidance Technol Res, Harbin 150001, Peoples R ChinaHarbin Inst Technol, Sch Astronaut, Ctr Control Theory & Guidance Technol Res, Harbin 150001, Peoples R China
Gao, Xiangzhou
Wang, Yu
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Technol & Business Univ, Sch Artificial Intelligence, Beijing 100048, Peoples R ChinaHarbin Inst Technol, Sch Astronaut, Ctr Control Theory & Guidance Technol Res, Harbin 150001, Peoples R China
Wang, Yu
Song, Shenmin
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, Sch Astronaut, Ctr Control Theory & Guidance Technol Res, Harbin 150001, Peoples R ChinaHarbin Inst Technol, Sch Astronaut, Ctr Control Theory & Guidance Technol Res, Harbin 150001, Peoples R China
机构:
North China University of Technology
Key Laboratory of Large Structure Health Monitoring and ControlNorth China University of Technology
Yundong LI
Han DONG
论文数: 0引用数: 0
h-index: 0
机构:
North China University of TechnologyNorth China University of Technology
Han DONG
Hongguang LI
论文数: 0引用数: 0
h-index: 0
机构:
Unmanned System Research Institute, Beihang University
Guangdong Provincial Key Laboratory of Computer Vision and Virtual Reality Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of SciencesNorth China University of Technology
Hongguang LI
Xueyan ZHANG
论文数: 0引用数: 0
h-index: 0
机构:
North China University of TechnologyNorth China University of Technology
Xueyan ZHANG
Baochang ZHANG
论文数: 0引用数: 0
h-index: 0
机构:
School of Automation Science and Electrical Engineering, Beihang UniversityNorth China University of Technology
Baochang ZHANG
Zhifeng XIAO
论文数: 0引用数: 0
h-index: 0
机构:
State Key Laboratory of Information Engineering in Surveying, Wuhan UniversityNorth China University of Technology
机构:
North China University of Technology
Key Laboratory of Large Structure Health Monitoring and ControlNorth China University of Technology
Yundong LI
Han DONG
论文数: 0引用数: 0
h-index: 0
机构:
North China University of TechnologyNorth China University of Technology
Han DONG
Hongguang LI
论文数: 0引用数: 0
h-index: 0
机构:
Unmanned System Research Institute, Beihang University
Guangdong Provincial Key Laboratory of Computer Vision and Virtual Reality Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of SciencesNorth China University of Technology
Hongguang LI
Xueyan ZHANG
论文数: 0引用数: 0
h-index: 0
机构:
North China University of TechnologyNorth China University of Technology
Xueyan ZHANG
Baochang ZHANG
论文数: 0引用数: 0
h-index: 0
机构:
School of Automation Science and Electrical Engineering, Beihang UniversityNorth China University of Technology
Baochang ZHANG
Zhifeng XIAO
论文数: 0引用数: 0
h-index: 0
机构:
State Key Laboratory of Information Engineering in Surveying, WuhanNorth China University of Technology