YOLO-MMS for aerial object detection model based on hybrid feature extractor and improved multi-scale prediction

被引:1
|
作者
Junos, Mohamad Haniff [1 ]
Khairuddin, Anis Salwa Mohd [2 ]
机构
[1] Univ Sains Malaysia, Sch Aerosp Engn, Engn Campus, Nibong Tebal 14300, Penang, Malaysia
[2] Univ Malaya, Fac Engn, Dept Elect Engn, Kuala Lumpur 50603, Malaysia
来源
关键词
Lightweight YOLO; MixMBConv; Aerial object detection; Deep learning; NETWORK;
D O I
10.1007/s00371-024-03689-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Object detection in aerial images has become an important research subject due to the widespread use of aerial platforms, including satellites and unmanned aerial vehicles. However, the task is challenging because it involves a complex background, a high number of small objects, and densely distributed objects, leading to poor detection accuracy. Moreover, despite their excellent detection accuracy, existing one-stage object detection methods have complex structures that require huge computational power, generate high parameters, and exhibit slow inference speed, which makes them unsuitable for edge device applications. To address these issues, this paper proposes an accurate and lightweight object detection model named the YOLO-MMS model. The developed model incorporates several improvements, notably the hybrid backbone structure, which integrates a novel Mix-Mobile inverted bottleneck module to optimize efficiency by reducing the number of generated parameters. Additionally, the multi-scale prediction employs small efficient layer aggregation network and spatial pyramid pooling modules to improve feature extraction across multiple scales. Finally, the model includes an additional detection head and utilizes the Swish activation function to enhance detection accuracy. The evaluation results on the VisDrone and VEDAI datasets demonstrate that the proposed YOLO-MMS model achieved superior accuracy compared to other lightweight YOLO-based models. Furthermore, the proposed model showed significant improvements in model size with a reduction of 41.77% compared to its original YOLOv4-tiny model. These findings indicate that the proposed model presents optimal trade-offs in terms of accuracy and efficiency, rendering it exceptionally suitable for real-time applications on embedded systems. Our code is available at: https://github.com/hanifjunos/YOLO-MMS.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] MGFPN: Enhancing multi-scale feature for object detection
    He, Weiming
    Wu, You
    Xiao, Jing
    Cao, Yang
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (06) : 11171 - 11181
  • [32] Multi-scale redistribution feature pyramid for object detection
    Qian, Huifang
    Guo, Jiahao
    Zhou, Xuan
    AI COMMUNICATIONS, 2022, 35 (01) : 15 - 30
  • [33] OD-YOLO: Robust Small Object Detection Model in Remote Sensing Image with a Novel Multi-Scale Feature Fusion
    Bu, Yangcheng
    Ye, Hairong
    Tie, Zhixin
    Chen, Yanbing
    Zhang, Dingming
    SENSORS, 2024, 24 (11)
  • [34] Multi-Scale Feature Attention-DEtection TRansformer: Multi-Scale Feature Attention for security check object detection
    Sima, Haifeng
    Chen, Bailiang
    Tang, Chaosheng
    Zhang, Yudong
    Sun, Junding
    IET COMPUTER VISION, 2024, 18 (05) : 613 - 625
  • [35] A Novel Multi-Scale Transformer for Object Detection in Aerial Scenes
    Lu, Guanlin
    He, Xiaohui
    Wang, Qiang
    Shao, Faming
    Wang, Hongwei
    Wang, Jinkang
    DRONES, 2022, 6 (08)
  • [36] Multi-Scale Cross Distillation for Object Detection in Aerial Images
    Wang, Kun
    Wang, Zi
    Li, Zhang
    Teng, Xichao
    Li, Yang
    COMPUTER VISION - ECCV 2024, PT XLIX, 2025, 15107 : 452 - 471
  • [37] ADS-YOLO: A Multi-Scale Feature Extraction Remote Sensing Image Object Detection Algorithm Based on Dilated Residuals
    Li, Jianying
    Chen, Yajun
    Niu, Meiqi
    Cai, Wenhao
    Qiu, Xiaoyang
    IEEE ACCESS, 2025, 13 : 26225 - 26234
  • [38] SSW-YOLO: Enhanced Blood Cell Detection with Improved Feature Extraction and Multi-scale Attention
    Sun, Hai
    Wan, Xiaorong
    Tang, Shouguo
    Li, Yingna
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025,
  • [39] Fusion of multi-scale attention for aerial images small-target detection model based on PARE-YOLO
    Zhang, Huiying
    Xiao, Pan
    Yao, Feifan
    Zhang, Qinghua
    Gong, Yifei
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [40] PS-YOLO: a small object detector based on efficient convolution and multi-scale feature fusion
    Peng, Shifeng
    Fan, Xin
    Tian, Shengwei
    Yu, Long
    MULTIMEDIA SYSTEMS, 2024, 30 (05)