MLSA-YOLO: a multi-level feature fusion and scale-adaptive framework for small object detection

被引:0
|
作者
Peng, Jiayu [1 ]
Lv, Kai [2 ]
Wang, Guoliang [2 ]
Xiao, Wendong [2 ]
Ran, Teng [2 ]
Yuan, Liang [2 ]
机构
[1] Xinjiang Univ, Sch Software, Urumqi 830091, Peoples R China
[2] Xinjiang Univ, Sch Mech Engn, Urumqi 830017, Peoples R China
来源
JOURNAL OF SUPERCOMPUTING | 2025年 / 81卷 / 04期
基金
中国国家自然科学基金;
关键词
YOLOv8; Small object detection; Multi-level feature fusion; Scale-adaptive;
D O I
10.1007/s11227-025-06961-0
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the limited target area occupied by small objects, certain feature extraction paradigms that are not well-suited for small objects can further exacerbate the loss of their already limited information. Additionally, inconsistencies between features at different levels in FPN can result in suboptimal feature fusion, hindering the accurate representation of multi-scale features. As a result, even high-performance detectors struggle to recognize small objects effectively. To resolve the above issues, we propose MLSA-YOLO, a small object detection algorithm based on multi-level feature fusion and scale-adaptive. Initially, we restructured the network architecture using SPD-Conv with the proposed Convolutional Space-to-Depth (CSPD) module to improve the network's capacity for capturing local spatial details in images and to ensure that information is preserved during the downsampling process. Furthermore, to address the challenges in feature fusion, we employed a three-layer PAFPN structure at the neck and combined it with the proposed multi-level Feature Fusion and Scale-Adaptive (MLSA) feature pyramid network. This method enhances the complementarity of multi-level information, while effectively filtering the conflicting information generated during the fusion phase. To improve the quality of feature extraction, we incorporated the designed DCN_C2f module into the neck network. This module can accurately capture foreground object features, while enhancing the network's adaptability to geometric deformations of objects. Experimental results show that our approach performs better than other state-of-the-art detection algorithms on the VisDrone2019, DOTA, and FocusTiny datasets. Compared to YOLOv8s, mAP50 improved by 9.5%, 3.4%, and 5.1%, respectively.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Multi-level consistency regularization for domain adaptive object detection
    Tian, Kun
    Zhang, Chenghao
    Wang, Ying
    Xiang, Shiming
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (24): : 18003 - 18018
  • [42] Multi-level consistency regularization for domain adaptive object detection
    Kun Tian
    Chenghao Zhang
    Ying Wang
    Shiming Xiang
    Neural Computing and Applications, 2023, 35 : 18003 - 18018
  • [43] Human Pose Estimation with Multi-Scale and Multi-Level Feature Fusion
    Wang, Yanni
    Hu, Min
    Han, Shipeng
    Chen, Yixuan
    Lyu, Hao
    Computer Engineering and Applications, 2025, 61 (06) : 199 - 209
  • [44] Multi-dimensional, multi-functional and multi-level attention in YOLO for underwater object detection
    Shen, Xin
    Sun, Xudong
    Wang, Huibing
    Fu, Xianping
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 19935 - 19960
  • [45] Image segmentation algorithm based on multi-level feature adaptive fusion
    Yuan X.-P.
    He X.
    Wang X.-Q.
    Hu Y.-M.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (10): : 1958 - 1966
  • [46] Multi-dimensional, multi-functional and multi-level attention in YOLO for underwater object detection
    Xin Shen
    Xudong Sun
    Huibing Wang
    Xianping Fu
    Neural Computing and Applications, 2023, 35 : 19935 - 19960
  • [47] Adaptive Weighted Multi-Level Fusion of Multi-Scale Features: A New Approach to Pedestrian Detection
    Xu, Yao
    Yu, Qin
    FUTURE INTERNET, 2021, 13 (02) : 1 - 16
  • [48] AN ADAPTIVE MULTI-SCALE AND MULTI-LEVEL FEATURES FUSION NETWORK WITH PERCEPTUAL LOSS FOR CHANGE DETECTION
    Xu, Jialang
    Luo, Yang
    Chen, Xinyue
    Luo, Chunbo
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2275 - 2279
  • [49] Adaptive multi-level feature fusion and attention-based network for arbitrary-oriented object detection in remote sensing imagery
    Chen, Luchang
    Liu, Chunsheng
    Chang, Faliang
    Li, Shuang
    Nie, Zhaoying
    NEUROCOMPUTING, 2021, 451 : 67 - 80
  • [50] MSF-YOLO: A multi-scale features fusion-based method for small object detection
    Yang, Fengyu
    Zhou, Jiaqi
    Chen, Yuan
    Liao, Jie
    Yang, Mingxiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (22) : 61239 - 61260