A Decoupled YOLOv5 with Deformable Convolution and Multi-scale Attention

Cited by: 2
Authors
Yuan, Gui [1]
Liu, Gang [1]
Chen, Jian [1]
Affiliation
[1] Hubei Univ Technol, Sch Comp, Wuhan 430068, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Object detection; Decoupled head; Deformable convolution; Multi-scale attention; YOLOv5;
DOI
10.1007/978-3-031-10983-6_1
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The YOLO series comprises classic frameworks in object detection that have achieved remarkable results on general datasets. Among them, YOLOv5, a single-stage multi-scale detector, offers strong accuracy and speed, but it still suffers from inaccurate localization when detecting objects. To address this problem, we propose three improvements to YOLOv5. First, because the classification and regression tasks conflict, we decouple classification and localization in the detection head. Second, because the feature fusion method used by YOLOv5 can cause feature misalignment, we add deformable convolution to automatically align features of different scales. Finally, we apply the proposed multi-scale attention mechanism to features of adjacent scales to predict a relative weighting between them. Experiments show that our method achieves a mAP0.5 of 85.11% and a mAP0.5:0.95 of 63.33% on the PASCAL VOC dataset.
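To illustrate what "a relative weighting between adjacent scales" could mean in practice, here is a minimal sketch in plain Python. It is not the authors' mechanism: the abstract does not specify how the weight is computed, so the gate below (a sigmoid over globally pooled features, with fixed hypothetical coefficients `w_fine`, `w_coarse`, `bias`) and all function names are assumptions made for illustration only.

```python
import math


def upsample2x(fmap):
    """Nearest-neighbour 2x upsampling of a 2-D list-of-lists feature map."""
    out = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]  # repeat each column twice
        out.append(wide)
        out.append(list(wide))                     # repeat each row twice
    return out


def global_mean(fmap):
    """Global average pooling over a single-channel feature map."""
    values = [v for row in fmap for v in row]
    return sum(values) / len(values)


def relative_weight(fine, coarse, w_fine=1.0, w_coarse=-1.0, bias=0.0):
    """Hypothetical gate: sigmoid of a linear score on the pooled features.

    In a trained network the coefficients would be learned; here they are
    fixed placeholders so the example stays self-contained.
    """
    score = w_fine * global_mean(fine) + w_coarse * global_mean(coarse) + bias
    return 1.0 / (1.0 + math.exp(-score))


def fuse_adjacent(fine, coarse):
    """Blend a fine map with the upsampled coarser map using one weight alpha.

    alpha weights the fine scale and (1 - alpha) the coarse scale, so the
    two contributions are relative to each other and sum to one.
    """
    up = upsample2x(coarse)
    alpha = relative_weight(fine, up)
    fused = [
        [alpha * f + (1.0 - alpha) * c for f, c in zip(fine_row, up_row)]
        for fine_row, up_row in zip(fine, up)
    ]
    return fused, alpha
```

Calling `fuse_adjacent(fine, coarse)` with a 4x4 `fine` map and a 2x2 `coarse` map returns a 4x4 fused map and the scalar weight. A single scalar per pair of scales is the simplest choice; a per-pixel or per-channel weight would follow the same blending pattern.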
Pages: 3-14
Page count: 12
Related papers
50 in total
  • [11] MSA-YOLOv5: Multi-scale attention-based YOLOv5 for automatic detection of acute ischemic stroke from multi-modality MRI images
    Chen, Shannan
    Duan, Jinfeng
    Zhang, Nan
    Qi, Miao
    Li, Jinze
    Wang, Hong
    Wang, Rongqiang
    Ju, Ronghui
    Duan, Yang
    Qi, Shouliang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 165
  • [12] FE-YOLOv5: Improved YOLOv5 Network for Multi-scale Drone-Captured Scene Detection
    Zhao, Chen
    Yan, Zhe
    Dong, Zhiyan
    Yang, Dingkang
    Zhang, Lihua
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II, 2024, 14448 : 290 - 304
  • [13] A multi-scale cucumber disease detection method in natural scenes based on YOLOv5
    Li, Shufei
    Li, Kaiyu
    Qiao, Yan
    Zhang, Lingxian
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 202
  • [14] Deep image compression based on multi-scale deformable convolution
    Li, Daowen
    Li, Yingming
    Sun, Heming
    Yu, Lu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [15] A Multi-scale Deformable Convolution Network Model for Text Recognition
    Cheng, Lang
    Yan, Junhong
    Chen, Minghui
    Lu, Yuanwen
    Li, Yunhong
    Hu, Lei
    THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [16] MCF-YOLOv5: A Small Target Detection Algorithm Based on Multi-Scale Feature Fusion Improved YOLOv5
    Gao, Song
    Gao, Mingwang
    Wei, Zhihui
    INFORMATION, 2024, 15 (05)
  • [17] Hyperspectral Unmixing With Multi-Scale Convolution Attention Network
    Hu, Sheng
    Li, Huali
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 2531 - 2542
  • [18] Multi-scale ship target detection using SAR images based on improved Yolov5
    Yasir, Muhammad
    Shanwei, Liu
    Mingming, Xu
    Hui, Sheng
    Hossain, Md Sakaouth
    Colak, Arife Tugsan Isiacik
    Wang, Dawei
    Jianhua, Wan
    Dang, Kinh Bac
    FRONTIERS IN MARINE SCIENCE, 2023, 9
  • [19] Improved YOLOv5 network for real-time multi-scale traffic sign detection
    Wang, Junfan
    Chen, Yi
    Dong, Zhekang
    Gao, Mingyu
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (10): 7853 - 7865
  • [20] Improved YOLOv5 network for real-time multi-scale traffic sign detection
    Junfan Wang
    Yi Chen
    Zhekang Dong
    Mingyu Gao
    Neural Computing and Applications, 2023, 35 : 7853 - 7865