A Decoupled YOLOv5 with Deformable Convolution and Multi-scale Attention

被引:2
|
作者
Yuan, Gui [1 ]
Liu, Gang [1 ]
Chen, Jian [1 ]
机构
[1] Hubei Univ Technol, Sch Comp, Wuhan 430068, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Decoupled head; Deformable convolution; Multi-scale attention; YOLOv5;
D O I
10.1007/978-3-031-10983-6_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
YOLO series are very classic detection frameworks in the field of object detection, and they have achieved remarkable results on general datasets. Among them, YOLOv5, as a single-stage multi-scale detector, has great advantages in accuracy and speed, but it still has the problem of inaccuracy localization when detecting the objects. In order to solve this problem, we propose three methods to improve YOLOv5. First, due to the conflict between classification and regression tasks, the classification and the localization in the detection head in our method are decoupled. Secondly, because the feature fusion method used by YOLOv5 can cause the problem of feature alignment, we added the deformable convolution to automatically align the features of different scales. Finally, we added the proposed multi-scale attention mechanism to the features of adjacent scales to predict a relative weighting between adjacent scales. Experiments show that our method on the PASCAL VOC dataset can obtain a mAP0.5 of 85.11% and a mAP0.5:0.95 of 63.33%.
引用
收藏
页码:3 / 14
页数:12
相关论文
共 50 条
  • [41] Multi-Scale Coordinate Attention Pyramid Convolution for Facial Expression Recognition
    Ni, Jinyuan
    Zhang, Jianxun
    Computer Engineering and Applications, 2023, 59 (22) : 242 - 250
  • [42] BS-YOLOV5S: INSULATOR DEFECT DETECTION WITH ATTENTION MECHANISM AND MULTI-SCALE FUSION
    Zhang, Zengbin
    Lv, Guohua
    Zhao, Guixin
    Zhai, Yi
    Cheng, Jinyong
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2365 - 2369
  • [43] Multi-Head-Self-Attention based YOLOv5X-transformer for multi-scale object detection
    Vasanthi, Ponduri
    Mohan, Laavanya
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 36491 - 36517
  • [44] Multi-Head-Self-Attention based YOLOv5X-transformer for multi-scale object detection
    Ponduri Vasanthi
    Laavanya Mohan
    Multimedia Tools and Applications, 2024, 83 : 36491 - 36517
  • [45] Defect detection in automotive glass based on modified YOLOv5 with multi-scale feature fusion and dual lightweight strategy
    Chen, Zhe
    Huang, Shihao
    Lv, Hui
    Luo, Zhixue
    Liu, Jinhao
    VISUAL COMPUTER, 2024, 40 (11): : 8099 - 8112
  • [46] Multi-Scale Kolmogorov-Arnold Network (KAN)-Based Linear Attention Network: Multi-Scale Feature Fusion with KAN and Deformable Convolution for Urban Scene Image Semantic Segmentation
    Li, Yuanhang
    Liu, Shuo
    Wu, Jie
    Sun, Weichao
    Wen, Qingke
    Wu, Yibiao
    Qin, Xiujuan
    Qiao, Yanyou
    REMOTE SENSING, 2025, 17 (05)
  • [47] An improved YOLOv5 method for large objects detection with multi-scale feature cross-layer fusion network
    Qu, Zhong
    Gao, Le-yuan
    Wang, Sheng-ye
    Yin, Hao-nan
    Yi, Tu-ming
    IMAGE AND VISION COMPUTING, 2022, 125
  • [48] MCDCNet: Multi-scale constrained deformable convolution network for apple leaf disease detection
    Liu, Bin
    Huang, Xulei
    Sun, Leiming
    Wei, Xing
    Ji, Zeyu
    Zhang, Haixi
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 222
  • [49] Crack detection based on attention mechanism with YOLOv5
    Lan, Min-Li
    Yang, Dan
    Zhou, Shuang-Xi
    Ding, Yang
    ENGINEERING REPORTS, 2025, 7 (01)
  • [50] Driver Attention Detection Based on Improved YOLOv5
    Wang, Zhongzhou
    Yao, Keming
    Guo, Fuao
    APPLIED SCIENCES-BASEL, 2023, 13 (11):