A Decoupled YOLOv5 with Deformable Convolution and Multi-scale Attention

被引:2
|
作者
Yuan, Gui [1 ]
Liu, Gang [1 ]
Chen, Jian [1 ]
机构
[1] Hubei Univ Technol, Sch Comp, Wuhan 430068, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Decoupled head; Deformable convolution; Multi-scale attention; YOLOv5;
D O I
10.1007/978-3-031-10983-6_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
YOLO series are very classic detection frameworks in the field of object detection, and they have achieved remarkable results on general datasets. Among them, YOLOv5, as a single-stage multi-scale detector, has great advantages in accuracy and speed, but it still has the problem of inaccuracy localization when detecting the objects. In order to solve this problem, we propose three methods to improve YOLOv5. First, due to the conflict between classification and regression tasks, the classification and the localization in the detection head in our method are decoupled. Secondly, because the feature fusion method used by YOLOv5 can cause the problem of feature alignment, we added the deformable convolution to automatically align the features of different scales. Finally, we added the proposed multi-scale attention mechanism to the features of adjacent scales to predict a relative weighting between adjacent scales. Experiments show that our method on the PASCAL VOC dataset can obtain a mAP0.5 of 85.11% and a mAP0.5:0.95 of 63.33%.
引用
收藏
页码:3 / 14
页数:12
相关论文
共 50 条
  • [21] Research on Multi-Scale Pest Detection and Identification Method in Granary Based on Improved YOLOv5
    Chu, Jinyu
    Li, Yane
    Feng, Hailin
    Weng, Xiang
    Ruan, Yaoping
    AGRICULTURE-BASEL, 2023, 13 (02):
  • [22] A Multi-Scale Traffic Object Detection Algorithm for Road Scenes Based on Improved YOLOv5
    Li, Ang
    Sun, Shijie
    Zhang, Zhaoyang
    Feng, Mingtao
    Wu, Chengzhong
    Li, Wang
    ELECTRONICS, 2023, 12 (04)
  • [23] DCMS-YOLOv5: A Dual-Channel and Multi-Scale Vertical Expansion Helmet Detection Model Based on YOLOv5
    Liu, Yulu
    Tian, Ying
    ENGINEERING LETTERS, 2023, 31 (01) : 1 - 7
  • [24] YOLOD: A Task Decoupled Network Based on YOLOv5
    Liang, Xingzhu
    Cheng, Wei
    Zhang, Chunjiong
    Wang, Lixin
    Yan, Xinyun
    Chen, Qing
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2023, 69 (04) : 775 - 785
  • [25] Video Frame Interpolation via Multi-scale Expandable Deformable Convolution
    Zhang, Dengyong
    Huang, Pu
    Ding, Xiangling
    Li, Feng
    Yang, Gaobo
    PROCEEDINGS OF THE 2023 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH&MMSEC 2023, 2023, : 19 - 28
  • [26] Autonomous underwater robot for underwater image enhancement via multi-scale deformable convolution network with attention mechanism
    Lin, Yi
    Zhou, Jingchun
    Ren, Wenqi
    Zhang, Weishi
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 191
  • [27] Multi-scale Small Target Detection for Indoor Mobile Rescue Vehicles Based on Improved YOLOv5
    Li, Maoyue
    Yang, Tenghui
    Xu, Shengbo
    Meng, Lingqiang
    Liu, Zhicheng
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2024, 25 (06) : 1431 - 1444
  • [28] ETSR-YOLO: An improved multi-scale traffic sign detection algorithm based on YOLOv5
    Liu, Haibin
    Zhou, Kui
    Zhang, Youbing
    Zhang, Yufeng
    PLOS ONE, 2023, 18 (12):
  • [29] A Kind of Water Surface Multi-Scale Object Detection Method Based on Improved YOLOv5 Network
    Ma, Zhongli
    Wan, Yi
    Liu, Jiajia
    An, Ruojin
    Wu, Lili
    MATHEMATICS, 2023, 11 (13)
  • [30] A Study on Multi-Scale Behavior Recognition of Dairy Cows in Complex Background Based on Improved YOLOv5
    Zong, Zheying
    Ban, Zeyu
    Wang, Chunguang
    Wang, Shuai
    Yuan, Wenbo
    Zhang, Chunhui
    Su, Lide
    Yuan, Ze
    AGRICULTURE-BASEL, 2025, 15 (02):