A Decoupled YOLOv5 with Deformable Convolution and Multi-scale Attention

被引:2
|
作者
Yuan, Gui [1 ]
Liu, Gang [1 ]
Chen, Jian [1 ]
机构
[1] Hubei Univ Technol, Sch Comp, Wuhan 430068, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Decoupled head; Deformable convolution; Multi-scale attention; YOLOv5;
D O I
10.1007/978-3-031-10983-6_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
YOLO series are very classic detection frameworks in the field of object detection, and they have achieved remarkable results on general datasets. Among them, YOLOv5, as a single-stage multi-scale detector, has great advantages in accuracy and speed, but it still has the problem of inaccuracy localization when detecting the objects. In order to solve this problem, we propose three methods to improve YOLOv5. First, due to the conflict between classification and regression tasks, the classification and the localization in the detection head in our method are decoupled. Secondly, because the feature fusion method used by YOLOv5 can cause the problem of feature alignment, we added the deformable convolution to automatically align the features of different scales. Finally, we added the proposed multi-scale attention mechanism to the features of adjacent scales to predict a relative weighting between adjacent scales. Experiments show that our method on the PASCAL VOC dataset can obtain a mAP0.5 of 85.11% and a mAP0.5:0.95 of 63.33%.
引用
收藏
页码:3 / 14
页数:12
相关论文
共 50 条
  • [31] Multi-Scale Convolution Attention Neural Network for Gesture Recognition
    Ji, Penghui
    Cao, Chongli
    Zhang, Hang
    Li, Qi
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CRYPTOGRAPHY, NETWORK SECURITY AND COMMUNICATION TECHNOLOGY, CNSCT 2024, 2024, : 421 - 425
  • [32] Image Inpainting Based Multi-scale Gated Convolution and Attention
    Jiang, Hualiang
    Ma, Xiaohu
    Yang, Dongdong
    Zhao, Jiaxin
    Shen, Yao
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 407 - 418
  • [33] Ship Detection Algorithm Based on YOLOv5 Network Improved with Lightweight Convolution and Attention Mechanism
    Wang, Langyu
    Zhang, Yan
    Lin, Yahong
    Yan, Shuai
    Xu, Yuanyuan
    Sun, Bo
    ALGORITHMS, 2023, 16 (12)
  • [34] AMDNet: Adaptive Fall Detection Based on Multi-scale Deformable Convolution Network
    Jiang, Minghua
    Zhang, Keyi
    Ma, Yongkang
    Liu, Li
    Peng, Tao
    Hu, Xinrong
    Yu, Feng
    ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT III, 2024, 14497 : 3 - 14
  • [35] A Lightweight YOLOv5 Optimization of Coordinate Attention
    Wu, Jun
    Dong, Jiaming
    Nie, Wanyu
    Ye, Zhiwei
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [36] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhou, Zhenghua
    Xue, Boxiang
    Wang, Hai
    Zhao, Jianwei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 27809 - 27830
  • [37] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhenghua Zhou
    Boxiang Xue
    Hai Wang
    Jianwei Zhao
    Multimedia Tools and Applications, 2024, 83 : 27809 - 27830
  • [38] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
    Zhou, Zhenghua
    Xue, Boxiang
    Wang, Hai
    Zhao, Jianwei
    Multimedia Tools and Applications, 83 (09): : 27809 - 27830
  • [39] DSA: Deformable Segmentation Attention for Multi-Scale Fisheye Image Segmentation
    Jiang, Junzhe
    Xu, Cheng
    Liu, Hongzhe
    Fu, Ying
    Jian, Muwei
    ELECTRONICS, 2023, 12 (19)
  • [40] Point Cloud Completion via Multi-Scale Edge Convolution and Attention
    Cao, Rui
    Zhang, Kaiyi
    Chen, Yang
    Yang, Ximing
    Jin, Cheng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 6183 - 6192