A Decoupled YOLOv5 with Deformable Convolution and Multi-scale Attention

Cited by: 2

Authors:

Yuan, Gui [1]

Liu, Gang [1]

Chen, Jian [1]

Affiliations:

[1] Hubei Univ Technol, Sch Comp, Wuhan 430068, Peoples R China

Source:

KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I | 2022 / Vol. 13368

Funding:

National Natural Science Foundation of China;

Keywords:

Object detection; Decoupled head; Deformable convolution; Multi-scale attention; YOLOv5;

DOI:

10.1007/978-3-031-10983-6_1

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The YOLO series are classic frameworks in object detection and have achieved remarkable results on general datasets. Among them, YOLOv5, a single-stage multi-scale detector, offers strong accuracy and speed, but it still suffers from inaccurate localization when detecting objects. To address this problem, we propose three improvements to YOLOv5. First, because the classification and regression tasks conflict, we decouple classification and localization in the detection head. Second, because the feature fusion method used by YOLOv5 can cause feature misalignment, we add deformable convolution to automatically align features of different scales. Finally, we apply the proposed multi-scale attention mechanism to the features of adjacent scales to predict a relative weighting between them. Experiments show that our method achieves a mAP0.5 of 85.11% and a mAP0.5:0.95 of 63.33% on the PASCAL VOC dataset.
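The abstract does not specify how the relative weighting between adjacent scales is computed, so the following is only a minimal sketch of the general idea, not the authors' mechanism. It assumes a per-pixel weight `alpha` obtained from a sigmoid over a linear combination of the two feature values; the function names and the parameters `w_fine`, `w_coarse`, and `bias` are hypothetical stand-ins for a learned attention branch.

```python
import math


def upsample_nearest_2x(fmap):
    """Nearest-neighbour 2x upsampling of a 2D list of floats."""
    out = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def fuse_adjacent_scales(fine, coarse, w_fine=1.0, w_coarse=-1.0, bias=0.0):
    """Blend a fine map with the 2x-upsampled coarse map.

    ASSUMPTION: alpha = sigmoid(w_fine * f + w_coarse * c + bias) is a
    hypothetical placeholder for a learned attention branch. The fused
    value alpha * f + (1 - alpha) * c is a relative weighting of the
    two adjacent scales at each pixel.
    """
    coarse_up = upsample_nearest_2x(coarse)
    fused = []
    for f_row, c_row in zip(fine, coarse_up):
        row = []
        for f, c in zip(f_row, c_row):
            alpha = sigmoid(w_fine * f + w_coarse * c + bias)
            row.append(alpha * f + (1.0 - alpha) * c)
        fused.append(row)
    return fused
```

Because `alpha` lies strictly between 0 and 1, each fused value is a convex combination of the two scales, so neither scale can be ignored entirely and the output stays within the range spanned by the inputs.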

Pages: 3-14

Page count: 12
