A Decoupled YOLOv5 with Deformable Convolution and Multi-scale Attention

Cited by: 2

Authors:

Yuan, Gui [1]

Liu, Gang [1]

Chen, Jian [1]

Affiliations:

[1] Hubei Univ Technol, Sch Comp, Wuhan 430068, Peoples R China

Source:

KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I | 2022 / Vol. 13368

Funding:

National Natural Science Foundation of China;

Keywords:

Object detection; Decoupled head; Deformable convolution; Multi-scale attention; YOLOv5;

DOI:

10.1007/978-3-031-10983-6_1

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The YOLO series are classic frameworks in the field of object detection and have achieved remarkable results on general datasets. Among them, YOLOv5, a single-stage multi-scale detector, has great advantages in accuracy and speed, but it still suffers from inaccurate localization when detecting objects. To solve this problem, we propose three methods to improve YOLOv5. First, because the classification and regression tasks conflict, we decouple classification and localization in the detection head. Second, because the feature fusion method used by YOLOv5 can cause feature misalignment, we add deformable convolution to automatically align the features of different scales. Finally, we apply the proposed multi-scale attention mechanism to the features of adjacent scales to predict a relative weighting between them. Experiments show that our method obtains a mAP0.5 of 85.11% and a mAP0.5:0.95 of 63.33% on the PASCAL VOC dataset.
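
The two metrics quoted in the abstract differ in how strictly a predicted box must overlap the ground truth. Under the usual convention, mAP0.5 counts a detection as correct when its intersection-over-union (IoU) is at least 0.5, while mAP0.5:0.95 averages the result over ten IoU thresholds from 0.50 to 0.95 in steps of 0.05. The sketch below only illustrates that thresholding; it is not the authors' evaluation code, and the function names and example boxes are hypothetical.

```python
def iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    # Corners of the overlap rectangle (may be empty).
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def iou_thresholds():
    """The ten thresholds 0.50, 0.55, ..., 0.95 used for mAP0.5:0.95."""
    return [round(0.5 + 0.05 * i, 2) for i in range(10)]


def thresholds_passed(pred_box, gt_box):
    """Thresholds at which the prediction would count as a true positive."""
    value = iou(pred_box, gt_box)
    return [t for t in iou_thresholds() if value >= t]


# Hypothetical boxes: the prediction is shifted one pixel to the right.
gt = (0, 0, 10, 10)
pred = (1, 0, 11, 10)
print(iou(pred, gt))
print(thresholds_passed(pred, gt))
```

A box that passes at 0.5 but fails at the higher thresholds contributes to mAP0.5 without contributing much to mAP0.5:0.95, which is why the second figure is more sensitive to the localization accuracy the paper targets.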

Pages: 3 - 14

Page count: 12
