Automated surface defect detection has emerged as a promising and crucial inspection method in industry, greatly improving production quality and efficiency. However, current semantic segmentation models based on Vision Transformers are primarily trained on natural images, which exhibit complex object textures and backgrounds. Moreover, pure Vision Transformers lack the ability to capture local representations, making it difficult to apply existing semantic segmentation models directly to industrial production scenarios. In this paper, we propose a novel transformer segmentation model designed specifically for surface defect detection in industrial settings. First, we employ a Dual-Attention Transformer (DAT) as the backbone of our model; it replaces the generic 2D convolution block in the Spatial Reduction Attention (SRA) module with a new self-attention block, establishing a global view at every layer. Second, we enhance the collection of local information during decoding by initializing the relative positions between query and key pixels. Finally, to strengthen salient defect structures, we use Pixel Shuffle to rearrange the Ground Truth (GT) so that it guides the feature maps at each scale. Extensive experiments on three public industrial datasets demonstrate the outstanding performance of our network in surface defect detection.
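To make the GT rearrangement step concrete, the following is a minimal sketch in PyTorch, assuming the full-resolution GT mask is rearranged with `nn.PixelUnshuffle` (the inverse of Pixel Shuffle) so its spatial size matches each decoder scale; the function name `gt_pyramid` and the particular downscale factors are hypothetical illustrations, not the paper's released code.

```python
import torch
import torch.nn as nn

def gt_pyramid(gt: torch.Tensor, scales=(2, 4, 8)):
    """Rearrange a (B, 1, H, W) ground-truth mask into lower-resolution
    tensors via pixel unshuffling, one per feature-map scale.
    No pixels are discarded: each output trades spatial resolution
    for channels, so fine defect structure is preserved."""
    return [nn.PixelUnshuffle(s)(gt) for s in scales]

# Hypothetical usage: a binary defect mask supervising three scales.
gt = torch.randint(0, 2, (2, 1, 256, 256)).float()
for t in gt_pyramid(gt):
    print(t.shape)  # (2, 4, 128, 128), (2, 16, 64, 64), (2, 64, 32, 32)
```

Unlike plain downsampling (e.g., nearest-neighbor interpolation), this rearrangement keeps every GT pixel, which is one plausible reading of how the rearranged GT can guide feature maps at each scale without blurring thin defect boundaries.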