Enriched multi-scale cascade pyramid features and guided context attention network for industrial surface defect detection

被引:17
|
作者
Shao, Linhao [1 ]
Zhang, Erhu [1 ]
Duan, Jinghong [2 ]
Ma, Qiurui [3 ]
机构
[1] Xian Univ Technol, Dept Informat Sci, Xian 710048, Peoples R China
[2] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
[3] Xian Univ Technol, Sch Mech & Precis Instrument Engn, Xian 710048, Peoples R China
基金
中国国家自然科学基金;
关键词
Surface defect detection; Deep learning; Pyramid feature fusion; Guided context attention; Attention mechanism; CLASSIFICATION;
D O I
10.1016/j.engappai.2023.106369
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Surface defect detection is a very important technique to guarantee product quality in industrial fields. However, the detection of multi-scale defects and defects with poor visibility is still a challenging problem. To address this issue, we propose a novel network by collaborating multi-scale cascade pyramid features and a guided context attention mechanism for the pixel-wise defection of surface defects, called MPA-Net. The MPA-Net is a full y-convolutional network (FCN) with an encoder-decoder architecture, which can integrate multi-scale features and merge them into the different stages of the decoder for generating the defect segmentation map. Specifically, the proposed guided context attention module (GCA) is used to transmit the global context information from the large scale to the small scale, which can promote the initial recovery capability of the decoder, and thus help to locate defects with different sizes and defects with poor visibility. Moreover, the proposed pyramid feature fusion and enrichment module (FFEM) is employed to aggregate low-level coarse features and high-level semantic features in each scale, so as to increase the ability of defect feature representation. The aggregation features at different scales are then fused to the different layers of the decoder, which is beneficial to recover the details of defects gradually. The evaluation results on four public datasets demonstrate that the proposed method has excellent performances on mean intersection of union (DAGM2007: 64.94%, KolektorSSD: 77.90%, RSDDs-I: 86.63%, RSDDs-II: 80.62%, FID: 96.98%) and mean pixel accuracy (DAGM2007: 67.97%, KolektorSSD: 85.01%, RSDDs-I: 94.13%, RSDDs-II: 88.53%, FID: 98.71%).
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Multi-Scale Pyramid Squeeze Attention Similarity Optimization Classification Neural Network for ERP Detection
    Xu, Ruitian
    Allison, Brendan Z.
    Zhao, Xueqing
    Liang, Wei
    Wang, Xingyu
    Cichocki, Andrzej
    Jin, Jing
    NEURAL NETWORKS, 2025, 184
  • [22] Semantic Adversarial Network with Multi-Scale Pyramid Attention for Video Classification
    Xie, De
    Deng, Cheng
    Wang, Hao
    Li, Chao
    Tao, Dapeng
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9030 - 9037
  • [23] Multi-scale Residual Pyramid Attention Network for Monocular Depth Estimation
    Liu, Jing
    Zhang, Xiaona
    Li, Zhaoxin
    Mao, Tianlu
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5137 - 5144
  • [24] Multi-scale features and attention guided for brain tumor segmentation
    Wang, Zekun
    Zou, Yanni
    Chen, Hongyu
    Liu, Peter X.
    Chen, Junyu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 100
  • [25] A Multi-Scale Guided Cascade Hourglass Network for Depth Completion
    Li, Ang
    Yuan, Zejian
    Ling, Yonggen
    Chi, Wanchao
    Zhang, Shenghao
    Zhang, Chong
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 32 - 40
  • [26] Multi-scale global context feature pyramid network for object detector
    Yunhao Li
    Mingwen Shao
    Bingbing Fan
    Wei Zhang
    Signal, Image and Video Processing, 2022, 16 : 705 - 713
  • [27] Multi-scale Pyramid Pooling Network for salient object detection
    Dakhia, Abdelhafid
    Wang, Tiantian
    Lu, Huchuan
    NEUROCOMPUTING, 2019, 333 : 211 - 220
  • [28] Multi-scale global context feature pyramid network for object detector
    Li, Yunhao
    Shao, Mingwen
    Fan, Bingbing
    Zhang, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (03) : 705 - 713
  • [29] Multi-Scale Cascade Network for Salient Object Detection
    Li, Xin
    Yang, Fan
    Cheng, Hong
    Chen, Junyu
    Guo, Yuxiao
    Chen, Leiting
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 439 - 447
  • [30] Multi-Scale Attention-Guided Network for mammograms classification
    Xu, Chunbo
    Lou, Meng
    Qi, Yunliang
    Wang, Yiming
    Pi, Jiande
    Ma, Yide
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 68