Enriched multi-scale cascade pyramid features and guided context attention network for industrial surface defect detection

被引:17
|
作者
Shao, Linhao [1 ]
Zhang, Erhu [1 ]
Duan, Jinghong [2 ]
Ma, Qiurui [3 ]
机构
[1] Xian Univ Technol, Dept Informat Sci, Xian 710048, Peoples R China
[2] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
[3] Xian Univ Technol, Sch Mech & Precis Instrument Engn, Xian 710048, Peoples R China
基金
中国国家自然科学基金;
关键词
Surface defect detection; Deep learning; Pyramid feature fusion; Guided context attention; Attention mechanism; CLASSIFICATION;
D O I
10.1016/j.engappai.2023.106369
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Surface defect detection is a very important technique to guarantee product quality in industrial fields. However, the detection of multi-scale defects and defects with poor visibility is still a challenging problem. To address this issue, we propose a novel network by collaborating multi-scale cascade pyramid features and a guided context attention mechanism for the pixel-wise defection of surface defects, called MPA-Net. The MPA-Net is a full y-convolutional network (FCN) with an encoder-decoder architecture, which can integrate multi-scale features and merge them into the different stages of the decoder for generating the defect segmentation map. Specifically, the proposed guided context attention module (GCA) is used to transmit the global context information from the large scale to the small scale, which can promote the initial recovery capability of the decoder, and thus help to locate defects with different sizes and defects with poor visibility. Moreover, the proposed pyramid feature fusion and enrichment module (FFEM) is employed to aggregate low-level coarse features and high-level semantic features in each scale, so as to increase the ability of defect feature representation. The aggregation features at different scales are then fused to the different layers of the decoder, which is beneficial to recover the details of defects gradually. The evaluation results on four public datasets demonstrate that the proposed method has excellent performances on mean intersection of union (DAGM2007: 64.94%, KolektorSSD: 77.90%, RSDDs-I: 86.63%, RSDDs-II: 80.62%, FID: 96.98%) and mean pixel accuracy (DAGM2007: 67.97%, KolektorSSD: 85.01%, RSDDs-I: 94.13%, RSDDs-II: 88.53%, FID: 98.71%).
引用
收藏
页数:15
相关论文
共 50 条
  • [41] PGA-Net: Pyramid Feature Fusion and Global Context Attention Network for Automated Surface Defect Detection
    Dong, Hongwen
    Song, Kechen
    He, Yu
    Xu, Jing
    Yan, Yunhui
    Meng, Qinggang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (12) : 7448 - 7458
  • [42] Multi-scale Context Enhancement Network for Object Detection
    Wang, Yanan
    Ma, Yingdong
    2022 2ND IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE (SEAI 2022), 2022, : 6 - 11
  • [43] Multi-Scale Detail Enhanced Pyramid Network for Esophageal Lesion Detection
    Li, Chi
    Zhou, Yingyue
    Yao, Hanmin
    Li, Xiaoxia
    Qin, Jiamin
    Zhuang, Ming
    Wen, Liming
    Computer Engineering and Applications, 2024, 60 (04) : 229 - 236
  • [44] Multi-Scale Residual Aggregation Feature Pyramid Network for Object Detection
    Wang, Hongyang
    Wang, Tiejun
    ELECTRONICS, 2023, 12 (01)
  • [45] MLPN: Multi-Scale Laplacian Pyramid Network for deepfake detection and localization
    Zhang, Yibo
    Lin, Weiguo
    Xu, Junfeng
    Xu, Wanshang
    Xu, Yikun
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2025, 89
  • [46] FEATURE FUSING OF FEATURE PYRAMID NETWORK FOR MULTI-SCALE PEDESTRIAN DETECTION
    Tesema, Fiseha B.
    Lin, Junpeng
    Ou, Jie
    Wu, Hong
    Zhu, William
    2018 15TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2018, : 10 - 13
  • [47] MCAC-UNet: Multi scale Attention Cascade Compensation U-Net Network for Rail Surface Defect Detection
    Wu, Ziqing
    Lv, Jinglong
    Sun, Xiaoguang
    Niu, Weilong
    2024 2ND ASIA CONFERENCE ON COMPUTER VISION, IMAGE PROCESSING AND PATTERN RECOGNITION, CVIPPR 2024, 2024,
  • [48] Attention Guided Encoder-Decoder Network With Multi-Scale Context Aggregation for Land Cover Segmentation
    Wang, Shuyang
    Mu, Xiaodong
    Yang, Dongfang
    He, Hao
    Zhao, Peng
    IEEE ACCESS, 2020, 8 : 215299 - 215309
  • [49] An efficient model for metal surface defect detection based on attention mechanism and multi-scale feature
    Zhang, Heng
    Fu, Wei
    Wang, Xiaoming
    Li, Dong
    Zhu, Danchen
    Su, Xingwang
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [50] ADMNet: Attention-Guided Densely Multi-Scale Network for Lightweight Salient Object Detection
    Zhou, Xiaofei
    Shen, Kunye
    Liu, Zhi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 10828 - 10841