Multi-scale triple-attention network for pixelwise crack segmentation

Cited by: 38
Authors
Yang, Lei [1, 2]
Bai, Suli [1, 2]
Liu, Yanhong [1, 2]
Yu, Hongnian [2, 3]
Affiliations
[1] Zhengzhou Univ, Sch Elect & Informat Engn, Henan 450001, Peoples R China
[2] Robot Percept & Control Engn Lab Henan Prov, Zhengzhou 450001, Henan, Peoples R China
[3] Edinburgh Napier Univ, Built Environm, Edinburgh EH10 5DT, Scotland
Keywords
Pavement crack segmentation; Semantic segmentation; Residual network; Multiscale input strategy; Deep supervision mechanism;
DOI
10.1016/j.autcon.2023.104853
Chinese Library Classification number
TU [Building Science];
Discipline classification code
0813;
Abstract
Intelligent crack detection is of great value for infrastructure maintenance, and in China the most significant kind of infrastructure is roads. Accurate defect detection allows pavement to be repaired and maintained in a timely manner, which significantly reduces the occurrence of hazards. However, detecting pavement defects remains challenging owing to complex backgrounds, microdefects, varied defect shapes and sizes, and class imbalance. Deep learning has recently demonstrated superior performance on pixelwise image segmentation, but demanding segmentation tasks still suffer from a limited receptive field, insufficient processing of local features, and information loss caused by pooling operations. To address these issues, a multiscale triple-attention network, named MST-Net, is proposed for end-to-end pixelwise crack detection. First, a multiscale input strategy is applied to the segmentation network to capture more context information and to reduce the information loss caused by pooling operations. Second, to represent local features effectively, an additive attention fusion (AAF) block is proposed to guide feature learning to capture both global and local contexts. Third, to handle the class imbalance in crack detection, a triple attention (TA) block is proposed to extract spatial, channel and pixel attention information, suppressing the background and useless information, which helps characterize microcracks. Finally, to address the limited receptive field, a multiscale feature aggregation unit is proposed for feature fusion to improve the detection of multiscale defects. To better guide network training, a deep supervision mechanism is also introduced to speed up convergence and improve defect segmentation performance. Evaluation and detection experiments are carried out on three public crack segmentation datasets, and comparisons with mainstream segmentation models show that the proposed network achieves excellent performance on pixelwise crack detection.
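The abstract names a multiscale input strategy and a deep supervision mechanism but gives no implementation details. The sketch below is one plausible reading, not the authors' code: it assumes the multiscale inputs are built by repeated 2x2 average pooling, that each scale has its own side output, and that the deeply supervised loss is a weighted sum of per-scale binary cross-entropy terms. The function names (`avg_pool2x`, `input_pyramid`, `bce`, `deep_supervision_loss`) and the weights are hypothetical.

```python
import math

def avg_pool2x(img):
    """Downsample a 2D grid by averaging non-overlapping 2x2 blocks (assumed scheme)."""
    h, w = len(img) // 2, len(img[0]) // 2
    return [[(img[2 * r][2 * c] + img[2 * r][2 * c + 1]
              + img[2 * r + 1][2 * c] + img[2 * r + 1][2 * c + 1]) / 4.0
             for c in range(w)] for r in range(h)]

def input_pyramid(img, levels=3):
    """Return [full, 1/2, 1/4, ...] resolution copies of a 2D grid."""
    pyramid = [img]
    for _ in range(levels - 1):
        pyramid.append(avg_pool2x(pyramid[-1]))
    return pyramid

def bce(pred, target, eps=1e-7):
    """Mean binary cross-entropy between two same-sized 2D grids."""
    total, count = 0.0, 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
            total -= t * math.log(p) + (1.0 - t) * math.log(1.0 - p)
            count += 1
    return total / count

def deep_supervision_loss(side_preds, targets, weights):
    """Weighted sum of per-scale losses, one term per side output (weights are hypothetical)."""
    return sum(w * bce(p, t) for w, p, t in zip(weights, side_preds, targets))

# Toy 4x4 crack mask; the same pyramid routine would be applied to the input image.
mask = [[0, 0, 0, 0],
        [0, 1, 1, 0],
        [0, 1, 1, 0],
        [0, 0, 0, 0]]
targets = input_pyramid(mask, levels=3)  # grids of size 4x4, 2x2, 1x1

# Stand-in side outputs: an untrained model predicting 0.5 everywhere.
side_preds = [[[0.5] * len(t[0]) for _ in t] for t in targets]
loss = deep_supervision_loss(side_preds, targets, weights=[1.0, 0.5, 0.25])
```

Note that average-pooling a binary mask yields soft labels at the coarser scales; nearest-neighbour downsampling or upsampling the side outputs to full resolution are equally plausible choices that the abstract does not settle.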
Pages: 12