TransMF: Transformer-Based Multi-Scale Fusion Model for Crack Detection

被引:11
|
作者
Ju, Xiaochen [1 ]
Zhao, Xinxin [1 ]
Qian, Shengsheng [2 ]
机构
[1] China Acad Railway Sci Corp Ltd, Railway Engn Res Inst, Beijing 100081, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 100090, Peoples R China
关键词
crack detection; convolutional neural network; transformer; multi-scale fusion;
D O I
10.3390/math10132354
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Cracks are widespread in infrastructure that are closely related to human activity. It is very popular to use artificial intelligence to detect cracks intelligently, which is known as crack detection. The noise in the background of crack images, discontinuity of cracks and other problems make the crack detection task a huge challenge. Although many approaches have been proposed, there are still two challenges: (1) cracks are long and complex in shape, making it difficult to capture long-range continuity; (2) most of the images in the crack dataset have noise, and it is difficult to detect only the cracks and ignore the noise. In this paper, we propose a novel method called Transformer-based Multi-scale Fusion Model (TransMF) for crack detection, including an Encoder Module (EM), Decoder Module (DM) and Fusion Module (FM). The Encoder Module uses a hybrid of convolution blocks and Swin Transformer block to model the long-range dependencies of different parts in a crack image from a local and global perspective. The Decoder Module is designed with symmetrical structure to the Encoder Module. In the Fusion Module, the output in each layer with unique scales of Encoder Module and Decoder Module are fused in the form of convolution, which can release the effect of background noise and strengthen the correlations between relevant context in order to enhance the crack detection. Finally, the output of each layer of the Fusion Module is concatenated to achieve the purpose of crack detection. Extensive experiments on three benchmark datasets (CrackLS315, CRKWH100 and DeepCrack) demonstrate that the proposed TransMF in this paper exceeds the best performance of present baselines.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] A novel multi-scale network intrusion detection model with transformer
    Xi, Chiming
    Wang, Hui
    Wang, Xubin
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [22] Swin Transformer-Based Segmentation and Multi-Scale Feature Pyramid Fusion Module for Alzheimer's Disease with Machine Learning
    Gharaibeh, Nasr
    Abu-Ein, Ashraf A.
    Al-hazaimeh, Obaida M.
    Nahar, Khalid M. O.
    Abu-Ain, Waleed A.
    Al-Nawashi, Malek M.
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (04) : 22 - 50
  • [23] GGMNet: Pavement-Crack Detection Based on Global Context Awareness and Multi-Scale Fusion
    Wang, Yong
    He, Zhenglong
    Zeng, Xiangqiang
    Zeng, Juncheng
    Cen, Zongxi
    Qiu, Luyang
    Xu, Xiaowei
    Zhuo, Qunxiong
    REMOTE SENSING, 2024, 16 (10)
  • [24] Ship detection based on multi-scale weighted fusion*
    Zhou, Weina
    Peng, Yujie
    DISPLAYS, 2023, 78
  • [25] Pedestrian Detection Based on Multi-Scale Fusion Features
    Jiang, Hao
    Zhang, Chuang
    Wu, Ming
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 329 - 333
  • [26] Drone Detection Based on Multi-scale Feature Fusion
    Zeng, Zhenni
    Wang, Zhenning
    Qin, Lang
    Li, Hui
    2021 6TH INTERNATIONAL CONFERENCE ON UK-CHINA EMERGING TECHNOLOGIES (UCET 2021), 2021, : 194 - 198
  • [27] Face spoofing detection model based on multi-scale predictive feature fusion
    Huang, Ling
    He, Xi Ping
    He, Dan
    Proceedings of SPIE - The International Society for Optical Engineering, 2023, 12707
  • [28] A Robust Vehicle Detection Model Based on Attention and Multi-scale Feature Fusion
    Zhu, Yuxin
    Liu, Wenbo
    Yan, Fei
    Li, Jun
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 143 - 148
  • [29] Transformer-Based Multi-scale Optimization Network for Low-Light Image Enhancement
    Niu Y.
    Lin X.
    Xu H.
    Li Y.
    Chen Y.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (06): : 511 - 529
  • [30] Transformer-Based Multi-Scale Data-Driven Wellbore Risk Prediction Method
    Zhang, Hongyuan
    Liu, Yupei
    Zhang, Xingquan
    Yin, Zhiming
    2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 53 - 58