TransMF: Transformer-Based Multi-Scale Fusion Model for Crack Detection

被引:11
|
作者
Ju, Xiaochen [1 ]
Zhao, Xinxin [1 ]
Qian, Shengsheng [2 ]
机构
[1] China Acad Railway Sci Corp Ltd, Railway Engn Res Inst, Beijing 100081, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 100090, Peoples R China
关键词
crack detection; convolutional neural network; transformer; multi-scale fusion;
D O I
10.3390/math10132354
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Cracks are widespread in infrastructure that are closely related to human activity. It is very popular to use artificial intelligence to detect cracks intelligently, which is known as crack detection. The noise in the background of crack images, discontinuity of cracks and other problems make the crack detection task a huge challenge. Although many approaches have been proposed, there are still two challenges: (1) cracks are long and complex in shape, making it difficult to capture long-range continuity; (2) most of the images in the crack dataset have noise, and it is difficult to detect only the cracks and ignore the noise. In this paper, we propose a novel method called Transformer-based Multi-scale Fusion Model (TransMF) for crack detection, including an Encoder Module (EM), Decoder Module (DM) and Fusion Module (FM). The Encoder Module uses a hybrid of convolution blocks and Swin Transformer block to model the long-range dependencies of different parts in a crack image from a local and global perspective. The Decoder Module is designed with symmetrical structure to the Encoder Module. In the Fusion Module, the output in each layer with unique scales of Encoder Module and Decoder Module are fused in the form of convolution, which can release the effect of background noise and strengthen the correlations between relevant context in order to enhance the crack detection. Finally, the output of each layer of the Fusion Module is concatenated to achieve the purpose of crack detection. Extensive experiments on three benchmark datasets (CrackLS315, CRKWH100 and DeepCrack) demonstrate that the proposed TransMF in this paper exceeds the best performance of present baselines.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Multi-Scale Transformer-Based Matching Network for Generalizable Person Re-Identification
    Jiang, Jinhua
    Zhang, Wenfeng
    Ran, Ruisheng
    Hu, Wei
    Dai, Jiangyan
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 (1277-1281) : 1277 - 1281
  • [32] MSTD: A Multi-Scale Transformer-Based Method to Diagnose Benign and Malignant Lung Nodules
    Zhao, Xiaoyu
    Li, Jiao
    Qi, Man
    Chen, Xuxin
    Chen, Wei
    Li, Yongqun
    Liu, Qi
    Tang, Jiajia
    Han, Zhihai
    Zhang, Chunyang
    IEEE ACCESS, 2025, 13 : 16182 - 16195
  • [33] LTUNet: A Lightweight Transformer-Based UNet with Multi-scale Mechanism for Skin Lesion Segmentation
    Guo, Huike
    Zhang, Han
    Li, Minghe
    Quan, Xiongwen
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 147 - 158
  • [34] MSTFDN: Multi-scale transformer fusion dehazing network
    Yan Yang
    Haowen Zhang
    Xudong Wu
    Xiaozhen Liang
    Applied Intelligence, 2023, 53 : 5951 - 5962
  • [35] MSTFDN: Multi-scale transformer fusion dehazing network
    Yang, Yan
    Zhang, Haowen
    Wu, Xudong
    Liang, Xiaozhen
    APPLIED INTELLIGENCE, 2023, 53 (05) : 5951 - 5962
  • [36] Research on Multi-Scale CNN and Transformer-Based Multi-Level Multi-Classification Method for Images
    Gou, Quandeng
    Ren, Yuheng
    IEEE ACCESS, 2024, 12 : 103049 - 103059
  • [37] A Novel Multi-Scale Feature Fusion-Based 3SCNet for Building Crack Detection
    Yadav, Dhirendra Prasad
    Kishore, Kamal
    Gaur, Ashish
    Kumar, Ankit
    Singh, Kamred Udham
    Singh, Teekam
    Swarup, Chetan
    SUSTAINABILITY, 2022, 14 (23)
  • [38] MFT: Multi-scale Fusion Transformer for Infrared and Visible Image Fusion
    Zhang, Chen-Ming
    Yuan, Chengbo
    Luo, Yong
    Zhou, Xin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 485 - 496
  • [39] A Multi-Scale Transformer Fusion Deep Clustering Network for Unsupervised Planetary Change Detection
    Jia, Yutong
    Wan, Gang
    Liu, Jia
    Zhao, Chenxu
    Wang, Guoping
    Zhang, Yifan
    Liu, Lei
    Xie, Bin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [40] TMIF: transformer-based multi-modal interactive fusion for automatic rumor detection
    Lv, Jiandong
    Wang, Xingang
    Shao, Cuiling
    MULTIMEDIA SYSTEMS, 2022, 29 (5) : 2979 - 2989