Integrated pixel-level crack detection and quantification using an ensemble of advanced U-Net architectures

被引:0
|
作者
Rakshitha, R. [1 ]
Srinath, S. [1 ]
Kumar, N. Vinay [2 ]
Rashmi, S. [1 ]
Poornima, B., V [1 ]
机构
[1] JSS Sci & Technol Univ, Dept Comp Sci & Engn, Mysuru, India
[2] Freelance Res, Bangalore, India
关键词
Crack segmentation; Crack quantification; Deep learning; U; -Net; TransUNet; Swin-UNet; Ensemble learning; CONVOLUTIONAL NEURAL-NETWORK; PAVEMENT;
D O I
10.1016/j.rineng.2024.103726
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Automated pavement crack detection faces significant challenges due to the complex shapes of crack patterns, their similarity to non-crack textures, and varying environmental conditions such as lighting and noise. Traditional methods often struggle to adapt, leading to inconsistent and less accurate results in real-world scenarios. This study introduces a hybrid framework that combines convolutional and transformer-based architectures, leveraging their strengths to achieve reliable crack segmentation and pixel-level quantification. The framework incorporates state-of-the-art deep learning models, including U-Net, Attention U-Net, Residual Attention U-Net (RAUNet), TransUNet, and Swin-Unet. U-Net variants, enhanced with attention mechanisms and residual connections, improve feature extraction and gradient flow, enabling precise delineation of crack boundaries. Transformer-based models like TransUNet and Swin-Unet use self-attention mechanisms to capture both local and global spatial relationships, enhancing robustness across diverse crack patterns. A key contribution of this study is the evaluation of loss functions, including Binary Cross-Entropy (BCE) Loss, Dice Loss, and Binary Focal Loss. Binary Focal Loss proved particularly effective in addressing class imbalance across four benchmark datasets. To further improve segmentation performance, two ensemble strategies were applied: stochastic reordering using logical operations (AND, OR, and averaging) and a weighted average ensemble optimized through grid search. The weighted average ensemble demonstrated superior performance, achieving mean Intersection over Union (mIoU) scores of 0.73, 0.70, 0.78, and 0.86 on the CFD, AgileRN, Crack500, and DeepCrack datasets, respectively. In addition to segmentation, this study developed a method for accurately quantifying crack length and width. By using Euclidean distance along skeletal paths, the algorithm minimized error rates in length and width estimation. This framework provides a scalable and efficient solution for automated pavement crack analysis. It addresses critical challenges in accuracy, adaptability, and reliability under diverse operational conditions, marking significant progress in crack detection technology.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Defect Detection of Subway Tunnels Using Advanced U-Net Network
    Wang, An
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    SENSORS, 2022, 22 (06)
  • [32] U-Net Based Architectures for Document Text Detection and Binarization
    Nikitin, Filipp
    Dokholyan, Vladimir
    Zharikov, Ilia
    Strijov, Vadim
    ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT II, 2019, 11845 : 79 - 88
  • [33] Modeling automatic pavement crack object detection and pixel-level segmentation
    Du, Yuchuan
    Zhong, Shan
    Fang, Hongyuan
    Wang, Niannian
    Liu, Chenglong
    Wu, Difei
    Sun, Yan
    Xiang, Mang
    AUTOMATION IN CONSTRUCTION, 2023, 150
  • [34] Hybrid pixel-level concrete crack segmentation and quantification across complex backgrounds using deep learning
    Kang, Dongho
    Benipal, Sukhpreet S.
    Gopal, Dharshan L.
    Cha, Young-Jin
    AUTOMATION IN CONSTRUCTION, 2020, 118
  • [35] An Effective Hybrid Atrous Convolutional Network for Pixel-Level Crack Detection
    Chen, Hanshen
    Lin, Huiping
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [36] Hybrid pixel-level concrete crack segmentation and quantification across complex backgrounds using deep learning
    Kang, Dongho
    Benipal, Sukhpreet S.
    Gopal, Dharshan L.
    Cha, Young-Jin
    Cha, Young-Jin (young.cha@umanitoba.ca), 1600, Elsevier B.V., Netherlands (118):
  • [37] HDCB-Net: A Neural Network With the Hybrid Dilated Convolution for Pixel-Level Crack Detection on Concrete Bridges
    Jiang, Wenbo
    Liu, Min
    Peng, Yunuo
    Wu, Lehui
    Wang, Yaonan
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (08) : 5485 - 5494
  • [38] Addressing class imbalance in micro-CT image segmentation: A modified U-Net model with pixel-level class weighting
    Mahmoudi, Shahin
    Asghari, Omid
    Boisvert, Jeff
    COMPUTERS & GEOSCIENCES, 2025, 196
  • [39] Crack-Att Net: crack detection based on improved U-Net with parallel attention
    Na Xu
    Lizhi He
    Qing Li
    Multimedia Tools and Applications, 2023, 82 : 42465 - 42484
  • [40] Crack-Att Net: crack detection based on improved U-Net with parallel attention
    Xu, Na
    He, Lizhi
    Li, Qing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (27) : 42465 - 42484