Integrated pixel-level crack detection and quantification using an ensemble of advanced U-Net architectures

被引：0

作者：

Rakshitha, R. ^{[1
]}

Srinath, S. ^{[1
]}

Kumar, N. Vinay ^{[2
]}

Rashmi, S. ^{[1
]}

Poornima, B., V ^{[1
]}

机构：

[1] JSS Sci & Technol Univ, Dept Comp Sci & Engn, Mysuru, India

[2] Freelance Res, Bangalore, India

来源：

RESULTS IN ENGINEERING | 2025年 / 25卷

关键词：

Crack segmentation; Crack quantification; Deep learning; U; -Net; TransUNet; Swin-UNet; Ensemble learning; CONVOLUTIONAL NEURAL-NETWORK; PAVEMENT;

D O I：

10.1016/j.rineng.2024.103726

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Automated pavement crack detection faces significant challenges due to the complex shapes of crack patterns, their similarity to non-crack textures, and varying environmental conditions such as lighting and noise. Traditional methods often struggle to adapt, leading to inconsistent and less accurate results in real-world scenarios. This study introduces a hybrid framework that combines convolutional and transformer-based architectures, leveraging their strengths to achieve reliable crack segmentation and pixel-level quantification. The framework incorporates state-of-the-art deep learning models, including U-Net, Attention U-Net, Residual Attention U-Net (RAUNet), TransUNet, and Swin-Unet. U-Net variants, enhanced with attention mechanisms and residual connections, improve feature extraction and gradient flow, enabling precise delineation of crack boundaries. Transformer-based models like TransUNet and Swin-Unet use self-attention mechanisms to capture both local and global spatial relationships, enhancing robustness across diverse crack patterns. A key contribution of this study is the evaluation of loss functions, including Binary Cross-Entropy (BCE) Loss, Dice Loss, and Binary Focal Loss. Binary Focal Loss proved particularly effective in addressing class imbalance across four benchmark datasets. To further improve segmentation performance, two ensemble strategies were applied: stochastic reordering using logical operations (AND, OR, and averaging) and a weighted average ensemble optimized through grid search. The weighted average ensemble demonstrated superior performance, achieving mean Intersection over Union (mIoU) scores of 0.73, 0.70, 0.78, and 0.86 on the CFD, AgileRN, Crack500, and DeepCrack datasets, respectively. In addition to segmentation, this study developed a method for accurately quantifying crack length and width. By using Euclidean distance along skeletal paths, the algorithm minimized error rates in length and width estimation. This framework provides a scalable and efficient solution for automated pavement crack analysis. It addresses critical challenges in accuracy, adaptability, and reliability under diverse operational conditions, marking significant progress in crack detection technology.

引用

页数：21

共 50 条

[31] Defect Detection of Subway Tunnels Using Advanced U-Net Network
Wang, An
Togo, Ren
Ogawa, Takahiro
Haseyama, Miki
SENSORS, 2022, 22 (06)
[32] U-Net Based Architectures for Document Text Detection and Binarization
Nikitin, Filipp
Dokholyan, Vladimir
Zharikov, Ilia
Strijov, Vadim
ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT II, 2019, 11845 : 79 - 88
[33] Modeling automatic pavement crack object detection and pixel-level segmentation
Du, Yuchuan
Zhong, Shan
Fang, Hongyuan
Wang, Niannian
Liu, Chenglong
Wu, Difei
Sun, Yan
Xiang, Mang
AUTOMATION IN CONSTRUCTION, 2023, 150
[34] Hybrid pixel-level concrete crack segmentation and quantification across complex backgrounds using deep learning
Kang, Dongho
Benipal, Sukhpreet S.
Gopal, Dharshan L.
Cha, Young-Jin
AUTOMATION IN CONSTRUCTION, 2020, 118
[35] An Effective Hybrid Atrous Convolutional Network for Pixel-Level Crack Detection
Chen, Hanshen
Lin, Huiping
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
[36] Hybrid pixel-level concrete crack segmentation and quantification across complex backgrounds using deep learning
Kang, Dongho
Benipal, Sukhpreet S.
Gopal, Dharshan L.
Cha, Young-Jin
Cha, Young-Jin (young.cha@umanitoba.ca), 1600, Elsevier B.V., Netherlands (118):
[37] HDCB-Net: A Neural Network With the Hybrid Dilated Convolution for Pixel-Level Crack Detection on Concrete Bridges
Jiang, Wenbo
Liu, Min
Peng, Yunuo
Wu, Lehui
Wang, Yaonan
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (08) : 5485 - 5494
[38] Addressing class imbalance in micro-CT image segmentation: A modified U-Net model with pixel-level class weighting
Mahmoudi, Shahin
Asghari, Omid
Boisvert, Jeff
COMPUTERS & GEOSCIENCES, 2025, 196
[39] Crack-Att Net: crack detection based on improved U-Net with parallel attention
Na Xu
Lizhi He
Qing Li
Multimedia Tools and Applications, 2023, 82 : 42465 - 42484
[40] Crack-Att Net: crack detection based on improved U-Net with parallel attention
Xu, Na
He, Lizhi
Li, Qing
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (27) : 42465 - 42484

← 1 2 3 4 5 →