A road defect detection algorithm incorporating partially transformer and multiple aggregate trail attention mechanisms

Cited by: 0
Authors
Wang, Xueqiu [1, 2]
Gao, Huanbing [1, 2]
Jia, Zemeng [1, 2]
Zhao, Jiayang [3]
Affiliations
[1] Shandong Jianzhu Univ, Sch Informat & Elect Engn, Jinan 250101, Peoples R China
[2] Shandong Key Lab Intelligent Bldg Technol, Jinan 250101, Peoples R China
[3] Shandong Quanhai Automobile Technol Co, Liaocheng 252000, Peoples R China
Keywords
aggregate multiple coordinate attention; road damage detection; re-calibration FPN; CSP_PTB; CRACK DETECTION
DOI
10.1088/1361-6501/ada1e7
Chinese Library Classification (CLC)
T [Industrial Technology]
Discipline classification code
08
Abstract
Road infrastructure, fundamental to daily life, inevitably sustains damage over time. Timely and precise identification and remediation of road defects are critical to prolonging the lifespan of roads and ensuring driving safety. To address the limitations of the widely used You Only Look Once (YOLO) algorithm, including its insufficient receptive field and suboptimal detection accuracy, this paper introduces a novel road defect detection method. First, we propose a new attention mechanism, aggregate multiple coordinate attention, that effectively retains and concatenates channel information while preserving localization data, thereby enhancing the focus on intrinsic features. Second, we design a cross stage partial-partially transformer block (CSP_PTB) that combines CNNs and transformers to yield richer and more varied feature representations. Finally, we develop a novel neck structure, the re-calibrated feature pyramid network (Re-Calibration FPN), which selectively combines boundary and semantic information for finer object contour delineation and positional recalibration. Experimental results show that the S version of the proposed algorithm achieves a detection accuracy of 73.2% on the road defect dataset, 4.2% higher than that of the YOLOv8 algorithm. At 80 FPS, it also meets the requirements for real-time detection, striking a good balance between detection speed and detection accuracy. It further exhibits excellent generalizability and robustness on the UAV asphalt pavement distress and PASCAL VOC 2007 datasets.
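The abstract does not give the internals of aggregate multiple coordinate attention. As a rough illustration of the general idea its name points to, the sketch below shows the direction-wise pooling used in standard coordinate attention: a channel is averaged along its width and along its height separately, so each descriptor still carries position along the other axis. This is not the authors' module. The function names and the toy feature map are hypothetical, and a plain sigmoid stands in for the learned convolution layers a real implementation would use.

```python
import math
from typing import List

Grid = List[List[float]]  # one feature-map channel: H rows by W columns


def pool_along_width(channel: Grid) -> List[float]:
    """Average each row: one descriptor per height position (length H)."""
    return [sum(row) / len(row) for row in channel]


def pool_along_height(channel: Grid) -> List[float]:
    """Average each column: one descriptor per width position (length W)."""
    height = len(channel)
    return [sum(column) / height for column in zip(*channel)]


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def reweight(channel: Grid, row_desc: List[float], col_desc: List[float]) -> Grid:
    """Scale every pixel by a row gate and a column gate.

    Assumption: the gates are a plain sigmoid of the pooled descriptors.
    Real coordinate attention learns them with 1x1 convolutions.
    """
    row_gate = [sigmoid(v) for v in row_desc]
    col_gate = [sigmoid(v) for v in col_desc]
    return [
        [value * row_gate[i] * col_gate[j] for j, value in enumerate(row)]
        for i, row in enumerate(channel)
    ]


# Hypothetical 2x3 channel with a vertical, crack-like response in the middle column.
feature = [
    [0.0, 2.0, 0.0],
    [0.0, 4.0, 0.0],
]
row_desc = pool_along_width(feature)   # keeps position along the height axis
col_desc = pool_along_height(feature)  # keeps position along the width axis
attended = reweight(feature, row_desc, col_desc)
```

Because the two descriptors are pooled along different axes, the column descriptor still singles out where the strong response sits horizontally, which a single global average over the whole channel would lose.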
Pages: 20