RFCS-YOLO: Target Detection Algorithm in Adverse Weather Conditions via Receptive Field Enhancement and Cross-Scale Fusion

被引:0
|
作者
Liu, Gang [1 ,2 ]
Huang, Yingzheng [2 ]
Yan, Shuguang [2 ]
Hou, Enxiang [2 ]
机构
[1] Wuxi Univ, Jiangsu Prov Engn Res Ctr Photon Devices & Syst In, Wuxi 214105, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Elect & Informat Engn, Nanjing 210044, Peoples R China
基金
中国国家自然科学基金;
关键词
receptive field enhancement; cross-scale fusion; attention mechanism; YOLOv7; loss function;
D O I
10.3390/s25030912
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The paper proposes a model based on receptive field enhancement and cross-scale fusion (RFCS-YOLO). It addresses challenges like complex backgrounds and problems of missing and mis-detecting traffic targets in bad weather. First, an efficient feature extraction module (EFEM) is created. It reconfigures the backbone network. This helps to make the receptive field better and improves its ability to extract features of targets at different scales. Next, a cross-scale fusion module (CSF) is introduced. It uses the receptive field coordinate attention mechanism (RFCA) to fuse information from different scales well. It also filters out noise and background information that might interfere. Also, a new Focaler-Minimum Point Distance Intersection over Union (F-MPDIoU) loss function is proposed. It makes the model converge faster and deals with issues of leakage and false detection. Experiments were conducted on the expanded Vehicle Detection in Adverse Weather Nature dataset (DWAN). The results show significant improvements compared to the conventional You Only Look Once v7 (YOLOv7) model. The mean Average Precision (mAP@0.5), precision, and recall are enhanced by 4.2%, 8.3%, and 1.4%, respectively. The mean Average Precision is 86.5%. The frame rate is 68 frames per second (FPS), which meets the requirements for real-time detection. A generalization experiment was conducted using the autonomous driving dataset SODA10M. The mAP@0.5 achieved 56.7%, which is a 3.6% improvement over the original model. This result demonstrates the good generalization ability of the proposed method.
引用
收藏
页数:20
相关论文
共 26 条
  • [21] RE-YOLO: An apple picking detection algorithm fusing receptive-field attention convolution and efficient multi-scale attention
    Sui, Jinxue
    Liu, Li
    Wang, Zuoxun
    Yang, Li
    PLOS ONE, 2025, 20 (03):
  • [22] Enhanced feature extraction YOLO industrial small object detection algorithm based on receptive-field attention and multi-scale features
    Tao, Hongfeng
    Zheng, Yuechang
    Wang, Yue
    Qiu, Jier
    Stojanovic, Vladimir
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (10)
  • [23] ICGM-YOLO: UAV remote sensing small target detection algorithm based on feature fusion enhancement module and ghost convolution
    Zhang, Hongbin
    Yang, Jingmin
    Zhang, Wenjie
    Ren, Jinghui
    PHYSICA SCRIPTA, 2024, 99 (12)
  • [24] Weakly Supervised RGBT salient object detection via SAM-Guided Label Optimization and Progressive Cross-modal Cross-scale Fusion
    Zhai, Sulan
    Liu, Chengzhuang
    Tu, Zhengzheng
    Li, Chenglong
    Gao, Liuxuanqi
    INFORMATION FUSION, 2025, 120
  • [25] MC-Refine: Enhanced Cross-Modal 3-D Object Detection via Multistage Cross-Scale Fusion and Box Refinement
    Xu, Wencai
    Hu, Jie
    Tang, Yuxuan
    Chen, Jiaji
    Chen, Nan
    Wang, Zhanbin
    IEEE SENSORS JOURNAL, 2025, 25 (01) : 1784 - 1798
  • [26] MAL-YOLO: a lightweight algorithm for target detection in side-scan sonar images based on multi-scale feature fusion and attention mechanism
    Cao, Yu
    Cui, Xiaodong
    Gan, Mingyi
    Wang, Yaxue
    Yang, Fanlin
    Huang, Yi
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)