Res2Net-based multi-scale and multi-attention model for traffic scene image classification

被引:0
|
作者
Gao, Guanghui [1 ]
Guo, Yining [1 ]
Zhou, Lumei [1 ]
Li, Li [1 ]
Shi, Gang [1 ]
机构
[1] Xinjiang Univ, Sch Comp Sci & Technol, Urumqi, Peoples R China
来源
PLOS ONE | 2024年 / 19卷 / 05期
关键词
D O I
10.1371/journal.pone.0300017
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
With the increasing applications of traffic scene image classification in intelligent transportation systems, there is a growing demand for improved accuracy and robustness in this classification task. However, due to weather conditions, time, lighting variations, and annotation costs, traditional deep learning methods still have limitations in extracting complex traffic scene features and achieving higher recognition accuracy. The previous classification methods for traffic scene images had gaps in multi-scale feature extraction and the combination of frequency domain, spatial, and channel attention. To address these issues, this paper proposes a multi-scale and multi-attention model based on Res2Net. Our proposed framework introduces an Adaptive Feature Refinement Pyramid Module (AFRPM) to enhance multi-scale feature extraction, thus improving the accuracy of traffic scene image classification. Additionally, we integrate frequency domain and spatial-channel attention mechanisms to develop recognition capabilities for complex backgrounds, objects of different scales, and local details in traffic scene images. The paper conducts the task of classifying traffic scene images using the Traffic-Net dataset. The experimental results demonstrate that our model achieves an accuracy of 96.88% on this dataset, which is an improvement of approximately 2% compared to the baseline Res2Net network. Furthermore, we validate the effectiveness of the proposed modules through ablation experiments.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Hyperspectral Image Classification Based on Multi-Scale Convolutional Features and Multi-Attention Mechanisms
    Sun, Qian
    Zhao, Guangrui
    Xia, Xinyuan
    Xie, Yu
    Fang, Chenrong
    Sun, Le
    Wu, Zebin
    Pan, Chengsheng
    REMOTE SENSING, 2024, 16 (12)
  • [2] Remote sensing scene image classification model based on multi-scale features and attention mechanism
    Wang, Guowei
    Xu, Haixia
    Wang, Xinyu
    Yuan, Liming
    Wen, Xianbin
    JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (04)
  • [3] Hyperspectral Image Classification Based on SE-Res2Net and Multi-Scale Spatial Spectral Fusion Attention Mechanism
    Xu Q.
    Liang Y.
    Wang D.
    Luo B.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (11): : 1726 - 1734
  • [4] Multi-scale Multi-attention Network for Moire Document Image Binarization
    Guo, Yanqing
    Ji, Caijuan
    Zheng, Xin
    Wang, Qianyu
    Luo, Xiangyang
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 90
  • [5] Multi-Attention Object Detection Model in Remote Sensing Images Based on Multi-Scale
    Ying, Xiang
    Wang, Qiang
    Li, Xuewei
    Yu, Mei
    Jiang, Han
    Gao, Jie
    Liu, Zhiqiang
    Yu, Ruiguo
    IEEE ACCESS, 2019, 7 : 94508 - 94519
  • [6] Multi-view Remote Sensing Image Scene Classification by Fusing Multi-scale Attention
    Shi Y.
    Zhou W.
    Shao Z.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2024, 49 (03): : 366 - 375
  • [7] Remote Sensing Image Scene Classification Method Based on Multi-Scale Cyclic Attention Network
    Ma X.
    Wang L.
    Qi K.
    Zheng G.
    Diqiu Kexue - Zhongguo Dizhi Daxue Xuebao/Earth Science - Journal of China University of Geosciences, 2021, 46 (10): : 3740 - 3752
  • [8] A New Multi-Scale Convolutional Model Based on Multiple Attention for Image Classification
    Yang, Yadong
    Xu, Chengji
    Dong, Feng
    Wang, Xiaofeng
    APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [9] Image scene classification based on multi-scale and contextual semantic information
    Zhang, Rui-Jie
    Li, Bi-Cheng
    Wei, Fu-Shan
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2014, 42 (04): : 646 - 652
  • [10] MSCE-Net: Multi-scale Spatial and Channel Enhancing Net based on Attention for Cloud Image Classification
    Lin, Chih-Wei
    Jin, Lingjie
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2458 - 2465