Multispectral Pedestrian Detection Based on Prior-Saliency Attention and Image Fusion

Authors
Guo, Jiaren [1, 2]
Huang, Zihao [1, 2]
Tao, Yanyun [1, 2, 3]
Affiliations
[1] Soochow Univ, Sch Rail Transportat, Suzhou 215005, Peoples R China
[2] Suzhou Transportat Big Data Innovat & Applicat Lab, Suzhou 215005, Peoples R China
[3] Shanghai Jiao Tong Univ, Key Lab Informat Proc & Intelligent Control, Shanghai 350121, Peoples R China
Keywords
multispectral; pedestrian detection; feature fusion; computer vision; prior-attention; NEURAL-NETWORKS
DOI
10.3390/electronics13091770
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Detecting pedestrians under varying illumination conditions is a significant challenge that calls for new solutions. In response, we introduce Prior-AttentionNet, a pedestrian detection model featuring a Prior-Attention mechanism. The model exploits the stark contrast between thermal objects and their backgrounds in far-infrared (FIR) images by employing saliency attention derived from FIR images via UNet. However, extracting salient regions of diverse scales from FIR images is difficult for saliency attention alone. To address this, we integrate Simple Linear Iterative Clustering (SLIC) superpixel segmentation, embedding the segmentation feature map as prior knowledge into UNet's decoding stage for end-to-end training and detection. This integration enhances the extraction of focused attention regions, and the synergy of segmentation prior and saliency attention forms the core of Prior-AttentionNet. Moreover, to enrich pedestrian details and contour visibility in low-light conditions, we implement multispectral image fusion. Experimental evaluations were conducted on the KAIST and OTCBVS datasets. Applying the Prior-Attention mode to FIR-RGB images significantly improves the delineation of, and focus on, multi-scale pedestrians. Prior-AttentionNet's general detector is capable of detecting pedestrians with minimal computational resources. The ablation studies indicate that the FIR-RGB + Prior-Attention mode markedly enhances detection robustness over other modes. Compared with conventional multispectral pedestrian detection models, Prior-AttentionNet consistently surpasses them, achieving higher mean average precision and lower miss rates in diverse scenarios, during both day and night.
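The abstract reports results as miss rates. As a minimal sketch of how a miss rate can be computed for one image, the snippet below matches detections to ground-truth boxes by intersection over union (IoU) and returns the fraction of ground-truth pedestrians left unmatched. This is not the authors' evaluation code: the function names, the greedy score-ordered matching, and the 0.5 IoU threshold are assumptions made for illustration, and benchmark protocols may differ (for example by averaging miss rate over a range of false-positive rates).

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), hypothetical box format


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def miss_rate(gt: List[Box],
              dets: List[Tuple[Box, float]],
              iou_thr: float = 0.5) -> float:
    """Fraction of ground-truth boxes with no matching detection.

    Detections are (box, score) pairs. They are visited in descending
    score order, and each one claims the unmatched ground-truth box with
    the highest IoU, provided that IoU is at least iou_thr (assumed 0.5).
    """
    if not gt:
        return 0.0
    matched = set()
    for box, _score in sorted(dets, key=lambda d: d[1], reverse=True):
        best_j, best_iou = -1, iou_thr
        for j, g in enumerate(gt):
            if j in matched:
                continue
            overlap = iou(box, g)
            if overlap >= best_iou:
                best_j, best_iou = j, overlap
        if best_j >= 0:
            matched.add(best_j)
    return 1.0 - len(matched) / len(gt)


if __name__ == "__main__":
    ground_truth = [(0, 0, 10, 10), (20, 20, 30, 30)]
    detections = [((1, 1, 10, 10), 0.9), ((50, 50, 60, 60), 0.8)]
    print(miss_rate(ground_truth, detections))
```

In the usage example, one of the two ground-truth boxes is matched and the other is missed, so half of the pedestrians count as misses. A lower value is better, which is the sense in which the abstract reports "lower miss rates".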
Pages: 15