Multispectral Pedestrian Detection Based on Prior-Saliency Attention and Image Fusion

被引:0
|
作者
Guo, Jiaren [1 ,2 ]
Huang, Zihao [1 ,2 ]
Tao, Yanyun [1 ,2 ,3 ]
机构
[1] Soochow Univ, Sch Rail Transportat, Suzhou 215005, Peoples R China
[2] Suzhou Transportat Big Data Innovat & Applicat Lab, Suzhou 215005, Peoples R China
[3] Shanghai Jiao Tong Univ, Key Lab Informat Proc & Intelligent Control, Shanghai 350121, Peoples R China
关键词
multispectral; pedestrian detection; feature fusion; computer vision; prior-attention; NEURAL-NETWORKS;
D O I
10.3390/electronics13091770
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Detecting pedestrians in varying illumination conditions poses a significant challenge, necessitating the development of innovative solutions. In response to this, we introduce Prior-AttentionNet, a pedestrian detection model featuring a Prior-Attention mechanism. This model leverages the stark contrast between thermal objects and their backgrounds in far-infrared (FIR) images by employing saliency attention derived from FIR images via UNet. However, extracting salient regions of diverse scales from FIR images poses a challenge for saliency attention. To address this, we integrate Simple Linear Iterative Clustering (SLIC) superpixel segmentation, embedding the segmentation feature map as prior knowledge into UNet's decoding stage for comprehensive end-to-end training and detection. This integration enhances the extraction of focused attention regions, with the synergy of segmentation prior and saliency attention forming the core of Prior-AttentionNet. Moreover, to enrich pedestrian details and contour visibility in low-light conditions, we implement multispectral image fusion. Experimental evaluations were conducted on the KAIST and OTCBVS datasets. Applying Prior-Attention mode to FIR-RGB images significantly improves the delineation and focus on multi-scale pedestrians. Prior-AttentionNet's general detector demonstrates the capability of detecting pedestrians with minimal computational resources. The ablation studies indicate that the FIR-RGB+ Prior-Attention mode markedly enhances detection robustness over other modes. When compared to conventional multispectral pedestrian detection models, Prior-AttentionNet consistently surpasses them by achieving higher mean average precision and lower miss rates in diverse scenarios, during both day and night.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Attention Based Multi-Layer Fusion of Multispectral Images for Pedestrian Detection
    Zhang, Yongtao
    Yin, Zhishuai
    Nie, Linzhen
    Huang, Song
    IEEE ACCESS, 2020, 8 : 165071 - 165084
  • [2] Attention Fusion for One-Stage Multispectral Pedestrian Detection
    Cao, Zhiwei
    Yang, Huihua
    Zhao, Juan
    Guo, Shuhong
    Li, Lingqiao
    SENSORS, 2021, 21 (12)
  • [3] Deep saliency detection-based pedestrian detection with multispectral multi-scale features fusion network
    Ma, Li
    Wang, Jinjin
    Dai, Xinguan
    Gao, Hangbiao
    FRONTIERS IN PHYSICS, 2024, 11
  • [4] Attention-based Cross-Modality Multiscale Fusion for Multispectral Pedestrian Detection
    Hui, Zhou
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 1244 - 1253
  • [5] Image Saliency Detection Based on Background Prior and Multi-feature Fusion
    Jia, Chao
    Jia, Changrun
    Kong, Fanshu
    2020 5TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (IEEE ICBDA 2020), 2020, : 276 - 281
  • [6] Multiscale Image Fusion for Pansharpening of Multispectral Images using Saliency Detection
    Shruti
    Budhiraja, Sumit
    2016 NINTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2016, : 361 - 366
  • [7] A Background Prior based Saliency Detection for JPEG Image
    Sun, Xiaolong
    Liu, Zhanghui
    Guo, Wenzhong
    2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 1, 2014, : 166 - 169
  • [8] SALIENCY DETECTION BASED ON EXTENDED BOUNDARY PRIOR WITH FOCI OF ATTENTION
    Li, Yijun
    Fu, Keren
    Zhou, Lei
    Qiao, Yu
    Yang, Jie
    Li, Bai
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [9] FDENet: Fusion Depth Semantics and Edge-Attention Information for Multispectral Pedestrian Detection
    Liu, Xiaowei
    Xu, Xinying
    Xie, Jun
    Li, Pengyue
    Wei, Jiamin
    Sang, Yiyu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (06) : 5441 - 5448
  • [10] HYPERSPECTRAL AND MULTISPECTRAL IMAGE FUSION BASED ON DEEP ATTENTION NETWORK
    Yang, Qing
    Xu, Yang
    Wu, Zebin
    Wei, Zhihui
    2019 10TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING - EVOLUTION IN REMOTE SENSING (WHISPERS), 2019,