Multispectral Pedestrian Detection Based on Prior-Saliency Attention and Image Fusion

被引:0
|
作者
Guo, Jiaren [1 ,2 ]
Huang, Zihao [1 ,2 ]
Tao, Yanyun [1 ,2 ,3 ]
机构
[1] Soochow Univ, Sch Rail Transportat, Suzhou 215005, Peoples R China
[2] Suzhou Transportat Big Data Innovat & Applicat Lab, Suzhou 215005, Peoples R China
[3] Shanghai Jiao Tong Univ, Key Lab Informat Proc & Intelligent Control, Shanghai 350121, Peoples R China
关键词
multispectral; pedestrian detection; feature fusion; computer vision; prior-attention; NEURAL-NETWORKS;
D O I
10.3390/electronics13091770
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Detecting pedestrians in varying illumination conditions poses a significant challenge, necessitating the development of innovative solutions. In response to this, we introduce Prior-AttentionNet, a pedestrian detection model featuring a Prior-Attention mechanism. This model leverages the stark contrast between thermal objects and their backgrounds in far-infrared (FIR) images by employing saliency attention derived from FIR images via UNet. However, extracting salient regions of diverse scales from FIR images poses a challenge for saliency attention. To address this, we integrate Simple Linear Iterative Clustering (SLIC) superpixel segmentation, embedding the segmentation feature map as prior knowledge into UNet's decoding stage for comprehensive end-to-end training and detection. This integration enhances the extraction of focused attention regions, with the synergy of segmentation prior and saliency attention forming the core of Prior-AttentionNet. Moreover, to enrich pedestrian details and contour visibility in low-light conditions, we implement multispectral image fusion. Experimental evaluations were conducted on the KAIST and OTCBVS datasets. Applying Prior-Attention mode to FIR-RGB images significantly improves the delineation and focus on multi-scale pedestrians. Prior-AttentionNet's general detector demonstrates the capability of detecting pedestrians with minimal computational resources. The ablation studies indicate that the FIR-RGB+ Prior-Attention mode markedly enhances detection robustness over other modes. When compared to conventional multispectral pedestrian detection models, Prior-AttentionNet consistently surpasses them by achieving higher mean average precision and lower miss rates in diverse scenarios, during both day and night.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] SEDMI: Saliency based edge detection in multispectral images
    Dinh, Cuong V.
    Leitner, Raimund
    Paclik, Pavel
    Loog, Marco
    Duin, Robert P. W.
    IMAGE AND VISION COMPUTING, 2011, 29 (08) : 546 - 556
  • [42] Prior Fusion and Feature Transformation-Based Principal Component Analysis for Saliency Detection
    Shan, Dongjing
    Zhang, Chao
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (08)
  • [43] Deep Feature Fusion by Competitive Attention for Pedestrian Detection
    Chen, Zhichang
    Zhang, Li
    Khattak, Abdul Mateen
    Gao, Wanlin
    Wang, Minjuan
    IEEE ACCESS, 2019, 7 : 21981 - 21989
  • [44] SAF: Semantic Attention Fusion Mechanism for Pedestrian Detection
    Yu, Ruizhe
    Wang, Shunzhou
    Lu, Yao
    Di, Huijun
    Zhang, Lin
    Lu, Lihua
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 523 - 533
  • [45] Visual Attention Modeling in Compressed Domain:From Image Saliency Detection to Video Saliency Detection
    FANG Yuming
    ZHANG Xiaoqiang
    ZTECommunications, 2019, 17 (01) : 31 - 37
  • [46] PROPAGATION BASED SALIENCY DETECTION FOR INFRARED PEDESTRIAN IMAGES
    Zheng, Yu
    Zhou, Fugen
    Li, Lu
    Bai, Xiangzhi
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1527 - 1531
  • [47] Cross-modality complementary information fusion for multispectral pedestrian detection
    Chaoqi Yan
    Hong Zhang
    Xuliang Li
    Yifan Yang
    Ding Yuan
    Neural Computing and Applications, 2023, 35 : 10361 - 10386
  • [48] Cross-modality complementary information fusion for multispectral pedestrian detection
    Yan, Chaoqi
    Zhang, Hong
    Li, Xuliang
    Yang, Yifan
    Yuan, Ding
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (14): : 10361 - 10386
  • [49] Feature Map Swap: Multispectral Data Fusion Method for Pedestrian Detection
    Ryu, Junhwan
    Kim, Sungho
    2019 19TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2019), 2019, : 319 - 323
  • [50] Pedestrian Detection Based on Visual Saliency and Supervised Learning
    Zhang, Wanhan
    Ren, Jie
    Gu, Meihua
    ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, BICS 2018, 2018, 10989 : 506 - 513