Cross-modality complementary information fusion for multispectral pedestrian detection

Cited by: 12
Authors
Yan, Chaoqi [1]
Zhang, Hong [1]
Li, Xuliang [1]
Yang, Yifan [2]
Yuan, Ding [1]
Affiliations
[1] Beihang Univ, Image Proc Ctr, 37 Xueyuan Rd, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, 37 Xueyuan Rd, Beijing 100191, Peoples R China
Source
NEURAL COMPUTING & APPLICATIONS | 2023 / Vol. 35 / Issue 14
Funding
National Natural Science Foundation of China;
Keywords
Multispectral pedestrian detection; Cross-modality; Information fusion; Illumination-aware; Feature alignment; DEEP NEURAL-NETWORKS;
DOI
10.1007/s00521-023-08239-z
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Multispectral pedestrian detection has received increasing attention in recent years because color and thermal modalities provide complementary visual information, especially under insufficient illumination. However, a crucial problem persists: how to design a cross-modality fusion mechanism that fully exploits the complementary characteristics of the different modalities. In this paper, we propose a novel cross-modality complementary information fusion network (denoted as CCIFNet) that captures long-range interactions with precise positional information while preserving the inter-spatial relationship between the modalities in the feature extraction stage. Further, we design an adaptive illumination-aware weight generation module that weights the final detection confidences of the color and thermal modalities according to the illumination conditions. Specifically, we compare three fusion strategies for this module to find the best way of generating the final illumination-aware fusion weights. Finally, we present a simple but effective feature alignment module to alleviate the position shift problem caused by weakly aligned color-thermal image pairs. Extensive experiments and ablation studies on the KAIST, CVC-14, FLIR and LLVIP multispectral object detection datasets show that the proposed CCIFNet achieves state-of-the-art performance under different illumination evaluation settings while keeping a competitive speed-accuracy trade-off for real-time applications.
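The idea of weighting the detection confidences of the two modalities by illumination can be illustrated with a minimal sketch. This is not the CCIFNet module: the abstract describes a learned weight generation module and does not give its formula, so the linear blend below, the function name `fuse_confidences`, and the parameter `illumination_weight` are all assumptions made for illustration only.

```python
def fuse_confidences(color_scores, thermal_scores, illumination_weight):
    """Blend per-detection confidences from the color and thermal branches.

    Assumption (not from the paper): a single scalar illumination_weight in
    [0, 1], close to 1 for well-lit scenes (trust color) and close to 0 for
    dark scenes (trust thermal). A real module would predict this weight.
    """
    if not 0.0 <= illumination_weight <= 1.0:
        raise ValueError("illumination_weight must lie in [0, 1]")
    if len(color_scores) != len(thermal_scores):
        raise ValueError("both branches must score the same detections")

    w = illumination_weight
    # Convex combination: each fused score lies between the two branch scores.
    return [w * c + (1.0 - w) * t for c, t in zip(color_scores, thermal_scores)]


if __name__ == "__main__":
    # Hypothetical night-time scene: the thermal branch dominates.
    print(fuse_confidences([0.30, 0.10], [0.90, 0.70], illumination_weight=0.2))
```

With a weight near 0 the fused scores follow the thermal branch, and with a weight near 1 they follow the color branch, which is the behavior the abstract attributes to illumination-aware fusion in general terms.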
Pages: 10361 - 10386
Number of pages: 26
Related papers
50 in total
  • [1] Cross-modality complementary information fusion for multispectral pedestrian detection
    Chaoqi Yan
    Hong Zhang
    Xuliang Li
    Yifan Yang
    Ding Yuan
    Neural Computing and Applications, 2023, 35 : 10361 - 10386
  • [2] Illumination-Aware Cross-Modality Differential Fusion Multispectral Pedestrian Detection
    Wang, Chishe
    Qian, Jinjin
    Wang, Jie
    Chen, Yuting
    ELECTRONICS, 2023, 12 (17)
  • [3] Attention-based Cross-Modality Multiscale Fusion for Multispectral Pedestrian Detection
    Hui, Zhou
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 1244 - 1253
  • [4] Cross-modality interactive attention network for multispectral pedestrian detection
    Zhang, Lu
    Liu, Zhiyong
    Zhang, Shifeng
    Yang, Xu
    Qiao, Hong
    Huang, Kaizhu
    Hussain, Amir
    INFORMATION FUSION, 2019, 50 : 20 - 29
  • [5] Cross-modality feature fusion for night pedestrian detection
    Feng, Yong
    Luo, Enbo
    Lu, Hai
    Zhai, SuWei
    FRONTIERS IN PHYSICS, 2024, 12
  • [6] MCANet: Multiscale Cross-Modality Attention Network for Multispectral Pedestrian Detection
    Wang, Xiaotian
    Zhao, Letian
    Wu, Wei
    Jin, Xi
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 41 - 53
  • [7] Attention-based Cross-modality Interaction for Multispectral Pedestrian Detection
    Liu, Tianshan
    Zhao, Rui
    Lam, Kin-Man
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2021, 2021, 11766
  • [8] Attention-Based Cross-Modality Feature Complementation for Multispectral Pedestrian Detection
    Jiang, Qunyan
    Dai, Juying
    Rui, Ting
    Shao, Faming
    Wang, Jinkang
    Lu, Guanlin
    IEEE ACCESS, 2022, 10 : 53797 - 53809
  • [9] MCAFNet: Multiscale cross-modality adaptive fusion network for multispectral object detection
    Zheng, Shangpo
    Liu, Junfeng
    Jun, Zeng
    DIGITAL SIGNAL PROCESSING, 2025, 159
  • [10] Cyclic Cross-Modality Interaction for Hyperspectral and Multispectral Image Fusion
    Chen, Shi
    Zhang, Lefei
    Zhang, Liangpei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (01) : 741 - 753