Cross-modality complementary information fusion for multispectral pedestrian detection

被引:12
|
作者
Yan, Chaoqi [1 ]
Zhang, Hong [1 ]
Li, Xuliang [1 ]
Yang, Yifan [2 ]
Yuan, Ding [1 ]
机构
[1] Beihang Univ, Image Proc Ctr, 37 Xueyuan Rd, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, 37 Xueyuan Rd, Beijing 100191, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 14期
基金
中国国家自然科学基金;
关键词
Multispectral pedestrian detection; Cross-modality; Information fusion; Illumination-aware; Feature alignment; DEEP NEURAL-NETWORKS;
D O I
10.1007/s00521-023-08239-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multispectral pedestrian detection has received increasing attention in recent years as color and thermal modalities can provide complementary visual information, especially under insufficient illumination conditions. However, there is still a persistent crucial problem that how to design the cross-modality fusion mechanism to fully exploit the complementary characteristics between different modalities. In this paper, we propose a novel cross-modality complementary information fusion network (denoted as CCIFNet) to comprehensively capture the long-range interactions with precise positional information and meanwhile preserve the inter-spatial relationship between different modalities in the feature extraction stage. Further, we design an adaptive illumination-aware weight generation module to adaptively weight the final detection confidence of color and thermal modalities by taking various illumination conditions into consideration. Specifically, we comprehensively compare three different fusion strategies about this module to synthetically explore the best way for generating the final illumination-aware fusion weights. Finally, we present a simple but effective feature alignment module to alleviate the position shift problem caused by the weakly aligned color-thermal image pairs. Extensive experiments and ablation studies on KAIST, CVC-14, FLIR and LLVIP multispectral object detection datasets show that the proposed CCIFNet can achieve state-of-the-art performance under different illumination evaluation settings, while keeping a competitive speed-accuracy trade-off for real-time applications.
引用
收藏
页码:10361 / 10386
页数:26
相关论文
共 50 条
  • [41] SiamSMN: Siamese Cross-Modality Fusion Network for Object Tracking
    Han, Shuo
    Gao, Lisha
    Wu, Yue
    Wei, Tian
    Wang, Manyu
    Cheng, Xu
    INFORMATION, 2024, 15 (07)
  • [42] Cross-modality image feature fusion diagnosis in breast cancer
    Jiang, Mingkuan
    Han, Lu
    Sun, Hang
    Li, Jing
    Bao, Nan
    Li, Hong
    Zhou, Shi
    Yu, Tao
    PHYSICS IN MEDICINE AND BIOLOGY, 2021, 66 (10):
  • [43] Cross-Modality 3D Object Detection
    Zhu, Ming
    Ma, Chao
    Ji, Pan
    Yang, Xiaokang
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3771 - 3780
  • [44] Multi-Granularity and Cross-Modality Pedestrian Re-Identification Algorithm
    Xiong Wei
    Yue Ling
    Zhou Lei
    Zhang Kai
    Li Lirong
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (22)
  • [45] Deep adaptive fusion with cross-modality feature transition and modality quaternion learning for medical image fusion
    Srivastava, Somya
    Bhatia, Shaveta
    Agrawal, Arun Prakash
    Jayswal, Anant Kumar
    Godara, Jyoti
    Dubey, Gaurav
    EVOLVING SYSTEMS, 2025, 16 (01)
  • [46] Cross-Modality Target Detection Using Infrared and Visible Image Fusion for Robust Objection recognition
    Yu, Hang
    Gao, Jichen
    Zhou, Suiping
    Li, Chenyang
    Shi, Jiaqi
    Guo, Feng
    COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123
  • [47] Stabilizing Multispectral Pedestrian Detection With Evidential Hybrid Fusion
    Li, Qing
    Zhang, Changqing
    Hu, Qinghua
    Zhu, Pengfei
    Fu, Huazhu
    Chen, Lei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 3017 - 3029
  • [48] Guided Attentive Feature Fusion for Multispectral Pedestrian Detection
    Zhang, Heng
    Fromont, Elisa
    Lefevre, Sebastien
    Avignon, Bruno
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 72 - 80
  • [49] Exploiting fusion architectures for multispectral pedestrian detection and segmentation
    Guan, Dayan
    Cao, Yanpeng
    Yang, Jiangxin
    Cao, Yanlong
    Tisse, Christel-Loic
    APPLIED OPTICS, 2018, 57 (18) : D108 - D116
  • [50] A multispectral feature fusion network for robust pedestrian detection
    Song, Xiaoru
    Gao, Song
    Chen, Chaobo
    ALEXANDRIA ENGINEERING JOURNAL, 2021, 60 (01) : 73 - 85