Multiscale Feature Fusion Approach for Dual-Modal Object Detection

被引:0
|
作者
Zhang, Rui [1 ]
Li, Yunchen [1 ]
Wang, Jiabao [1 ]
Chen, Yao [1 ]
Wang, Ziqi [1 ]
Li, Yang [1 ]
机构
[1] College of Command and Control Engineering, Army Engineering University of PLA, Nanjing,210007, China
关键词
Benchmarking - Feature extraction - Image enhancement - Image fusion - Image texture - Large datasets - Modal analysis - Object detection - Object recognition;
D O I
10.3778/j.issn.1002-8331.2305-0412
中图分类号
学科分类号
摘要
Object detection based on visible images is difficult to adapt to complex lighting conditions such as low light, no light, strong light, etc., while object detection based on infrared images is greatly affected by background noise. Infrared objects lack color information and have weak texture features, which pose a greater challenge. To address these problems, a dual-modal object detection approach that can effectively fuse the features of visible and infrared dual-modal images is proposed. A multiscale feature attention module is proposed, which can extract the multiscale features of the input IR and RGB images separately. Meanwhile, channel attention and spatial pixel attention is introduced to focus the multiscale feature information of dual-modal images from both channel and pixel dimensions. Finally, a dual-modal feature fusion module is proposed to adaptively fuse the feature information of dual-modal images. On the large-scale dual-modal image dataset DroneVehicle, compared with the benchmark algorithm YOLOv5s using visible or infrared single-modal image detection, the proposed algorithm improves the detection accuracy by 13.42 and 2.27 percentage points, and the detection speed reaches 164 frame/s, with ultra-real-time end-to-end detection capability. The proposed algorithm effectively improves the robustness and accuracy of object detection in complex scenes, which has good application prospects. © 2024 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:233 / 242
相关论文
共 50 条
  • [21] Multispectral Object Detection Based on Multilevel Feature Fusion and Dual Feature Modulation
    Sun, Jin
    Yin, Mingfeng
    Wang, Zhiwei
    Xie, Tao
    Bei, Shaoyi
    ELECTRONICS, 2024, 13 (02)
  • [22] Dual-modal Physiological Feature Fusion-based Sleep Recognition Using CFS and RF Algorithm
    Zhang, Bing-Tao
    Wang, Xiao-Peng
    Shen, Yu
    Lei, Tao
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2019, 16 (03) : 286 - 296
  • [23] CoLA: Conditional Dropout and Language-Driven Robust Dual-Modal Salient Object Detection
    Hao, Shuang
    Zhong, Chunlin
    Tang, He
    COMPUTER VISION - ECCV 2024, PT XV, 2025, 15073 : 354 - 371
  • [24] Dual-modal Physiological Feature Fusion-based Sleep Recognition Using CFS and RF Algorithm
    Bing-Tao Zhang
    Xiao-Peng Wang
    Yu Shen
    Tao Lei
    International Journal of Automation and Computing, 2019, 16 : 286 - 296
  • [25] Adaptive multiscale feature for object detection
    Yu, Xiaoyong
    Wu, Siyuan
    Lu, Xiaoqiang
    Gao, Guilong
    NEUROCOMPUTING, 2021, 449 : 146 - 158
  • [26] A Non-Local Attention Feature Fusion Network for Multiscale Object Detection
    Wu, Xuke
    Xiong, Gang
    Tian, Bin
    Song, Bing
    Lu, Bo
    Liu, Sheng
    Zhu, Fenghua
    IEEE JOURNAL OF RADIO FREQUENCY IDENTIFICATION, 2022, 6 : 733 - 738
  • [27] Multiscale Feature Adaptive Fusion for Object Detection in Optical Remote Sensing Images
    Lv, Hao
    Qian, Weixing
    Chen, Tianxiao
    Yang, Han
    Zhou, Xuecheng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [28] Multiscale Feature Fusion for Salient Object Detection of Strip Steel Surface Defects
    Zhang, Li
    Li, Xirui
    Sun, Yange
    Feng, Yan
    Guo, Huaping
    IEEE ACCESS, 2025, 13 : 42689 - 42702
  • [29] Object Detection For Remote Sensing Image Based on Multiscale Feature Fusion Network
    Tian Tingting
    Yang Jun
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (16)
  • [30] MULTI-MODAL FEATURE FUSION NETWORK FOR GHOST IMAGING OBJECT DETECTION
    Hu, Nan
    Ma, Huimin
    Le, Chao
    Shao, Xuehui
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 351 - 355