Multiscale Feature Fusion Approach for Dual-Modal Object Detection

被引:0
|
作者
Zhang, Rui [1 ]
Li, Yunchen [1 ]
Wang, Jiabao [1 ]
Chen, Yao [1 ]
Wang, Ziqi [1 ]
Li, Yang [1 ]
机构
[1] College of Command and Control Engineering, Army Engineering University of PLA, Nanjing,210007, China
关键词
Benchmarking - Feature extraction - Image enhancement - Image fusion - Image texture - Large datasets - Modal analysis - Object detection - Object recognition;
D O I
10.3778/j.issn.1002-8331.2305-0412
中图分类号
学科分类号
摘要
Object detection based on visible images is difficult to adapt to complex lighting conditions such as low light, no light, strong light, etc., while object detection based on infrared images is greatly affected by background noise. Infrared objects lack color information and have weak texture features, which pose a greater challenge. To address these problems, a dual-modal object detection approach that can effectively fuse the features of visible and infrared dual-modal images is proposed. A multiscale feature attention module is proposed, which can extract the multiscale features of the input IR and RGB images separately. Meanwhile, channel attention and spatial pixel attention is introduced to focus the multiscale feature information of dual-modal images from both channel and pixel dimensions. Finally, a dual-modal feature fusion module is proposed to adaptively fuse the feature information of dual-modal images. On the large-scale dual-modal image dataset DroneVehicle, compared with the benchmark algorithm YOLOv5s using visible or infrared single-modal image detection, the proposed algorithm improves the detection accuracy by 13.42 and 2.27 percentage points, and the detection speed reaches 164 frame/s, with ultra-real-time end-to-end detection capability. The proposed algorithm effectively improves the robustness and accuracy of object detection in complex scenes, which has good application prospects. © 2024 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:233 / 242
相关论文
共 50 条
  • [41] Multiscale object detection in remote sensing image by combining data fusion and feature selection
    Qin, Dengda
    Wan, Li
    He, Peien
    Zhang, Yi
    Guo, Ya
    Chen, Jie
    National Remote Sensing Bulletin, 2022, 26 (08) : 1662 - 1673
  • [42] Deformable Convolution-Guided Multiscale Feature Learning and Fusion for UAV Object Detection
    Shi, Ya
    Wang, Chenyi
    Xu, Shengjun
    Yuan, Ming-Dong
    Liu, Feixiang
    Zhang, Lele
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [43] MFFSODNet: Multiscale Feature Fusion Small Object Detection Network for UAV Aerial Images
    Jiang, Lingjie
    Yuan, Baoxi
    Du, Jiawei
    Chen, Boyu
    Xie, Hanfei
    Tian, Juan
    Yuan, Ziqi
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 14
  • [44] Dual-modal nanoplatform integrated with smartphone for hierarchical diabetic detection
    Huang, Lin
    Zhou, Yan
    Zhu, Yuexing
    Su, Haiyang
    Yang, Shouzhi
    Feng, Lei
    Zhao, Liang
    Liu, Shanrong
    Qian, Kun
    BIOSENSORS & BIOELECTRONICS, 2022, 210
  • [45] Enhanced Multi-Scale Feature Cross-Fusion Network for Impedance-Optical Dual-Modal Imaging
    Liu, Zhe
    Zhao, Renjie
    Anderson, Graham
    Bagnaninchi, Pierre-Olivier
    Yang, Yunjie
    IEEE SENSORS JOURNAL, 2023, 23 (05) : 4455 - 4465
  • [46] Dual-Modal Approach for Ship Detection: Fusing Synthetic Aperture Radar and Optical Satellite Imagery
    Ahmed, Mahmoud
    El-Sheimy, Naser
    Leung, Henry
    SENSORS, 2025, 25 (02)
  • [47] Sequential Feature Fusion for Object Detection
    Wang, Qiang
    Han, Yahong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 689 - 699
  • [48] Dual-modal Physiological Feature Fusion-based Sleep Recognition Using CFS and RF Algorithm附视频
    BingTao Zhang
    XiaoPeng Wang
    Yu Shen
    Tao Lei
    International Journal of Automation and Computing, 2019, (03) : 286 - 296
  • [49] Language-Guided Dual-Modal Local Correspondence for Single Object Tracking
    Yu, Jun
    Cai, Zhongpeng
    Li, Yihao
    Wang, Lei
    Gao, Fang
    Yu, Ye
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 10637 - 10650
  • [50] A Dual-Modal Fusion Network Using Optical Coherence Tomography and Fundus Images in Detection of Glaucomatous Optic Neuropathy
    Xu, Yongli
    Sun, Run
    Hu, Man
    Zeng, Hui
    CURRENT EYE RESEARCH, 2024, 49 (12) : 1253 - 1259