Multiscale Feature Fusion Approach for Dual-Modal Object Detection

被引:0
|
作者
Zhang, Rui [1 ]
Li, Yunchen [1 ]
Wang, Jiabao [1 ]
Chen, Yao [1 ]
Wang, Ziqi [1 ]
Li, Yang [1 ]
机构
[1] College of Command and Control Engineering, Army Engineering University of PLA, Nanjing,210007, China
关键词
Benchmarking - Feature extraction - Image enhancement - Image fusion - Image texture - Large datasets - Modal analysis - Object detection - Object recognition;
D O I
10.3778/j.issn.1002-8331.2305-0412
中图分类号
学科分类号
摘要
Object detection based on visible images is difficult to adapt to complex lighting conditions such as low light, no light, strong light, etc., while object detection based on infrared images is greatly affected by background noise. Infrared objects lack color information and have weak texture features, which pose a greater challenge. To address these problems, a dual-modal object detection approach that can effectively fuse the features of visible and infrared dual-modal images is proposed. A multiscale feature attention module is proposed, which can extract the multiscale features of the input IR and RGB images separately. Meanwhile, channel attention and spatial pixel attention is introduced to focus the multiscale feature information of dual-modal images from both channel and pixel dimensions. Finally, a dual-modal feature fusion module is proposed to adaptively fuse the feature information of dual-modal images. On the large-scale dual-modal image dataset DroneVehicle, compared with the benchmark algorithm YOLOv5s using visible or infrared single-modal image detection, the proposed algorithm improves the detection accuracy by 13.42 and 2.27 percentage points, and the detection speed reaches 164 frame/s, with ultra-real-time end-to-end detection capability. The proposed algorithm effectively improves the robustness and accuracy of object detection in complex scenes, which has good application prospects. © 2024 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:233 / 242
相关论文
共 50 条
  • [1] Object Detection Algorithm Based on Dual-modal Fusion Network
    Sun Ying
    Hou Zhiqiang
    Yang Chen
    Ma Sugang
    Fan Jiulun
    ACTA PHOTONICA SINICA, 2023, 52 (01)
  • [2] Airfield concrete pavement joint detection network based on dual-modal feature fusion
    Yuan, Bo
    Sun, Zhaoyun
    Pei, Lili
    Li, Wei
    Hu, Yuanjiao
    AL-Soswa, Mohammed
    AUTOMATION IN CONSTRUCTION, 2023, 151
  • [3] RGBT dual-modal Siamese tracking network with feature fusion
    Shen Y.
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2021, 50 (03):
  • [4] Object Detection Algorithm with Dual-Modal Rectification Fusion Based on Self-Guided Attention
    Zhang, Jinglei
    Gong, Wenhao
    Jia, Xin
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (09): : 793 - 805
  • [5] An object detection algorithm based on infrared-visible dual modal feature fusion
    Hou, Zhiqiang
    Yang, Chen
    Sun, Ying
    Ma, Sugang
    Yang, Xiaobao
    Fan, Jiulun
    INFRARED PHYSICS & TECHNOLOGY, 2024, 137
  • [6] Object Detection Algorithm Based on CNN-Transformer Dual Modal Feature Fusion
    Yang Chen
    Hou Zhiqiang
    Li Xinyue
    Ma Sugang
    Yang Xiaobao
    ACTA PHOTONICA SINICA, 2024, 53 (03)
  • [7] Electromagnetic Modulation Signal Classification Using Dual-Modal Feature Fusion CNN
    Bai, Jiansheng
    Yao, Jinjie
    Qi, Juncheng
    Wang, Liming
    ENTROPY, 2022, 24 (05)
  • [8] Dual-modal edible oil impurity dataset for weak feature detection
    Wang, Huiyu
    Chen, Qianghua
    Zhao, Jianding
    Xu, Liwen
    Li, Ming
    Zhao, Ying
    Zhao, Qinpei
    Lu, Qin
    SCIENTIFIC DATA, 2024, 11 (01)
  • [9] Multiscale Feature Fusion and Anchor Adaptive Object Detection Algorithm
    Zhang Runmei
    Bi Lijun
    Wang Fangbin
    Yuan Bin
    Luo Gu'an
    Jiang Huaizhen
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (12)
  • [10] IMAGE FUSION NETWORK FOR DUAL-MODAL RESTORATION
    Zhang, Ying
    Ren, Xuhua
    Clifford, Bryan Alexander
    Wang, Qian
    Zhang, Xiaoqun
    INVERSE PROBLEMS AND IMAGING, 2021, 15 (06) : 1409 - 1419