Multiscale Feature Fusion Approach for Dual-Modal Object Detection

Authors
Zhang, Rui [1]
Li, Yunchen [1]
Wang, Jiabao [1]
Chen, Yao [1]
Wang, Ziqi [1]
Li, Yang [1]
Affiliation
[1] College of Command and Control Engineering, Army Engineering University of PLA, Nanjing 210007, China
Keywords
Benchmarking - Feature extraction - Image enhancement - Image fusion - Image texture - Large datasets - Modal analysis - Object detection - Object recognition
DOI
10.3778/j.issn.1002-8331.2305-0412
Abstract
Object detection on visible-light images adapts poorly to complex lighting conditions such as low light, no light, and strong light, while object detection on infrared images is strongly affected by background noise. Infrared objects also lack color information and have weak texture features, which makes detection harder still. To address these problems, a dual-modal object detection approach is proposed that effectively fuses the features of visible and infrared images. A multiscale feature attention module extracts multiscale features from the input IR and RGB images separately. Channel attention and spatial pixel attention are introduced to focus the multiscale feature information of the dual-modal images along both the channel and pixel dimensions. Finally, a dual-modal feature fusion module adaptively fuses the feature information of the two modalities. On the large-scale dual-modal image dataset DroneVehicle, compared with the baseline algorithm YOLOv5s using visible-only or infrared-only detection, the proposed algorithm improves detection accuracy by 13.42 and 2.27 percentage points, respectively, and reaches a detection speed of 164 frames/s, giving it faster-than-real-time end-to-end detection capability. The proposed algorithm effectively improves the robustness and accuracy of object detection in complex scenes and has good application prospects. © 2024 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
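The abstract does not give the internals of the attention or fusion modules, so the following is only a minimal illustrative sketch of the general idea: reweight each modality's features along the channel and pixel dimensions, then blend the two modalities with data-dependent weights. It is not the authors' implementation. All function names are hypothetical, the weights are parameter-free stand-ins for what would be learned layers in a real network, and the multiscale extraction step is omitted. Feature maps are plain nested lists of shape C x H x W.

```python
import math


def sigmoid(x):
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(feat):
    """Scale each channel by the sigmoid of its global average.

    Assumption: a parameter-free stand-in for a learned channel-attention
    block; the paper's module is not specified in the abstract.
    """
    out = []
    for ch in feat:
        mean = sum(sum(row) for row in ch) / (len(ch) * len(ch[0]))
        w = sigmoid(mean)
        out.append([[v * w for v in row] for row in ch])
    return out


def pixel_attention(feat):
    """Scale each spatial position by the sigmoid of its cross-channel mean.

    Assumption: again parameter-free, for illustration only.
    """
    c, h, w = len(feat), len(feat[0]), len(feat[0][0])
    out = [[[0.0] * w for _ in range(h)] for _ in range(c)]
    for i in range(h):
        for j in range(w):
            weight = sigmoid(sum(feat[k][i][j] for k in range(c)) / c)
            for k in range(c):
                out[k][i][j] = feat[k][i][j] * weight
    return out


def adaptive_fuse(rgb, ir):
    """Blend two same-shaped feature maps channel by channel.

    The RGB weight is sigmoid(mean_rgb - mean_ir), which equals a two-way
    softmax over the channel means; the IR weight is its complement.
    Assumption: a hypothetical fusion rule, not the paper's module.
    """
    fused = []
    for ch_rgb, ch_ir in zip(rgb, ir):
        n = len(ch_rgb) * len(ch_rgb[0])
        m_rgb = sum(map(sum, ch_rgb)) / n
        m_ir = sum(map(sum, ch_ir)) / n
        a = sigmoid(m_rgb - m_ir)
        fused.append([
            [a * x + (1.0 - a) * y for x, y in zip(r_rgb, r_ir)]
            for r_rgb, r_ir in zip(ch_rgb, ch_ir)
        ])
    return fused


def attend_and_fuse(rgb, ir):
    """Apply channel then pixel attention to each modality, then fuse."""
    rgb_att = pixel_attention(channel_attention(rgb))
    ir_att = pixel_attention(channel_attention(ir))
    return adaptive_fuse(rgb_att, ir_att)


# Tiny usage example: one channel, 2 x 2 spatial grid per modality.
rgb_feat = [[[1.0, 1.0], [1.0, 1.0]]]
ir_feat = [[[0.0, 0.0], [0.0, 0.0]]]
fused_feat = attend_and_fuse(rgb_feat, ir_feat)
```

In this sketch the modality with the stronger average response in a channel receives the larger blending weight, which is one simple way to make fusion "adaptive"; a trained network would instead learn these weights from data.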
Pages: 233-242