Hierarchical Attention-Based Sensor Fusion Strategy for Depth Estimation in Diverse Weather

被引:1
|
作者
Xiong, Mengchen [1 ]
Xu, Xiao [1 ]
Yang, Dong [1 ]
Seguel, Fabian [1 ]
Steinbach, Eckehard [1 ]
机构
[1] Tech Univ Munich, Sch Computat Informat & Technol, Dept Comp Engn, Chair Media Technol,Munich Inst Robot & Machine In, D-80333 Munich, Germany
关键词
Adaptive sensor fusion; depth estimation; diverse weather; attention mechanism; LASER; LIDAR;
D O I
10.1142/S1793351X23500022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a hierarchical attention-based sensor fusion strategy for depth estimation under various weather conditions. Multiple-sensor fusion is proven as a promising solution for predicting accurate depth maps in diverse weather conditions, especially for extreme weather conditions. However, most of the current studies simply fuse the information from different sensors without jointly considering the difference in their performance at the sensor level and feature level. To fill this gap, our hierarchical attention-based fusion strategy uses two attention mask-generation modules that weigh sensor data from branches (i.e. different sensors) and features. With the cooperation of these two masks, our system is able to determine the adaptive contribution of each sensor as well as the individual contribution of each feature in the sensor regarding their performance in different weather. We compare the proposed methods with the baseline, i.e. the late fusion Sparse-to-Dense model, and two extended models individually with the branch-wise-only and feature-wise-only masks. The results show a robust and superior performance of our methods even in clear environments where the baseline already performs well enough. Moreover, we investigate the performance of RGB camera, radar, and LiDAR in foggy environments comprehensively by visualizing the generated mask. Our results show a significantly increased importance of radar sensors in extreme weather conditions, e.g. dense fog.
引用
收藏
页码:455 / 475
页数:21
相关论文
共 50 条
  • [31] Residual Attention-based Fusion for Video Classification
    Pouyanfar, Samira
    Wang, Tianyi
    Chen, Shu-Ching
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 478 - 480
  • [32] Attention-Based Multimodal Fusion for Video Description
    Hori, Chiori
    Hori, Takaaki
    Lee, Teng-Yok
    Zhang, Ziming
    Harsham, Bret
    Hershey, John R.
    Marks, Tim K.
    Sumi, Kazuhiko
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4203 - 4212
  • [33] Illumination Insensitive Monocular Depth Estimation Based on Scene Object Attention and Depth Map Fusion
    Wen, Jing
    Ma, Haojiang
    Yang, Jie
    Zhang, Songsong
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X, 2024, 14434 : 358 - 370
  • [34] Attention-based hierarchical denoised deep clustering network
    Dong, Yongfeng
    Wang, Ziqiu
    Du, Jiapeng
    Fang, Weidong
    Li, Linhao
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (01): : 441 - 459
  • [35] Attention-based Image Compression in Sensor Assembly
    Meier, Sven
    Erkan, Acelya
    Thielen, Nils
    Klarmann, Steffen
    Franke, Jorg
    2022 IEEE 28TH INTERNATIONAL SYMPOSIUM FOR DESIGN AND TECHNOLOGY IN ELECTRONIC PACKAGING (SIITME), 2022, : 136 - 141
  • [36] Attention-based hierarchical denoised deep clustering network
    Yongfeng Dong
    Ziqiu Wang
    Jiapeng Du
    Weidong Fang
    Linhao Li
    World Wide Web, 2023, 26 : 441 - 459
  • [37] H-Net: Unsupervised Attention-based Stereo Depth Estimation Leveraging Epipolar Geometry
    Huang, Baoru
    Zheng, Jian-Qing
    Giannarou, Stamatia
    Elson, Daniel S.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4459 - 4466
  • [38] A hierarchical contextual attention-based network for sequential recommendation
    Cui, Qiang
    Wu, Shu
    Huang, Yan
    Wang, Liang
    NEUROCOMPUTING, 2019, 358 : 141 - 149
  • [39] Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation
    Yan, Jiaxing
    Zhao, Hong
    Bu, Penghui
    Jin, YuSheng
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 464 - 473
  • [40] AbHE: All Attention-Based Homography Estimation
    Huo, Mingxiao
    Zhang, Zhihao
    Ren, Xinyang
    Yang, Xianqiang
    Ye, Chao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 11