SABV-Depth: A biologically inspired deep learning network for monocular depth estimation

被引:10
|
作者
Wang, Junfan [1 ,2 ]
Chen, Yi [1 ,2 ]
Dong, Zhekang [1 ,2 ,3 ]
Gao, Mingyu [1 ,2 ]
Lin, Huipin [1 ,2 ]
Miao, Qiheng [4 ]
机构
[1] Hangzhou Dianzi Univ, Sch Elect Informat, Hangzhou 310018, Zhejiang, Peoples R China
[2] Zhejiang Prov Key Lab Equipment Elect, Hangzhou 310018, Zhejiang, Peoples R China
[3] Zhejiang Univ, Dept Elect Engn, Hangzhou 310027, Zhejiang, Peoples R China
[4] Zhejiang Huaruijie Technol Co Ltd, Hangzhou 310051, Zhejiang, Peoples R China
关键词
Depth estimation; Biological vision; Mapping relationship; Self -attention mechanism; VISION; MODEL; CONSCIOUSNESS;
D O I
10.1016/j.knosys.2023.110301
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monocular depth estimation makes it possible for machines to perceive the real world. The prediction performance of the depth estimation network based on deep learning will be affected due to the depth of the deep network and the locality of convolution operations. The imitation of the biological visual system and its functional structure is becoming a research hotspot. In this paper, we study the interpretability relationship between the biological visual system and the monocular depth estimation network. By concretizing the attention mechanism in biological vision, we propose a monocular depth estimation network based on the self-attention mechanism, named SABV-Depth, which can improve prediction accuracy. Inspired by the biological visual interaction mechanism, we focus on the information transfer between each module of the network and improve the information retention ability, and enable the network to output a depth map with rich object information and detailed information. Further, a decoder module with an inner-connection is proposed to recover depth maps with sharp edge contours. Our method is experimentally validated on the KITTI dataset and NYU Depth V2 dataset. The results show that compared with other works, the proposed method improves prediction accuracy. Meanwhile, the depth map has more object information and detail information, and a better edge information processing effect. (c) 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND
引用
收藏
页数:14
相关论文
共 50 条
  • [1] ROBUST LEARNING FOR DEEP MONOCULAR DEPTH ESTIMATION
    Irie, Go
    Kawanishi, Takahito
    Kashino, Kunio
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 964 - 968
  • [2] Deep learning for monocular depth estimation: A review
    Ming, Yue
    Meng, Xuyang
    Fan, Chunxiao
    Yu, Hui
    NEUROCOMPUTING, 2021, 438 : 14 - 33
  • [3] Deep Ordinal Regression Network for Monocular Depth Estimation
    Fu, Huan
    Gong, Mingming
    Wang, Chaohui
    Batmanghelich, Kayhan
    Tao, Dacheng
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2002 - 2011
  • [4] Monocular depth estimation based on deep learning: An overview
    ZHAO ChaoQiang
    SUN Qi Yu
    ZHANG ChongZhen
    TANG Yang
    QIAN Feng
    Science China(Technological Sciences), 2020, (09) : 1612 - 1627
  • [5] Monocular depth estimation based on deep learning: An overview
    ZHAO ChaoQiang
    SUN Qi Yu
    ZHANG ChongZhen
    TANG Yang
    QIAN Feng
    Science China(Technological Sciences), 2020, 63 (09) : 1612 - 1627
  • [6] Deep Learning Based Monocular Depth Estimation: A Survey
    Jiang J.-J.
    Li Z.-Y.
    Liu X.-M.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (06): : 1276 - 1307
  • [7] Monocular Depth Estimation Based on Deep Learning:A Survey
    Ruan Xiaogang
    Yan Wenjing
    Huang Jing
    Guo Peiyuan
    Guo Wei
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2436 - 2440
  • [8] Monocular depth estimation based on deep learning: An overview
    ChaoQiang Zhao
    QiYu Sun
    ChongZhen Zhang
    Yang Tang
    Feng Qian
    Science China Technological Sciences, 2020, 63 : 1612 - 1627
  • [9] Monocular depth estimation based on deep learning: An overview
    Zhao, ChaoQiang
    Sun, QiYu
    Zhang, ChongZhen
    Tang, Yang
    Qian, Feng
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2020, 63 (09) : 1612 - 1627
  • [10] Monocular Depth Estimation Using Deep Learning: A Review
    Masoumian, Armin
    Rashwan, Hatem A.
    Cristiano, Julian
    Asif, M. Salman
    Puig, Domenec
    SENSORS, 2022, 22 (14)