CNNapsule: A Lightweight Network with Fusion Features for Monocular Depth Estimation

被引:1
|
作者
Wang, Yinchu [1 ]
Zhu, Haijiang [1 ]
Liu, Mengze [2 ]
机构
[1] Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing, Peoples R China
[2] PetroChina Jidong Oilfield Co, Tangshan, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Monocular depth estimation; Matrix capsule; Fusion block;
D O I
10.1007/978-3-030-86362-3_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Depth estimation from 2D images is a fundamental task for many applications, for example, robotics and 3D reconstruction. Because of the weak ability to perspective transformation, the existing CNN methods have limited generalization performance and large number of parameters. To solve these problems, we propose CNNapsule network for monocular depth estimation. Firstly, we extract CNN and Matrix Capsule features. Next, we propose a Fusion Block to combine the CNN with Matrix Capsule features. Then the skip connections are used to transmit the extracted and fused features. Moreover, we design the loss function with the consideration of long-tailed distribution, gradient and structural similarity. At last, we compare our method with the existing methods on NYU Depth V2 dataset. The experiment shows that our method has higher accuracy than the traditional methods and similar networks without pre-trained. Compared with the state-of-the-art, the trainable parameters of our method decrease by 65%. In the test experiment of images collected in the Internet and real images collected by mobile phone, the generalization performance of our method is further verified.
引用
收藏
页码:507 / 518
页数:12
相关论文
共 50 条
  • [41] Deep Ordinal Regression Network for Monocular Depth Estimation
    Fu, Huan
    Gong, Mingming
    Wang, Chaohui
    Batmanghelich, Kayhan
    Tao, Dacheng
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2002 - 2011
  • [42] GBNet: Gradient Boosting Network for Monocular Depth Estimation
    Han, Daechan
    Choi, Yukyung
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 342 - 346
  • [43] Monocular depth estimation with spatially coherent sliced network
    Su, Wen
    Zhang, Haifeng
    Su, Yuan
    Yu, Jun
    Wang, Zengfu
    IMAGE AND VISION COMPUTING, 2022, 124
  • [44] Multi-scale depth classification network for monocular depth estimation
    Yang, Yi
    Tian, Lihua
    Li, Chen
    Zhang, Botong
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102
  • [45] Monocular Depth Estimation Based on Dilated Convolutions and Feature Fusion
    Li, Hang
    Liu, Shuai
    Wang, Bin
    Wu, Yuanhao
    APPLIED SCIENCES-BASEL, 2024, 14 (13):
  • [46] Double Refinement Network for Efficient Monocular Depth Estimation
    Durasov, Nikita
    Romanov, Mikhail
    Bubnova, Valeriya
    Bogomolov, Pavel
    Konushin, Anton
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 5889 - 5894
  • [47] Neural Contourlet Network for Monocular 360° Depth Estimation
    Shen, Zhijie
    Lin, Chunyu
    Nie, Lang
    Liao, Kang
    Zhao, Yao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8574 - 8585
  • [48] LD-Net: A Lightweight Network for Real-Time Self-Supervised Monocular Depth Estimation
    Xiong, Mingkang
    Zhang, Zhenghong
    Zhang, Tao
    Xiong, Huilin
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 882 - 886
  • [49] A tightly-coupled dense monocular Visual-Inertial Odometry system with lightweight depth estimation network
    Wang, Xin
    Zhang, Zuoming
    Li, Luchen
    APPLIED SOFT COMPUTING, 2025, 171
  • [50] Deep Monocular Depth Estimation Based on Content and Contextual Features
    Abdulwahab, Saddam
    Rashwan, Hatem A.
    Sharaf, Najwa
    Khalid, Saif
    Puig, Domenec
    SENSORS, 2023, 23 (06)