Using full-scale feature fusion for self-supervised indoor depth estimation

被引:0
|
作者
Cheng, Deqiang [1 ]
Chen, Junhui [1 ]
Lv, Chen [1 ]
Han, Chenggong [1 ]
Jiang, He [1 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Peoples R China
基金
中国国家自然科学基金;
关键词
Monocular depth estimation; Feature fusion; Self-supervised; Indoor scenes; ResNeSt;
D O I
10.1007/s11042-023-16581-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Monocular depth estimation is a crucial task in computer vision, and self-supervised algorithms are gaining popularity due to their independence from expensive ground truth supervision. However, current self-supervised algorithms may not provide accurate estimation and may suffer from distorted boundaries when applied to indoor scenes. Combining multi-scale features is an important research direction in image segmentation to achieve accurate estimation and resolve boundary distortion. However, there are few studies on indoor self-supervised algorithms in this regard. To solve this issue, we propose a novel full-scale feature information fusion approach that includes a full-scale skip-connection and a full-scale feature fusion block. This approach can aggregate the high-level and low-level information of all scale feature maps during the network's encoding and decoding process to compensate for the network's loss of cross-layer feature information. The proposed full-scale feature fusion improves accuracy and reduces the decoder parameters. To fully exploit the superiority of the full-scale feature fusion module, we replace the encoder backbone from ResNet with the more advanced ResNeSt. Combining these two methods results in a significant improvement in prediction accuracy. We have extensively evaluated our approach on the indoor benchmark datasets NYU Depth V2 and ScanNet. Our experimental results demonstrate that our method outperforms existing algorithms, particularly on NYU Depth V2, where our precision is raised to 83.8%.
引用
收藏
页码:28215 / 28233
页数:19
相关论文
共 50 条
  • [21] F2Depth: Self-supervised indoor monocular depth estimation via optical flow consistency and feature map synthesis
    Guo, Xiaotong
    Zhao, Huijie
    Shao, Shuwei
    Li, Xudong
    Zhang, Baochang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [22] SPDepth: Enhancing Self-Supervised Indoor Monocular Depth Estimation via Self-Propagation
    Guo, Xiaotong
    Zhao, Huijie
    Shao, Shuwei
    Li, Xudong
    Zhang, Baochang
    Li, Na
    FUTURE INTERNET, 2024, 16 (10)
  • [23] Multi-Sensor Fusion Self-Supervised Deep Odometry and Depth Estimation
    Wan, Yingcai
    Zhao, Qiankun
    Guo, Cheng
    Xu, Chenlong
    Fang, Lijing
    REMOTE SENSING, 2022, 14 (05)
  • [24] MSDFNet: multi-scale detail feature fusion encoder-decoder network for self-supervised monocular thermal image depth estimation
    Kong, Lingjun
    Zheng, Qianhui
    Wang, Wenju
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (01)
  • [25] Distortion-Aware Self-Supervised Indoor 360°Depth Estimation via Hybrid Projection Fusion and Structural Regularities
    Wang, Xu
    Kong, Weifeng
    Zhang, Qiudan
    Yang, You
    Zhao, Tiesong
    Jiang, Jianmin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3998 - 4011
  • [26] Digging Into Self-Supervised Monocular Depth Estimation
    Godard, Clement
    Mac Aodha, Oisin
    Firman, Michael
    Brostow, Gabriel
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3827 - 3837
  • [27] Self-supervised monocular depth estimation in fog
    Tao, Bo
    Hu, Jiaxin
    Jiang, Du
    Li, Gongfa
    Chen, Baojia
    Qian, Xinbo
    OPTICAL ENGINEERING, 2023, 62 (03)
  • [28] On the uncertainty of self-supervised monocular depth estimation
    Poggi, Matteo
    Aleotti, Filippo
    Tosi, Fabio
    Mattoccia, Stefano
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3224 - 3234
  • [29] Revisiting Self-supervised Monocular Depth Estimation
    Kim, Ue-Hwan
    Lee, Gyeong-Min
    Kim, Jong-Hwan
    ROBOT INTELLIGENCE TECHNOLOGY AND APPLICATIONS 6, 2022, 429 : 336 - 350
  • [30] Self-supervised Indoor 360-Degree Depth Estimation via Structural Regularization
    Kong, Weifeng
    Zhang, Qiudan
    Yang, You
    Zhao, Tiesong
    Wu, Wenhui
    Wang, Xu
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 438 - 451