Two-Stage Deep Regression Enhanced Depth Estimation From a Single RGB Image

被引:8
|
作者
Sun, Jianyuan [1 ]
Wang, Zidong [4 ]
Yu, Hui [5 ]
Zhang, Shu [2 ]
Dong, Junyu [2 ,3 ]
Gao, Pengxiang [6 ]
机构
[1] Bournemouth Univ, Natl Ctr Comp Animat, Fac Media & Commun, Bournemouth BH12 5BB, Dorset, England
[2] Ocean Univ China, Coll Informat Sci & Engn, Qingdao 266100, Peoples R China
[3] Inst Adv Ocean Study, Qingdao 266100, Peoples R China
[4] Brunel Univ, Dept Comp Sci, West London UB8 3PH, England
[5] Univ Portsmouth, Sch Creat Technol, Portsmouth PO1 2DJ, Hants, England
[6] Qingdao Univ, Sch Data Sci & Software Engn, Qingdao 266071, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划; 英国工程与自然科学研究理事会;
关键词
Predictive models; Task analysis; Estimation; Network architecture; Residual neural networks; Computational modeling; Robot sensing systems; Depth prediction; a single RGB image; the rough depth map; neural networks; VISION;
D O I
10.1109/TETC.2020.3034559
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Depth estimation plays a significant role in industrial applications, e.g., augmented reality, robotic mapping and autonomous driving. Traditional approaches for capturing depth, such as laser or depth sensor based methods, are difficult to use in most scenarios due to the limitations of high system cost and limited operational conditions. As an inexpensive and convenient approach, using the computational models to estimate depth from a single RGB image offers a preferable way for the depth prediction. Although the design of computational models to estimate the depth map has been widely investigated, the majority of models suffers from low prediction accuracy due to the sole utilization of a one-stage regression strategy. Inspired by both theoretical and practical success of two-stage regression, we propose a two-stage deep regression model, which is composed of two state-of-the-art network architectures, i.e., the fully convolutional residual network (FCRN) and the conditional generation adversarial network (cGAN). FCRN has been proved to possess a strong prediction ability for depth prediction, but fine details in the depth map are still incomplete. Accordingly, we have improved the existing cGAN model to refine the FCRN-based depth prediction. The experimental results show that the proposed two-stage deep regression model outperforms existing state-of-the-art methods.
引用
收藏
页码:719 / 727
页数:9
相关论文
共 50 条
  • [1] Two-stage Model Fitting Approach for Human Body Shape Estimation from a Single Depth Image
    Oyama, Mei
    Kaneko, Naoshi
    Hayashi, Masaki
    Sumi, Kazuhiko
    Yoshida, Takeshi
    PROCEEDINGS OF THE FIFTEENTH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS - MVA2017, 2017, : 234 - 237
  • [2] Learning a Deep Regression Forest for Head Pose Estimation from a Single Depth Image
    Ma, Xiangtian
    Sang, Nan
    Xiao, Shihua
    Wang, Xupeng
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (08)
  • [3] Two-stage deep learning framework for occlusal crown depth image generation
    Roh, Junghyun
    Kim, Junhwi
    Lee, Jimin
    Computers in Biology and Medicine, 2024, 183
  • [4] Counterfactual Depth from a Single RGB Image
    Issaranon, Theerasit
    Zou, Chuhang
    Forsyth, David
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2129 - 2138
  • [5] Deep Depth Completion of a Single RGB-D Image
    Zhang, Yinda
    Funkhouser, Thomas
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 175 - 185
  • [6] A Two-Stage Estimation Method for Depth Estimation of Facial Landmarks
    Gong, Xun
    Fu, Zehua
    Li, Xinxin
    Feng, Lin
    2015 IEEE INTERNATIONAL CONFERENCE ON IDENTITY, SECURITY AND BEHAVIOR ANALYSIS (ISBA), 2015,
  • [7] Two-stage deep image restoration network with application to single image shadow removal
    Yeh, Chia-Hung
    Zhan, Zhi-Xiang
    Kang, Li-Wei
    APPLIED SOFT COMPUTING, 2024, 167
  • [8] Attention Unet++ for lightweight depth estimation from sparse depth samples and a single RGB image
    Tao Zhao
    Shuguo Pan
    Wang Gao
    Chao Sheng
    Yingchun Sun
    Jiansheng Wei
    The Visual Computer, 2022, 38 : 1619 - 1630
  • [9] Contextualized CNN for Scene-Aware Depth Estimation From Single RGB Image
    Song, Wenfeng
    Li, Shuai
    Liu, Ji
    Hao, Aimin
    Zhao, Qinping
    Qin, Hong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (05) : 1220 - 1233
  • [10] Two-stage single image Deblurring network based on deblur kernel estimation
    Lu, Ying Cheng
    Liu, Tzu Pu
    Lin, Chang Hong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (11) : 17055 - 17074