Depth estimation from single-shot monocular endoscope image using image domain adaptation and edge-aware depth estimation

Cited by: 8
Authors
Oda, Masahiro [1, 2]
Itoh, Hayato [2]
Tanaka, Kiyohito [3]
Takabatake, Hirotsugu [4]
Mori, Masaki [5]
Natori, Hiroshi [6]
Mori, Kensaku [1, 2, 7]
Affiliations
[1] Nagoya Univ, Informat & Commun, Nagoya, Aichi, Japan
[2] Nagoya Univ, Grad Sch Informat, Nagoya, Aichi, Japan
[3] Kyoto Second Red Cross Hosp, Dept Gastroenterol, Kyoto, Japan
[4] Sapporo Minami Sanjo Hosp, Dept Resp Med, Sapporo, Hokkaido, Japan
[5] Sapporo Kosei Gen Hosp, Dept Resp Med, Sapporo, Hokkaido, Japan
[6] Keiwakai Nishioka Hosp, Dept Resp Med, Sapporo, Hokkaido, Japan
[7] Natl Inst Informat, Res Ctr Med Bigdata, Tokyo, Japan
Funding
Japan Science and Technology Agency;
Keywords
Depth estimation; single-shot monocular endoscopic image; Lambertian surface translation; RECONSTRUCTION; REFLECTION; NAVIGATION;
DOI
10.1080/21681163.2021.2012835
Chinese Library Classification number
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
We propose a method for estimating depth from a single-shot monocular endoscopic image. It combines Lambertian surface translation by domain adaptation with depth estimation trained using a multi-scale edge loss. The estimation is a two-step process: Lambertian surface translation learned from unpaired data, followed by depth estimation. Texture and specular reflections on the surface of an organ reduce the accuracy of depth estimation, so we apply Lambertian surface translation to the endoscopic image to remove them. We then estimate the depth using a fully convolutional network (FCN). When training the FCN, improving the similarity of object edges between the estimated depth image and the ground-truth depth image is important for obtaining better results, so we introduce a multi-scale edge loss function to improve the accuracy of depth estimation. We quantitatively evaluated the proposed method using real colonoscopic images. The estimated depth values were proportional to the real depth values. Furthermore, we applied the estimated depth images to automated anatomical location identification of colonoscopic images using a convolutional neural network. Using the estimated depth images improved the identification accuracy of the network from 69.2% to 74.1%.
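The abstract does not give the form of the multi-scale edge loss, so the following is only a minimal NumPy sketch of one plausible reading: compare edge maps of the estimated and ground-truth depth images at several resolutions and average the differences. The function names (`edge_map`, `downsample`, `multi_scale_edge_loss`), the finite-difference edge detector, the average-pooling downsampler, the L1 distance, and the scale factors are all assumptions made for illustration, not details taken from the paper.

```python
import numpy as np


def edge_map(img):
    """Gradient magnitude from finite differences (assumed edge detector)."""
    gy, gx = np.gradient(img.astype(np.float64))
    return np.hypot(gx, gy)


def downsample(img, factor):
    """Average-pool a 2D image by an integer factor, cropping any remainder."""
    h, w = img.shape
    h2, w2 = (h // factor) * factor, (w // factor) * factor
    cropped = img[:h2, :w2]
    blocks = cropped.reshape(h2 // factor, factor, w2 // factor, factor)
    return blocks.mean(axis=(1, 3))


def multi_scale_edge_loss(pred, target, factors=(1, 2, 4)):
    """Mean L1 distance between edge maps, averaged over several scales.

    The scale factors and the L1 distance are illustrative assumptions.
    """
    total = 0.0
    for f in factors:
        p = downsample(pred, f)
        t = downsample(target, f)
        total += np.mean(np.abs(edge_map(p) - edge_map(t)))
    return total / len(factors)


# Toy depth images: a vertical step edge versus a flat surface.
step = np.zeros((16, 16))
step[:, 8:] = 1.0
flat = np.zeros((16, 16))

loss_same = multi_scale_edge_loss(step, step)  # identical edges
loss_diff = multi_scale_edge_loss(flat, step)  # missing edge is penalized
```

In a training setting, a term of this kind would typically be added to a per-pixel depth loss and written in a differentiable framework; the NumPy version here only illustrates the computation.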
Pages: 266-273
Number of pages: 8