Multi-scale, multi-dimensional binocular endoscopic image depth estimation network

被引:0
|
作者
Wang, Xiongzhi [1 ,2 ]
Nie, Yunfeng [3 ]
Ren, Wenqi [5 ]
Wei, Min [4 ]
Zhang, Jingang [1 ,2 ]
机构
[1] Univ Chinese Acad Sci, Sch Future Technol, Beijing 100039, Peoples R China
[2] Xidian Univ, Sch Aerosp Science&Technol, Xian 710071, Peoples R China
[3] Vrije Univ Brussel & Flanders Make, Dept Appl Phys & Photon, Brussel Photon, B-1050 Brussels, Belgium
[4] Chinese Acad Sci, State Key Lab Informat Secur, Inst Informat Engn, Beijing 100093, Peoples R China
[5] Chinese Peoples Liberat Army Gen Hosp, Med Ctr 4, Dept Orthoped, Beijing 100853, Peoples R China
基金
中国国家自然科学基金;
关键词
Depth estimation; Endoscopic datasets; Convolutional neural network; Stereoscopic vision; STEREO; COLONOSCOPY; LESIONS;
D O I
10.1016/j.compbiomed.2023.107305
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
During invasive surgery, the use of deep learning techniques to acquire depth information from lesion sites in real-time is hindered by the lack of endoscopic environmental datasets. This work aims to develop a high-accuracy three-dimensional (3D) simulation model for generating image datasets and acquiring depth information in real-time. Here, we proposed an end-to-end multi-scale supervisory depth estimation network (MMDENet) model for the depth estimation of pairs of binocular images. The proposed MMDENet highlights a multi-scale feature extraction module incorporating contextual information to enhance the correspondence precision of poorly exposed regions. A multi-dimensional information-guidance refinement module is also proposed to refine the initial coarse disparity map. Statistical experimentation demonstrated a 3.14% reduction in endpoint error compared to state-of-the-art methods. With a processing time of approximately 30fps, satisfying the requirements of real-time operation applications. In order to validate the performance of the trained MMDENet in actual endoscopic images, we conduct both qualitative and quantitative analysis with 93.38% high precision, which holds great promise for applications in surgical navigation.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Multi-Scale and Multi-Dimensional Thermal Modeling of Lithium-Ion Batteries
    Gwak, Geonhui
    Ju, Hyunchul
    ENERGIES, 2019, 12 (03)
  • [42] Age Estimation by Multi-scale Convolutional Network
    Yi, Dong
    Lei, Zhen
    Li, Stan Z.
    COMPUTER VISION - ACCV 2014, PT III, 2015, 9005 : 144 - 158
  • [43] Multi-scale Spatial Propagation Network for Depth Completion
    Wu, Zhenyu
    Wang, Haiyang
    Deng, Xiangyu
    2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS, ICCCR 2024, 2024, : 151 - 156
  • [44] Monocular Depth Estimation Based on Multi-Scale Depth Map Fusion
    Yang, Xin
    Chang, Qingling
    Liu, Xinglin
    He, Siyuan
    Cui, Yan
    IEEE ACCESS, 2021, 9 : 67696 - 67705
  • [45] DEPTH ESTIMATION OF MULTI-MODAL SCENE BASED ON MULTI-SCALE MODULATION
    Wang, Anjie
    Fang, Zhijun
    Jiang, Xiaoyan
    Gao, Yongbin
    Cao, Gaofeng
    Ma, Siwei
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2795 - 2799
  • [46] Depth Map Prediction from a Single Image using a Multi-Scale Deep Network
    Eigen, David
    Puhrsch, Christian
    Fergus, Rob
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [47] Multi-Scale Mutual Feature Convolutional Neural Network for Depth Image Denoise and Enhancement
    Liao, Xuan
    Zhang, Xin
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [48] Saliency Driven Monocular Depth Estimation Based on Multi-scale Graph Convolutional Network
    Wu, Dunquan
    Chen, Chenglizhao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 445 - 456
  • [49] Multi-Scale Dilated Convolution Network Based Depth Estimation in Intelligent Transportation Systems
    Tian, Yanling
    Zhang, Qieshi
    Ren, Ziliang
    Wu, Fuxiang
    Hao, Pengyi
    Hu, Jinglu
    IEEE ACCESS, 2019, 7 : 185179 - 185188
  • [50] Endoscopic Image Retrieval System Using Multi-scale Image Features
    Chowdhury, Manish
    Kundu, Malay Kumar
    PERCEPTION AND MACHINE INTELLIGENCE, 2015, 2015, : 64 - 70