Real-Time and High-Accuracy Switchable Stereo Depth Estimation Method Utilizing Self-Supervised Online Learning Mechanism for MIS

被引:0
|
作者
Zheng, Jieyu [1 ,2 ]
Li, Xiaojian [1 ,2 ]
Wang, Xin [3 ,4 ]
Wu, Haojun [3 ,4 ]
Li, Ling [1 ,2 ]
Ma, Xiang [1 ,2 ]
Yang, Shanlin [1 ,2 ]
机构
[1] Hefei Univ Technol, Sch Management, Key Lab Proc Optimizat & Intelligent Decis Making, Minist Educ, Hefei 230009, Peoples R China
[2] Hefei Univ Technol, Philosophy & Social Sci Lab Data Sci & Smart Soc G, Minist Educ, Hefei 230009, Peoples R China
[3] Sichuan Univ, West China Hosp, Dept Gen Surg, Div Pancreat Surg, Chengdu 610041, Sichuan, Peoples R China
[4] Sichuan Univ, West China Hosp, Dept Gen Surg, Div Biliary Surg, Chengdu, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
3-D laparoscopy; accuracy-speed tradeoff; intraoperative measurement; self-supervised online learning; stereo depth estimation;
D O I
10.1109/TIM.2024.3406835
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In minimally invasive surgery (MIS), clinicians often rely on 2-D laparoscopic images to assess the size and distances between internal structures. However, this subjective estimation introduces significant uncertainty and can increase surgical risks while reducing efficiency. Modern 3-D laparoscopes offer improved stereoscopic perception and can incorporate stereo depth estimation methods for quantitative analysis. However, existing methods struggle with real-time and high-accuracy demands in diverse surgical scenarios. To address this issue, we propose a novel intraoperative stereo depth estimation framework termed metainitialized online learning (MIOL), aiming to assist surgeons in quantitatively controlling surgical targets during the procedure. This framework features two switchable modes and does not require annotated data. One mode enables rapid depth recovery through surgical videos, providing real-time 3-D reconstruction to help surgeons understand in vivo structures. The other mode achieves high-precision measurements of critical tissues in fixed frames, assisting in surgical decision-making. Our approach employs self-supervised adaptation to train a model specific to each stereo image, eliminating the need for generalization and achieving outstanding accuracy. The framework establishes a lightweight network that converges rapidly under self-supervised losses and incorporates meta-learning pretraining, sparse optical flow guidance, and effective region identification to ensure speed and accuracy. Extensive experiments on two public datasets demonstrate the superiority of our method over existing approaches. Furthermore, we implement an intraoperative measurement system and conduct clinical trials, confirming its practical utility. The code is available at https://github.com/Darcy-vision/MIOL.
引用
收藏
页数:13
相关论文
共 39 条
  • [1] REAL-TIME COARSE-TO-FINE DEPTH ESTIMATION ON STEREO ENDOSCOPIC IMAGES WITH SELF-SUPERVISED LEARNING
    Yang, Haotian
    Kahrs, Lueder A.
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 733 - 737
  • [2] Real-Time Self-Supervised Monocular Depth Estimation Without GPU
    Poggi, Matteo
    Tosi, Fabio
    Aleotti, Filippo
    Mattoccia, Stefano
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 17342 - 17353
  • [3] Stereo Depth Estimation via Self-supervised Contrastive Representation Learning
    Tukra, Samyakh
    Giannarou, Stamatia
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VII, 2022, 13437 : 604 - 614
  • [4] Real-Time and Accurate Self-Supervised Monocular Depth Estimation on Mobile Device
    Cai, Hong
    Yin, Fei
    Singhal, Tushar
    Pendyam, Sandeep
    Noorzad, Parham
    Zhu, Yinhao
    Nguyen, Khoi
    Matai, Janarbek
    Ramaswamy, Bharath
    Mayer, Frank
    Patel, Chirag
    Khobare, Abhijit
    Porikli, Fatih
    NEURIPS 2021 COMPETITIONS AND DEMONSTRATIONS TRACK, VOL 176, 2021, 176 : 308 - 313
  • [5] LD-Net: A Lightweight Network for Real-Time Self-Supervised Monocular Depth Estimation
    Xiong, Mingkang
    Zhang, Zhenghong
    Zhang, Tao
    Xiong, Huilin
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 882 - 886
  • [6] Switchable-Encoder-Based Self-Supervised Learning Framework for Monocular Depth and Pose Estimation
    Kim, Junoh
    Gao, Rui
    Park, Jisun
    Yoon, Jinsoo
    Cho, Kyungeun
    REMOTE SENSING, 2023, 15 (24)
  • [7] Real-time self-supervised tone curve estimation for HDR image
    Chen, Xiyu
    Jia, Tao
    Xiao, Jiangjian
    Zhuang, Jiayan
    COMPUTERS & GRAPHICS-UK, 2023, 115 : 461 - 471
  • [8] Self-supervised real-time depth restoration for consumer-grade sensors
    Duarte, Alexandre
    Fernandes, Francisco
    Pereira, Joao M.
    Moreira, Catarina
    Nascimento, Jacinto C.
    Jorge, Joaquim
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (04)
  • [9] Self-supervised stereo depth estimation based on bi-directional pixel-movement learning
    Wang, Huachun
    Sang, Xinzhu
    Chen, Duo
    Wang, Peng
    Ye, Xiaoqian
    Qi, Shuai
    Yan, Binbin
    APPLIED OPTICS, 2022, 61 (07) : D7 - D14
  • [10] A High-Accuracy Clock Synchronization Method in Distributed Real-Time System
    Li, Hongliang
    Feng, Xuan
    Shi, Song
    Zheng, Fang
    Xie, Xianghui
    COMPUTER ENGINEERING AND TECHNOLOGY, NCCET 2014, 2015, 491 : 148 - 157