Surgical Tool Pose Estimation from Monocular Endoscopic Videos

被引:0
|
作者
Kumar, Suren [1 ]
Sovizi, Javad [1 ]
Narayanan, Madusudanan Sathia [2 ]
Krovi, Venkat [1 ,3 ]
机构
[1] SUNY Buffalo, Dept Mech Engn, Buffalo, NY 14260 USA
[2] Northwestern Univ, Human Agil Lab, Evanston, IL USA
[3] SUNY Buffalo, Dept Obstet & Gynecol, Buffalo, NY 14260 USA
关键词
INSTRUMENTS; TRACKING;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Surgical tool pose estimation has been proven to be useful for high-and low-level feedback tasks including safety-enhancement, semantic feedback and surgical skill assessment. Tool pose estimation using monocular camera input is a well-studied research problem as the monocular camera is one of the ubiquitous sensor across the spectrum of robotic devices. Current state-of-the art methods for visual tool pose estimation are computationally expensive and require elaborate geometric and appearance models of surgical tools. We propose a visual tool pose estimation method that maps the visual bounding box to the 3D tool pose without any explicit knowledge of tool geometry using Gaussian process regression. The proposed approach can be generalized to any surgical tool and provides tool pose estimates with a variance estimate in real-time. We demonstrate rigorous evaluation of the method under various conditions that might effect the estimation process. In order to evaluate the algorithm, we have instrumented a standard box trainer kit with two laparoscopic tools to get simultaneous ground truth pose and a video feed.
引用
收藏
页码:598 / 603
页数:6
相关论文
共 50 条
  • [41] Revisiting Monocular Satellite Pose Estimation With Transformer
    Wang, Zi
    Zhang, Zhuo
    Sun, Xiaoliang
    Li, Zhang
    Yu, Qifeng
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (05) : 4279 - 4294
  • [42] Monocular Vision and Machine Learning for Pose Estimation
    Tran, Quang
    Choate, Jeffrey
    Taylor, Clark N.
    Nykl, Scott
    Curtis, David
    2023 IEEE/ION POSITION, LOCATION AND NAVIGATION SYMPOSIUM, PLANS, 2023, : 128 - 136
  • [43] Corner Matching Refinement for Monocular Pose Estimation
    Gamage, Dinesh
    Drummond, Tom
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [44] An Evaluation of YouTube Videos as a Surgical Instructional Tool for Endoscopic Endonasal Approaches in Otolaryngology
    de la Torre, Amy B.
    Joe, Stephanie
    Lee, Victoria S.
    ENT-EAR NOSE & THROAT JOURNAL, 2024, 103 (07) : NP440 - NP449
  • [45] Robust multi-label surgical tool classification in noisy endoscopic videos
    Qayyum, Adnan
    Ali, Hassan
    Caputo, Massimo
    Vohra, Hunaid
    Akinosho, Taofeek
    Abioye, Sofiat
    Berrou, Ilhem
    Capik, Pawel
    Qadir, Junaid
    Bilal, Muhammad
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [46] Learning Temporal Pose Estimation from Sparsely-Labeled Videos
    Bertasius, Gedas
    Feichtenhofer, Christoph
    Tran, Du
    Shi, Jianbo
    Torresani, Lorenzo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [47] Articulated pose estimation with tiny synthetic videos
    Park, Dennis
    Ramanan, Deva
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [48] Flowing ConvNets for Human Pose Estimation in Videos
    Pfister, Tomas
    Charles, James
    Zisserman, Andrew
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1913 - 1921
  • [49] A Self-Supervised Network-Based Smoke Removal and Depth Estimation for Monocular Endoscopic Videos
    Zhang, Guo
    Gao, Xinbo
    Meng, Hongying
    Pang, Yu
    Nie, Xixi
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (09) : 6547 - 6559
  • [50] Fast Human Pose Estimation in Compressed Videos
    Liu, Huan
    Liu, Wentao
    Chi, Zhixiang
    Wang, Yang
    Yu, Yuanhao
    Chen, Jun
    Tang, Jin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1390 - 1400