Evaluation of single-stage vision models for pose estimation of surgical instruments

被引:2
|
作者
Burton, William [1 ]
Myers, Casey [1 ]
Rutherford, Matthew [2 ]
Rullkoetter, Paul [1 ]
机构
[1] Univ Denver, Ctr Orthopaed Biomech, 2155 E Wesley Ave, Denver, CO 80210 USA
[2] Univ Denver, Unmanned Syst Res Inst, 2155 E Wesley Ave, Denver, CO 80210 USA
关键词
Deep learning; Machine learning; Computer vision; Pose estimation; Surgical instruments; Open surgery; Surgical data science; OBJECTIVE STRUCTURED ASSESSMENT; OPERATING-ROOM EFFICIENCY; HEAD-MOUNTED DISPLAY; AUGMENTED REALITY; SURGERY; SKILLS; RECOGNITION; COSTS; TOOLS; VIDEO;
D O I
10.1007/s11548-023-02890-6
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Purpose Multiple applications in open surgical environments may benefit from adoption of markerless computer vision depending on associated speed and accuracy requirements. The current work evaluates vision models for 6-degree of freedom pose estimation of surgical instruments in RGB scenes. Potential use cases are discussed based on observed performance. Methods Convolutional neural nets were developed with simulated training data for 6-degree of freedom pose estimation of a representative surgical instrument in RGB scenes. Trained models were evaluated with simulated and real-world scenes. Real-world scenes were produced by using a robotic manipulator to procedurally generate a wide range of object poses. Results CNNs trained in simulation transferred to real-world evaluation scenes with a mild decrease in pose accuracy. Model performance was sensitive to input image resolution and orientation prediction format. The model with highest accuracy demonstrated mean in-plane translation error of 13 mm and mean long axis orientation error of 5 degrees in simulated evaluation scenes. Similar errors of 29 mm and 8 degrees were observed in real-world scenes. Conclusion 6-DoF pose estimators can predict object pose in RGB scenes with real-time inference speed. Observed pose accuracy suggests that applications such as coarse-grained guidance, surgical skill evaluation, or instrument tracking for tray optimization may benefit from markerless pose estimation.
引用
收藏
页码:2125 / 2142
页数:18
相关论文
共 50 条
  • [1] Evaluation of single-stage vision models for pose estimation of surgical instruments
    William Burton
    Casey Myers
    Matthew Rutherford
    Paul Rullkoetter
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 2125 - 2142
  • [2] RetinaHand: Towards Accurate Single-Stage Hand Pose Estimation
    Xiao, Zilong
    Lin, Luojun
    Yang, Yuanxi
    Yu, Yuanlong
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 639 - 647
  • [3] Single-Stage 6D Object Pose Estimation
    Hu, Yinlin
    Fua, Pascal
    Wang, Wei
    Salzmann, Mathieu
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2927 - 2936
  • [4] FDNet: Feature decoupling for single-stage pose estimation in complex scenes
    Wang, Qianqian
    Liu, Qiong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [5] RealHePoNet: a robust single-stage ConvNet for head pose estimation in the wild
    Berral-Soler, Rafael
    Madrid-Cuevas, Francisco J.
    Munoz-Salinas, Rafael
    Marin-Jimenez, Manuel J.
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (13): : 7673 - 7689
  • [6] RealHePoNet: a robust single-stage ConvNet for head pose estimation in the wild
    Rafael Berral-Soler
    Francisco J. Madrid-Cuevas
    Rafael Muñoz-Salinas
    Manuel J. Marín-Jiménez
    Neural Computing and Applications, 2021, 33 : 7673 - 7689
  • [7] Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation
    Wang, Zitian
    Nie, Xuecheng
    Qu, Xiaochao
    Chen, Yunpeng
    Liu, Si
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13086 - 13095
  • [8] Spacecraft Homography Pose Estimation with Single-Stage Deep Convolutional Neural Network
    Chen, Shengpeng
    Yang, Wenyi
    Wang, Wei
    Mai, Jianting
    Liang, Jian
    Zhang, Xiaohu
    SENSORS, 2024, 24 (06)
  • [9] A Compact and Powerful Single-Stage Network for Multi-Person Pose Estimation
    Xiao, Yabo
    Wang, Xiaojuan
    He, Mingshu
    Jin, Lei
    Song, Mei
    Zhao, Jian
    ELECTRONICS, 2023, 12 (04)
  • [10] Rethinking the Person Localization for Single-Stage Multi-Person Pose Estimation
    Jin, Lei
    Wang, Xiaojuan
    Nie, Xuecheng
    Wang, Wendong
    Guo, Yandong
    Yan, Shuicheng
    Zhao, Jian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1436 - 1447