Evaluation of single-stage vision models for pose estimation of surgical instruments

被引：2

作者：

Burton, William ^{[1
]}

Myers, Casey ^{[1
]}

Rutherford, Matthew ^{[2
]}

Rullkoetter, Paul ^{[1
]}

机构：

[1] Univ Denver, Ctr Orthopaed Biomech, 2155 E Wesley Ave, Denver, CO 80210 USA

[2] Univ Denver, Unmanned Syst Res Inst, 2155 E Wesley Ave, Denver, CO 80210 USA

来源：

INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY | 2023年 / 18卷 / 12期

关键词：

Deep learning; Machine learning; Computer vision; Pose estimation; Surgical instruments; Open surgery; Surgical data science; OBJECTIVE STRUCTURED ASSESSMENT; OPERATING-ROOM EFFICIENCY; HEAD-MOUNTED DISPLAY; AUGMENTED REALITY; SURGERY; SKILLS; RECOGNITION; COSTS; TOOLS; VIDEO;

D O I：

10.1007/s11548-023-02890-6

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Purpose Multiple applications in open surgical environments may benefit from adoption of markerless computer vision depending on associated speed and accuracy requirements. The current work evaluates vision models for 6-degree of freedom pose estimation of surgical instruments in RGB scenes. Potential use cases are discussed based on observed performance. Methods Convolutional neural nets were developed with simulated training data for 6-degree of freedom pose estimation of a representative surgical instrument in RGB scenes. Trained models were evaluated with simulated and real-world scenes. Real-world scenes were produced by using a robotic manipulator to procedurally generate a wide range of object poses. Results CNNs trained in simulation transferred to real-world evaluation scenes with a mild decrease in pose accuracy. Model performance was sensitive to input image resolution and orientation prediction format. The model with highest accuracy demonstrated mean in-plane translation error of 13 mm and mean long axis orientation error of 5 degrees in simulated evaluation scenes. Similar errors of 29 mm and 8 degrees were observed in real-world scenes. Conclusion 6-DoF pose estimators can predict object pose in RGB scenes with real-time inference speed. Observed pose accuracy suggests that applications such as coarse-grained guidance, surgical skill evaluation, or instrument tracking for tray optimization may benefit from markerless pose estimation.

引用

页码：2125 / 2142

页数：18

共 50 条

[1] Evaluation of single-stage vision models for pose estimation of surgical instruments
William Burton
Casey Myers
Matthew Rutherford
Paul Rullkoetter
International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 2125 - 2142
[2] RetinaHand: Towards Accurate Single-Stage Hand Pose Estimation
Xiao, Zilong
Lin, Luojun
Yang, Yuanxi
Yu, Yuanlong
ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 639 - 647
[3] Single-Stage 6D Object Pose Estimation
Hu, Yinlin
Fua, Pascal
Wang, Wei
Salzmann, Mathieu
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2927 - 2936
[4] FDNet: Feature decoupling for single-stage pose estimation in complex scenes
Wang, Qianqian
Liu, Qiong
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
[5] RealHePoNet: a robust single-stage ConvNet for head pose estimation in the wild
Berral-Soler, Rafael
Madrid-Cuevas, Francisco J.
Munoz-Salinas, Rafael
Marin-Jimenez, Manuel J.
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (13): : 7673 - 7689
[6] RealHePoNet: a robust single-stage ConvNet for head pose estimation in the wild
Rafael Berral-Soler
Francisco J. Madrid-Cuevas
Rafael Muñoz-Salinas
Manuel J. Marín-Jiménez
Neural Computing and Applications, 2021, 33 : 7673 - 7689
[7] Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation
Wang, Zitian
Nie, Xuecheng
Qu, Xiaochao
Chen, Yunpeng
Liu, Si
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13086 - 13095
[8] Spacecraft Homography Pose Estimation with Single-Stage Deep Convolutional Neural Network
Chen, Shengpeng
Yang, Wenyi
Wang, Wei
Mai, Jianting
Liang, Jian
Zhang, Xiaohu
SENSORS, 2024, 24 (06)
[9] A Compact and Powerful Single-Stage Network for Multi-Person Pose Estimation
Xiao, Yabo
Wang, Xiaojuan
He, Mingshu
Jin, Lei
Song, Mei
Zhao, Jian
ELECTRONICS, 2023, 12 (04)
[10] Rethinking the Person Localization for Single-Stage Multi-Person Pose Estimation
Jin, Lei
Wang, Xiaojuan
Nie, Xuecheng
Wang, Wendong
Guo, Yandong
Yan, Shuicheng
Zhao, Jian
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1436 - 1447

← 1 2 3 4 5 →