Interactive Surgical Training in Neuroendoscopy: Real-Time Anatomical Feature Localization Using Natural Language Expressions

被引:2
|
作者
Matasyoh, Nevin M. [1 ]
Schmidt, Ruediger [2 ,3 ]
Zeineldin, Ramy A. [1 ]
Spetzger, Uwe [4 ,5 ]
Mathis-Ullrich, Franziska [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Dept Artificial Intelligence Biomed Engn, D-91052 Erlangen, Germany
[2] Klinikum Karlsruhe, Dept Neurosurg, Karlsruhe, Germany
[3] Klin Hirslanden, Ctr Endoscop & Minimally Invas Neurosurg, Zurich, Switzerland
[4] Karlsruhe Inst Technol, Inst Anthropomat & Robot, Karlsruhe, Germany
[5] Dept Neurosurg, Aachen, Germany
关键词
Surgery; Training; Neurosurgery; Biomedical imaging; Transformers; Visualization; Task analysis; Anatomical feature localization; endoscopic third ventriculostomy; feature fusion; multimodal deep learning; neuroendoscopy; surgical training; transformer;
D O I
10.1109/TBME.2024.3405814
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
- Objective: This study addresses challenges in surgical education, particularly in neuroendoscopy, where the demand for optimized workflow conflicts with the need for trainees' active participation in surgeries. To overcome these challenges, we propose a framework that accurately identifies anatomical structures within images guided by language descriptions, facilitating authentic and interactive learning experiences in neuroendoscopy. Methods: Utilizing the encoder-decoder architecture of a conventional transformer, our framework processes multimodal inputs (images and language descriptions) to identify and localize features in neuroendoscopic images. We curate a dataset from recorded endoscopic third ventriculostomy (ETV) procedures for training and evaluation. Utilizing evaluation metrics, including "R@n," "IoU=theta," "mIoU," and top-1 accuracy, we systematically benchmark our framework against state-of-the-art methodologies. Results: The framework demonstrates excellent generalization, surpassing the compared methods with 93.67% % accuracy and 76.08% % mIoU on unseen data. It also exhibits better computational speed compared with other methods. Qualitative results affirms the framework's effectiveness in precise localization of referred anatomical features within neuroendoscopic images. Conclusion: The framework's adeptness at localizing anatomical features using language descriptions positions it as a valuable tool for integration into future interactive clinical learning systems, enhancing surgical training in neuroendoscopy. Significance: The exemplary performance reinforces the framework's potential in enhancing surgical education, leading to improved skills and outcomes for trainees in neuroendoscopy.
引用
收藏
页码:2991 / 2999
页数:9
相关论文
共 50 条
  • [1] Tearing of Membranes for Interactive Real-Time Surgical Training
    Grimm, Johannes
    MEDICINE MEETS VIRTUAL REALITY 13: THE MAGICAL NEXT BECOMES THE MEDICAL NOW, 2005, 111 : 153 - 159
  • [3] A real-time interactive surgical simulator for catheter navigation
    Lim, HL
    Shetty, BR
    Chui, CK
    Wang, YP
    Cai, YY
    SURGICAL-ASSIST SYSTEMS, PROCEEDINGS OF, 1998, 3262 : 4 - 14
  • [4] LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
    De la Torre, Fernanda
    Fang, Cathy Mengying
    Huang, Han
    Banburski-Fahey, Andrzej
    Fernandez, Judith Amores
    Lanier, Jaron
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024), 2024,
  • [5] REAL-TIME GENERATION OF NATURAL-LANGUAGE
    PATTEN, T
    STOOPS, DS
    IEEE EXPERT-INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1991, 6 (05): : 15 - 22
  • [6] A System for Real-Time Interactive Analysis of Deep Learning Training
    Shah, Shital
    Fernandez, Roland
    Drucker, Steven
    PROCEEDINGS OF THE ACM SIGCHI SYMPOSIUM ON ENGINEERING INTERACTIVE COMPUTING SYSTEMS (EICS'19), 2019,
  • [7] A real-time interactive biofeedback system for sports training and rehabilitation
    Alahakone, A. U.
    Senanayake, A.
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART P-JOURNAL OF SPORTS ENGINEERING AND TECHNOLOGY, 2010, 224 (P2) : 181 - 190
  • [8] Real-time and Authentic Blood Simulation for Surgical Training
    Xiao, Changsheng
    Feng, Yuanjing
    Li, Yongqiang
    Zeng, Qingrun
    Zhang, Jun
    Wu, Ye
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 6832 - 6837
  • [9] A real-time natural language command interpreter for robots
    Nair, SB
    Prasad, PB
    SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 3972 - 3974
  • [10] A FINITE AND REAL-TIME PROCESSOR FOR NATURAL-LANGUAGE
    BLANK, GD
    COMMUNICATIONS OF THE ACM, 1989, 32 (10) : 1174 - 1189