Model-Based Offline Reinforcement Learning for Autonomous Delivery of Guidewire

Times cited: 0
Authors
Li, Hao [1]
Zhou, Xiao-Hu [1]
Xie, Xiao-Liang [1]
Liu, Shi-Qi [1]
Feng, Zhen-Qiu [1]
Gui, Mei-Jiang [1]
Xiang, Tian-Yu [1]
Huang, De-Xing [1]
Hou, Zeng-Guang [1]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100190, Peoples R China
Source
Funding
National Natural Science Foundation of China;
Keywords
Data models; Training; Arteries; Reinforcement learning; Instruments; Catheters; Predictive models; Offline reinforcement learning; deep neural network; vascular robotic system; robot assisted intervention; PERCUTANEOUS CORONARY INTERVENTION;
DOI
10.1109/TMRB.2024.3407349
Chinese Library Classification (CLC) number
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
Guidewire delivery is a fundamental procedure in percutaneous coronary intervention. The inherent flexibility of the guidewire makes precise control difficult, so operators need long-term training and substantial expertise. In response, this paper proposes a novel offline reinforcement learning (RL) algorithm, Conservative Offline Reinforcement Learning with Variational Environment Model (CORVE), for autonomous guidewire delivery. CORVE first trains an environment model on offline data and then optimizes the policy with both offline and model-generated data. The method shares one encoder among the environment model, the policy, and the Q-function, which mitigates the sample inefficiency common in image-based RL. In addition, CORVE uses model prediction errors to forecast erroneous deliveries at inference time, a capability absent from existing methods. Experimental results show that CORVE outperforms existing methods in guidewire delivery, achieving notably higher success rates and smoother movements. These findings suggest that CORVE holds significant potential for enhancing the autonomy of vascular robotic systems in clinical settings.
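The abstract states that model prediction errors are used to forecast wrong deliveries at inference time. As a rough illustration only, and not the authors' implementation, that idea can be sketched as a threshold on the distance between what a learned model predicted and what was actually observed. The function names, the choice of Euclidean distance, and the threshold value are all assumptions made for this sketch; the record does not specify how CORVE computes or uses the error.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def prediction_error(predicted: Vector, observed: Vector) -> float:
    """Euclidean distance between a predicted and an observed feature vector.

    Hypothetical helper: the distance metric is an assumption of this sketch.
    """
    if len(predicted) != len(observed):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((p - o) ** 2 for p, o in zip(predicted, observed)))


def flag_risky_steps(
    predictions: Sequence[Vector],
    observations: Sequence[Vector],
    threshold: float,
) -> List[int]:
    """Return the indices of steps whose prediction error exceeds the threshold.

    Hypothetical helper: a large error is read as a sign that the system has
    left the region the model was trained on, so the step is flagged.
    """
    return [
        step
        for step, (pred, obs) in enumerate(zip(predictions, observations))
        if prediction_error(pred, obs) > threshold
    ]


if __name__ == "__main__":
    # Toy feature vectors; the last observation deviates from the prediction.
    preds = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]
    obs = [[0.0, 0.0], [1.0, 1.0], [5.0, 6.0]]
    print(flag_risky_steps(preds, obs, threshold=1.0))
```

In practice the threshold would have to be calibrated on held-out data, and a flagged step would hand control back to a human operator rather than be acted on automatically; both points are design assumptions here, not claims from the abstract.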
Pages: 1054-1062
Page count: 9