Hierarchical Imitation Learning via Subgoal Representation Learning for Dynamic Treatment Recommendation

被引:8
|
作者
Wang, Lu [1 ,2 ]
Tang, Ruiming [2 ]
He, Xiaofeng [1 ]
He, Xiuqiang [2 ]
机构
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Huawei Noahs Ark Lab, Hong Kong, Peoples R China
关键词
Dynamic Treatment Recommendation; Hierarchical Imitation Learning; Subgoal Representation Learning;
D O I
10.1145/3488560.3498535
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dynamic Treatment Recommendation (DTR) is a sequence of tailored treatment decision rules which can be grouped as individual sub-tasks. As the reward signals in DTR are hard to design, Imitation Learning (IL) has achieved great success as it is effective in mimicking doctors' behaviors from their demonstrations without explicit reward signals. As a patient may have several different symptoms, the behaviors in doctors' demonstrations can often be grouped to handle individual symptoms. However, a single flat policy learned by IL is difficult to mimic doctors' demonstrations with such hierarchical structure, where low-level behaviors are switching from one symptom to another controlled by high-level decisions. Due to this observation, we consider Hierarchical Imitation Learning methods as good solutions for DTR. In this paper, we propose a novel Subgoal conditioned HIL framework (short for SHIL), where a high-level policy sequentially sets a subgoal for each sub-task without prior knowledge, and the low-level policy for sub-tasks is learned to reach the subgoal. To get rid of prior knowledge, a self-supervised learning method is proposed to learn an effective representation for each subgoal. More specifically, we carefully designed to encourage diverse representations among different subgoals. To demonstrate that SHIL is able to learn meaningful high-level policy and low-level policy that accurately reproduces complex doctors' demonstrations, we conduct experiments on a real-world medical data from health care domain, MIMIC-III. Compared with state-of-the-art baselines, SHIL improves the likelihood of patient survival by a significant margin and provides explainable recommendation with hierarchical structure.
引用
收藏
页码:1081 / 1089
页数:9
相关论文
共 50 条
  • [31] Learning and Assessing Optimal Dynamic Treatment Regimes Through Cooperative Imitation Learning
    Shah, Syed Ihtesham Hussain
    Coronato, Antonio
    Naeem, Muddasar
    De Pietro, Giuseppe
    IEEE ACCESS, 2022, 10 : 78148 - 78158
  • [32] Learning and Assessing Optimal Dynamic Treatment Regimes Through Cooperative Imitation Learning
    Shah, Syed Ihtesham Hussain
    Coronato, Antonio
    Naeem, Muddasar
    De Pietro, Giuseppe
    IEEE Access, 2022, 10 : 78148 - 78158
  • [33] Robotic Manipulation with Reinforcement Learning, State Representation Learning, and Imitation Learning
    Chen, Hanxiao
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15769 - 15770
  • [34] Disentangled Representation Learning for Recommendation
    Wang, Xin
    Chen, Hong
    Zhou, Yuwei
    Ma, Jianxin
    Zhu, Wenwu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 408 - 424
  • [35] Dynamic interest modeling via dual learning for recommendation
    Jian, Meng
    Yang, Ran
    Wang, Xinling
    Wu, Lifang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 34373 - 34392
  • [36] Dynamic interest modeling via dual learning for recommendation
    Meng Jian
    Ran Yang
    Xinling Wang
    Lifang Wu
    Multimedia Tools and Applications, 2024, 83 : 34373 - 34392
  • [37] Hierarchical spatio-temporal morphable models for representation of complex movements for imitation learning
    Ilg, W
    Bakir, GH
    Franz, MO
    Giese, MA
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS 2003, VOL 1-3, 2003, : 453 - 458
  • [38] Third-Person Visual Imitation Learning via Decoupled Hierarchical Controller
    Sharma, Pratyusha
    Pathak, Deepak
    Gupta, Abhinav
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [39] End-to-End Hierarchical Reinforcement Learning With Integrated Subgoal Discovery
    Pateria, Shubham
    Subagdja, Budhitama
    Tan, Ah-Hwee
    Quek, Chai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7778 - 7790
  • [40] Connect-based subgoal discovery for options in hierarchical reinforcement learning
    Chen, Fei
    Gao, Yang
    Chen, Shifu
    Ma, Zhenduo
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2007, : 698 - +