Hierarchical Imitation Learning via Subgoal Representation Learning for Dynamic Treatment Recommendation

被引:8
|
作者
Wang, Lu [1 ,2 ]
Tang, Ruiming [2 ]
He, Xiaofeng [1 ]
He, Xiuqiang [2 ]
机构
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Huawei Noahs Ark Lab, Hong Kong, Peoples R China
关键词
Dynamic Treatment Recommendation; Hierarchical Imitation Learning; Subgoal Representation Learning;
D O I
10.1145/3488560.3498535
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dynamic Treatment Recommendation (DTR) is a sequence of tailored treatment decision rules which can be grouped as individual sub-tasks. As the reward signals in DTR are hard to design, Imitation Learning (IL) has achieved great success as it is effective in mimicking doctors' behaviors from their demonstrations without explicit reward signals. As a patient may have several different symptoms, the behaviors in doctors' demonstrations can often be grouped to handle individual symptoms. However, a single flat policy learned by IL is difficult to mimic doctors' demonstrations with such hierarchical structure, where low-level behaviors are switching from one symptom to another controlled by high-level decisions. Due to this observation, we consider Hierarchical Imitation Learning methods as good solutions for DTR. In this paper, we propose a novel Subgoal conditioned HIL framework (short for SHIL), where a high-level policy sequentially sets a subgoal for each sub-task without prior knowledge, and the low-level policy for sub-tasks is learned to reach the subgoal. To get rid of prior knowledge, a self-supervised learning method is proposed to learn an effective representation for each subgoal. More specifically, we carefully designed to encourage diverse representations among different subgoals. To demonstrate that SHIL is able to learn meaningful high-level policy and low-level policy that accurately reproduces complex doctors' demonstrations, we conduct experiments on a real-world medical data from health care domain, MIMIC-III. Compared with state-of-the-art baselines, SHIL improves the likelihood of patient survival by a significant margin and provides explainable recommendation with hierarchical structure.
引用
收藏
页码:1081 / 1089
页数:9
相关论文
共 50 条
  • [21] Hierarchical reinforcement learning with dynamic recurrent mechanism for course recommendation
    Lin, Yuanguo
    Lin, Fan
    Zeng, Wenhua
    Xiahou, Jianbing
    Li, Li
    Wu, Pengcheng
    Liu, Yong
    Miao, Chunyan
    KNOWLEDGE-BASED SYSTEMS, 2022, 244
  • [22] Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
    Kim, Junsu
    Seo, Younggyo
    Shin, Jinwoo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [23] Graph Contrastive Learning via Hierarchical Multiview Enhancement for Recommendation
    Liu, Zhi
    Xiang, Hengjing
    Liang, Ruxia
    Xiang, Jinhai
    Wen, Chaodong
    Liu, Sannyuya
    Sun, Jianwen
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2025, 21 (03) : 2403 - 2412
  • [24] Adversarial Cooperative Imitation Learning for Dynamic Treatment Regimes
    Wang, Lu
    Yu, Wenchao
    Cheng, Wei
    Min, Martin Renqiang
    Zong, Bo
    He, Xiaofeng
    Zha, Hongyuan
    Wang, Wei
    Chen, Haifeng
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 1785 - 1795
  • [25] Learning Topological Representation for Networks via Hierarchical Sampling
    Fu, Guoji
    Hou, Chengbin
    Yao, Xin
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [26] Session-Based Recommendation via Hierarchical Graph Learning
    Yu, Li
    Gao, Zihao
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT I, ICIC 2024, 2024, 14875 : 464 - 475
  • [27] Interpretable Motion Planner for Urban Driving via Hierarchical Imitation Learning
    Wang, Bikun
    Wang, Zhipeng
    Zhu, Chenhao
    Zhang, Zhiqiang
    Wang, Zhichen
    Lin, Penghong
    Liu, Jingchu
    Zhang, Qian
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 1691 - 1696
  • [28] Hierarchical Imitation Learning for Stochastic Environments
    Igl, Maximilian
    Shah, Punit
    Mougin, Paul
    Srinivasan, Sirish
    Gupta, Tarun
    White, Brandyn
    Shiarlis, Kyriacos
    Whiteson, Shimon
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 1697 - 1704
  • [29] Active Imitation Learning of Hierarchical Policies
    Hamidi, Mandana
    Tadepalli, Prasad
    Goetschalckx, Robby
    Fern, Alan
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3554 - 3560
  • [30] Anchor: The achieved goal to replace the subgoal for hierarchical reinforcement learning
    Li, Ruijia
    Cai, Zhiling
    Huang, Tianyi
    Zhu, William
    KNOWLEDGE-BASED SYSTEMS, 2021, 225