Hierarchical Imitation Learning via Subgoal Representation Learning for Dynamic Treatment Recommendation

Cited by: 8
Authors
Wang, Lu [1, 2]
Tang, Ruiming [2 ]
He, Xiaofeng [1 ]
He, Xiuqiang [2 ]
Affiliations
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Huawei Noahs Ark Lab, Hong Kong, Peoples R China
Keywords
Dynamic Treatment Recommendation; Hierarchical Imitation Learning; Subgoal Representation Learning
DOI
10.1145/3488560.3498535
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Dynamic Treatment Recommendation (DTR) is a sequence of tailored treatment decision rules that can be grouped into individual sub-tasks. Because reward signals in DTR are hard to design, Imitation Learning (IL) has achieved great success: it mimics doctors' behaviors from their demonstrations without explicit reward signals. Because a patient may have several different symptoms, the behaviors in doctors' demonstrations can often be grouped by the symptom they address. However, a single flat policy learned by IL struggles to mimic demonstrations with such hierarchical structure, in which high-level decisions control how low-level behaviors switch from one symptom to another. Based on this observation, we consider Hierarchical Imitation Learning (HIL) methods a good solution for DTR. In this paper, we propose a novel Subgoal-conditioned HIL framework (SHIL for short), in which a high-level policy sequentially sets a subgoal for each sub-task without prior knowledge, and the low-level policy for sub-tasks is learned to reach the subgoal. To remove the need for prior knowledge, we propose a self-supervised learning method that learns an effective representation for each subgoal. More specifically, the method is designed to encourage diverse representations among different subgoals. To demonstrate that SHIL learns meaningful high-level and low-level policies that accurately reproduce complex doctors' demonstrations, we conduct experiments on MIMIC-III, a real-world medical dataset from the health care domain. Compared with state-of-the-art baselines, SHIL improves the likelihood of patient survival by a significant margin and provides explainable recommendations with hierarchical structure.
Pages: 1081-1089
Page count: 9