Hierarchical Imitation Learning via Subgoal Representation Learning for Dynamic Treatment Recommendation

被引：8

作者：

Wang, Lu ^{[1
,2
]}

Tang, Ruiming ^{[2
]}

He, Xiaofeng ^{[1
]}

He, Xiuqiang ^{[2
]}

机构：

[1] East China Normal Univ, Shanghai, Peoples R China

[2] Huawei Noahs Ark Lab, Hong Kong, Peoples R China

来源：

WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING | 2022年

关键词：

Dynamic Treatment Recommendation; Hierarchical Imitation Learning; Subgoal Representation Learning;

D O I：

10.1145/3488560.3498535

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Dynamic Treatment Recommendation (DTR) is a sequence of tailored treatment decision rules which can be grouped as individual sub-tasks. As the reward signals in DTR are hard to design, Imitation Learning (IL) has achieved great success as it is effective in mimicking doctors' behaviors from their demonstrations without explicit reward signals. As a patient may have several different symptoms, the behaviors in doctors' demonstrations can often be grouped to handle individual symptoms. However, a single flat policy learned by IL is difficult to mimic doctors' demonstrations with such hierarchical structure, where low-level behaviors are switching from one symptom to another controlled by high-level decisions. Due to this observation, we consider Hierarchical Imitation Learning methods as good solutions for DTR. In this paper, we propose a novel Subgoal conditioned HIL framework (short for SHIL), where a high-level policy sequentially sets a subgoal for each sub-task without prior knowledge, and the low-level policy for sub-tasks is learned to reach the subgoal. To get rid of prior knowledge, a self-supervised learning method is proposed to learn an effective representation for each subgoal. More specifically, we carefully designed to encourage diverse representations among different subgoals. To demonstrate that SHIL is able to learn meaningful high-level policy and low-level policy that accurately reproduces complex doctors' demonstrations, we conduct experiments on a real-world medical data from health care domain, MIMIC-III. Compared with state-of-the-art baselines, SHIL improves the likelihood of patient survival by a significant margin and provides explainable recommendation with hierarchical structure.

引用

页码：1081 / 1089

页数：9

共 50 条

[21] Hierarchical reinforcement learning with dynamic recurrent mechanism for course recommendation
Lin, Yuanguo
Lin, Fan
Zeng, Wenhua
Xiahou, Jianbing
Li, Li
Wu, Pengcheng
Liu, Yong
Miao, Chunyan
KNOWLEDGE-BASED SYSTEMS, 2022, 244
[22] Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
Kim, Junsu
Seo, Younggyo
Shin, Jinwoo
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[23] Graph Contrastive Learning via Hierarchical Multiview Enhancement for Recommendation
Liu, Zhi
Xiang, Hengjing
Liang, Ruxia
Xiang, Jinhai
Wen, Chaodong
Liu, Sannyuya
Sun, Jianwen
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2025, 21 (03) : 2403 - 2412
[24] Adversarial Cooperative Imitation Learning for Dynamic Treatment Regimes
Wang, Lu
Yu, Wenchao
Cheng, Wei
Min, Martin Renqiang
Zong, Bo
He, Xiaofeng
Zha, Hongyuan
Wang, Wei
Chen, Haifeng
WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 1785 - 1795
[25] Learning Topological Representation for Networks via Hierarchical Sampling
Fu, Guoji
Hou, Chengbin
Yao, Xin
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[26] Session-Based Recommendation via Hierarchical Graph Learning
Yu, Li
Gao, Zihao
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT I, ICIC 2024, 2024, 14875 : 464 - 475
[27] Interpretable Motion Planner for Urban Driving via Hierarchical Imitation Learning
Wang, Bikun
Wang, Zhipeng
Zhu, Chenhao
Zhang, Zhiqiang
Wang, Zhichen
Lin, Penghong
Liu, Jingchu
Zhang, Qian
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 1691 - 1696
[28] Hierarchical Imitation Learning for Stochastic Environments
Igl, Maximilian
Shah, Punit
Mougin, Paul
Srinivasan, Sirish
Gupta, Tarun
White, Brandyn
Shiarlis, Kyriacos
Whiteson, Shimon
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 1697 - 1704
[29] Active Imitation Learning of Hierarchical Policies
Hamidi, Mandana
Tadepalli, Prasad
Goetschalckx, Robby
Fern, Alan
PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3554 - 3560
[30] Anchor: The achieved goal to replace the subgoal for hierarchical reinforcement learning
Li, Ruijia
Cai, Zhiling
Huang, Tianyi
Zhu, William
KNOWLEDGE-BASED SYSTEMS, 2021, 225

← 1 2 3 4 5 →