Hierarchical reinforcement learning with dynamic recurrent mechanism for course recommendation

被引:22
|
作者
Lin, Yuanguo [1 ]
Lin, Fan [1 ]
Zeng, Wenhua [1 ]
Xiahou, Jianbing [1 ]
Li, Li [1 ]
Wu, Pengcheng [2 ]
Liu, Yong [2 ]
Miao, Chunyan [2 ,3 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen, Peoples R China
[2] Nanyang Technol Univ, Joint NTU UBC Res Ctr Excellence Act Living Elder, Singapore, Singapore
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Recommender systems; Hierarchical reinforcement learning; Course recommendation; Policy gradient; SYSTEM;
D O I
10.1016/j.knosys.2022.108546
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In online learning scenarios, the learners usually hope to find courses that meet their preferences and the needs for their future developments. Thus, there is a great need to develop effective personalized course recommender systems that can guide the learners to choose suitable courses. In practice, Reinforcement Learning (RL) can be applied to build dynamic user profiles from users' interactions with courses, which is the key to the success of a course recommender system. However, existing RL-based course recommendation methods usually suffer from the trade-off between exploration and exploitation. In this paper, we propose a novel course recommendation model, namely Hierarchical rEinforcement Learning with dynAmic Recurrent mechanism (HELAR), in which a profile constructor with autonomous learning ability is designed to make personalized course recommendation. To address the exploration-exploitation trade-off issue in constructing user profiles, we propose a novel policy gradient method. It employs a recurrent scheme by context-aware learning to exploit the current knowledge, while utilizing a dynamic baseline to explore the user's future preferences. Extensive experiments are conducted on two real-world datasets to evaluate the performance of the proposed HELAR model, and the experimental results demonstrate the advantage of HELAR over state-of-the-art course recommendation methods. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] A Service Recommendation System Based on Dynamic User Groups and Reinforcement Learning
    Zhang, En
    Ma, Wenming
    Zhang, Jinkai
    Xia, Xuchen
    ELECTRONICS, 2023, 12 (24)
  • [22] Resilient Navigation Among Dynamic Agents with Hierarchical Reinforcement Learning
    Wang, Sijia
    Jiang, Hao
    Wang, Zhaoqi
    ADVANCES IN COMPUTER GRAPHICS, CGI 2021, 2021, 13002 : 504 - 516
  • [23] Multi-agent Hierarchical Reinforcement Learning with Dynamic Termination
    Han, Dongge
    Bohmer, Wendelin
    Wooldridge, Michael
    Rogers, Alex
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 80 - 92
  • [24] Dynamic Multichannel Sensing in Cognitive Radio: Hierarchical Reinforcement Learning
    Liu, Shuai
    Wu, Jiayun
    He, Jing
    IEEE ACCESS, 2021, 9 : 25473 - 25481
  • [25] Multi-Agent Hierarchical Reinforcement Learning with Dynamic Termination
    Han, Dongge
    Boehmer, Wendelin
    Wooldridge, Michael
    Rogers, Alex
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2006 - 2008
  • [26] Hierarchical Reinforcement Learning for Playing a Dynamic Dungeon Crawler Game
    Niel, Remi
    Wiering, Marco A.
    2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 1159 - 1166
  • [27] DENOISING-ORIENTED DEEP HIERARCHICAL REINFORCEMENT LEARNING FOR NEXT-BASKET RECOMMENDATION
    Du, Qihan
    Yu, Li
    Li, Huiyuan
    Leng, Youfang
    Ou, Ningrui
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4093 - 4097
  • [28] Reinforcement learning of dynamic behavior by using recurrent neural networks
    Ahmet Onat
    Hajime Kita
    Yoshikazu Nishikawa
    Artificial Life and Robotics, 1997, 1 (3) : 117 - 121
  • [29] Deep Reinforcement Learning for Dynamic Things of Interest Recommendation in Intelligent Ambient Environment
    Altulyan, May S.
    Huang, Chaoran
    Yao, Lina
    Wang, Xianzhi
    Kanhere, Salil
    AI 2021: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13151 : 393 - 404
  • [30] Learning Path Recommendation Based on Reinforcement Learning
    Li, Ji
    Yu, Simiao
    Zhang, Tiancheng
    ENGINEERING LETTERS, 2024, 32 (09) : 1823 - 1832