Text-Based Interactive Recommendation via Offline Reinforcement Learning

被引:0
|
作者
Zhang, Ruiyi [1 ]
Yu, Tong [2 ]
Shen, Yilin [2 ]
Jin, Hongxia [2 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
[2] Samsung Res Amer, Mountain View, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interactive recommendation with natural-language feedback can provide richer user feedback and has demonstrated advantages over traditional recommender systems. However, the classical online paradigm involves iteratively collecting experience via interaction with users, which is expensive and risky. We consider an offline interactive recommendation to exploit arbitrary experience collected by multiple unknown policies. A direct application of policy learning with such fixed experience suffers from the distribution shift. To tackle this issue, we develop a behavior-agnostic off-policy correction framework to make offline interactive recommendation possible. Specifically, we leverage the conservative Q-function to perform off-policy evaluation, which enables learning effective policies from fixed datasets without further interactions. Empirical results on the simulator derived from real-world datasets demonstrate the effectiveness of our proposed offline training framework.
引用
收藏
页码:11694 / 11702
页数:9
相关论文
共 50 条
  • [1] A Text-Based Deep Reinforcement Learning Framework for Interactive Recommendation
    Wang, Chaoyang
    Guo, Zhiqiang
    Li, Jianjun
    Pan, Peng
    Li, Guohui
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 537 - 544
  • [2] A General Offline Reinforcement Learning Framework for Interactive Recommendation
    Xiao, Teng
    Wang, Donglin
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 4512 - 4520
  • [3] Policy-based Reinforcement Learning for Generalisation in Interactive Text-based Environments
    Toledo, Edan
    Buys, Jan
    Shock, Jonathan
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1230 - 1242
  • [4] Generalization in Text-based Games via Hierarchical Reinforcement Learning
    Xu, Yunqiu
    Fang, Meng
    Chen, Ling
    Du, Yali
    Zhang, Chengqi
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1343 - 1353
  • [5] Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation
    Gao, Chongming
    Huang, Kexin
    Chen, Jiawei
    Zhang, Yuan
    Li, Biao
    Jiang, Peng
    Wang, Shiqi
    Zhang, Zhong
    He, Xiangnan
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 238 - 248
  • [6] Session-based Interactive Recommendation via Deep Reinforcement Learning
    Shi, Longxiang
    Zhang, Zilin
    Wang, Shoujin
    Zhang, Qi
    Wu, Minghui
    Yang, Cheng
    Li, Shijian
    Proceedings - IEEE International Conference on Data Mining, ICDM, 2023, : 1319 - 1324
  • [7] Session-based Interactive Recommendation via Deep Reinforcement Learning
    Shi, Longxiang
    Zhang, Zilin
    Wang, Shoujin
    Zhang, Qi
    Wu, Minghui
    Yang, Cheng
    Li, Shijian
    23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 1319 - 1324
  • [8] INTERACTIVE EFFECTS OF TEXT-BASED AND TASK-BASED IMPORTANCE ON LEARNING FROM TEXT
    SCHRAW, G
    WADE, SE
    KARDASH, CAM
    JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1993, 85 (04) : 652 - 661
  • [9] A Review of Text-Based Recommendation Systems
    Kanwal, Safia
    Nawaz, Sidra
    Malik, Muhammad Kamran
    Nawaz, Zubair
    IEEE ACCESS, 2021, 9 : 31638 - 31661
  • [10] Learning to Play Text-Based Adventure Games with Maximum Entropy Reinforcement Learning
    Li, Weichen
    Devidze, Rati
    Fellenz, Sophie
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 39 - 54