Text-Based Interactive Recommendation via Offline Reinforcement Learning

被引:0
|
作者
Zhang, Ruiyi [1 ]
Yu, Tong [2 ]
Shen, Yilin [2 ]
Jin, Hongxia [2 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
[2] Samsung Res Amer, Mountain View, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interactive recommendation with natural-language feedback can provide richer user feedback and has demonstrated advantages over traditional recommender systems. However, the classical online paradigm involves iteratively collecting experience via interaction with users, which is expensive and risky. We consider an offline interactive recommendation to exploit arbitrary experience collected by multiple unknown policies. A direct application of policy learning with such fixed experience suffers from the distribution shift. To tackle this issue, we develop a behavior-agnostic off-policy correction framework to make offline interactive recommendation possible. Specifically, we leverage the conservative Q-function to perform off-policy evaluation, which enables learning effective policies from fixed datasets without further interactions. Empirical results on the simulator derived from real-world datasets demonstrate the effectiveness of our proposed offline training framework.
引用
收藏
页码:11694 / 11702
页数:9
相关论文
共 50 条
  • [31] TiGAN: Text-Based Interactive Image Generation and Manipulation
    Zhou, Yufan
    Zhang, Ruiyi
    Gu, Jiuxiang
    Tensmeyer, Chris
    Yu, Tong
    Chen, Changyou
    Xu, Jinhui
    Sun, Tong
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3580 - 3588
  • [32] Semi-Offline Reinforcement Learning for Optimized Text Generation
    Chen, Changyu
    Wang, Xiting
    Jin, Yiqiao
    Dong, Victor Ye
    Dong, Li
    Cao, Jie
    Liu, Yi
    Yan, Rui
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [33] Text-Based Price Recommendation System for Online Rental Houses
    Lujia Shen
    Qianjun Liu
    Gong Chen
    Shouling Ji
    Big Data Mining and Analytics, 2020, (02) : 143 - 152
  • [34] RecGPT: Generative Pre-training for Text-based Recommendation
    Mang Ngo
    Dat Quoc Nguyen
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2: SHORT PAPERS, 2024, : 302 - 313
  • [35] Text-Based Price Recommendation System for Online Rental Houses
    Shen, Lujia
    Liu, Qianjun
    Chen, Gong
    Ji, Shouling
    BIG DATA MINING AND ANALYTICS, 2020, 3 (02): : 143 - 152
  • [36] Learning Personalized Health Recommendations via Offline Reinforcement Learning
    Preuett, Larry
    PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 1355 - 1357
  • [37] The Development of a Virtual World Problem-Based Learning Tutorial and Comparison With Interactive Text-Based Tutorials
    Jivram, Trupti
    Kavia, Sheetal
    Poulton, Ella
    Hernandez, Aurora Sese
    Woodham, Luke A.
    Poulton, Terry
    FRONTIERS IN DIGITAL HEALTH, 2021, 3
  • [38] Text-based Person Search via Multi-Granularity Embedding Learning
    Wang, Chengji
    Luo, Zhiming
    Lin, Yaojin
    Li, Shaozi
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1068 - 1074
  • [39] Text-based person search via cross-modal alignment learning
    Ke, Xiao
    Liu, Hao
    Xu, Peirong
    Lin, Xinru
    Guo, Wenzhong
    PATTERN RECOGNITION, 2024, 152
  • [40] Reinforcement learning via offline trajectory planning based on iteratively approximated models
    Pritzkoleit, Max
    Heedt, Robert
    Knoll, Carsten
    Roebenack, Klaus
    AT-AUTOMATISIERUNGSTECHNIK, 2020, 68 (08) : 612 - 624