Text-Based Interactive Recommendation via Offline Reinforcement Learning

被引:0
|
作者
Zhang, Ruiyi [1 ]
Yu, Tong [2 ]
Shen, Yilin [2 ]
Jin, Hongxia [2 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
[2] Samsung Res Amer, Mountain View, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interactive recommendation with natural-language feedback can provide richer user feedback and has demonstrated advantages over traditional recommender systems. However, the classical online paradigm involves iteratively collecting experience via interaction with users, which is expensive and risky. We consider an offline interactive recommendation to exploit arbitrary experience collected by multiple unknown policies. A direct application of policy learning with such fixed experience suffers from the distribution shift. To tackle this issue, we develop a behavior-agnostic off-policy correction framework to make offline interactive recommendation possible. Specifically, we leverage the conservative Q-function to perform off-policy evaluation, which enables learning effective policies from fixed datasets without further interactions. Empirical results on the simulator derived from real-world datasets demonstrate the effectiveness of our proposed offline training framework.
引用
收藏
页码:11694 / 11702
页数:9
相关论文
共 50 条
  • [41] Exploring Effective Interactive Text-Based Video Search in vitrivr
    Sauter, Loris
    Gasser, Ralph
    Heller, Silvan
    Rossetto, Luca
    Saladin, Colin
    Spiess, Florian
    Schuldt, Heiko
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 646 - 651
  • [42] DotCHA: An Interactive 3D Text-based CAPTCHA
    Kim, Suzi
    Choi, Sunghee
    JOURNAL OF WEB ENGINEERING, 2020, 18 (08): : 837 - 863
  • [43] Personalized style recommendation via reinforcement learning
    Luo, Jiyun
    Hazra, Kurchi Subhra
    Huo, Wenyu
    Li, Rui
    Mahabal, Abhijit
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 290 - 293
  • [44] Exploration Based Language Learning for Text-Based Games
    Madotto, Andrea
    Namazifar, Mahdi
    Huizinga, Joost
    Molino, Piero
    Ecoffet, Adrien
    Zheng, Huaixiu
    Yu, Dian
    Papangelis, Alexandros
    Khatri, Chandra
    Tur, Gokhan
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1488 - 1494
  • [45] A Review of Offline Reinforcement Learning Based on Representation Learning
    Wang X.-S.
    Wang R.-R.
    Cheng Y.-H.
    Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (06): : 1104 - 1128
  • [46] GraphDRL: GNN-based deep reinforcement learning for interactive recommendation with sparse data
    Li, Wenxin
    Song, Xiao
    Tu, Yuchun
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 273
  • [47] Improving ranking function and diversification in interactive recommendation systems based on deep reinforcement learning
    Baghi, Vahid
    Motehayeri, Seyed Mohammad Seyed
    Moeini, Ali
    Abedian, Rooholah
    2021 26TH INTERNATIONAL COMPUTER CONFERENCE, COMPUTER SOCIETY OF IRAN (CSICC), 2021,
  • [48] Balancing Between Accuracy and Fairness for Interactive Recommendation with Reinforcement Learning
    Liu, Weiwen
    Liu, Feng
    Tang, Ruiming
    Liao, Ben
    Chen, Guangyong
    Heng, Pheng Ann
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I, 2020, 12084 : 155 - 167
  • [49] Interactive Recommendation with User-Specific Deep Reinforcement Learning
    Lei, Yu
    Li, Wenjie
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2019, 13 (06)
  • [50] Exploration in Interactive Personalized Music Recommendation: A Reinforcement Learning Approach
    Wang, Xinxi
    Wang, Yi
    Hsu, David
    Wang, Ye
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2014, 11 (01)