Text-Based Interactive Recommendation via Offline Reinforcement Learning

被引:0
|
作者
Zhang, Ruiyi [1 ]
Yu, Tong [2 ]
Shen, Yilin [2 ]
Jin, Hongxia [2 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
[2] Samsung Res Amer, Mountain View, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interactive recommendation with natural-language feedback can provide richer user feedback and has demonstrated advantages over traditional recommender systems. However, the classical online paradigm involves iteratively collecting experience via interaction with users, which is expensive and risky. We consider an offline interactive recommendation to exploit arbitrary experience collected by multiple unknown policies. A direct application of policy learning with such fixed experience suffers from the distribution shift. To tackle this issue, we develop a behavior-agnostic off-policy correction framework to make offline interactive recommendation possible. Specifically, we leverage the conservative Q-function to perform off-policy evaluation, which enables learning effective policies from fixed datasets without further interactions. Empirical results on the simulator derived from real-world datasets demonstrate the effectiveness of our proposed offline training framework.
引用
收藏
页码:11694 / 11702
页数:9
相关论文
共 50 条
  • [21] An Interactive Learning Platform with Machine Translation for Practicing Text-Based Conversational English
    Rusli, Andre
    Shishido, Makoto
    2022 JOINT 12TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 23RD INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS&ISIS), 2022,
  • [22] RLISR: A Deep Reinforcement Learning Based Interactive Service Recommendation Model
    Zhang, Mingwei
    Qu, Yingjie
    Li, Yage
    Wen, Xingyu
    Zhou, Yi
    IEEE ACCESS, 2024, 12 : 90204 - 90217
  • [23] Improved estimation of the correlation matrix using reinforcement learning and text-based networks
    Lu, Cheng
    Ndiaye, Papa Momar
    Simaan, Majeed
    INTERNATIONAL REVIEW OF FINANCIAL ANALYSIS, 2024, 96
  • [24] Interactive Interior Design Recommendation via Coarse-to-fine Multimodal Reinforcement Learning
    Zhang, He
    Sun, Ying
    Guo, Weiyu
    Liu, Yafei
    Lu, Haonan
    Lin, Xiaodong
    Xiong, Hui
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6472 - 6480
  • [25] Grandmaster: Interactive text-based analytics of social media
    Fabian, Nathan
    Davis, Warren
    Raybourn, Elaine M.
    Lakkaraju, Kiran
    Whetzel, Jon
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 1375 - 1381
  • [26] Chatting with interactive memory for text-based person retrieval
    He, Chen
    Li, Shenshen
    Wang, Zheng
    Chen, Hua
    Shen, Fumin
    Xu, Xing
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [27] Toward Collaborative Reinforcement Learning Agents that Communicate Through Text-Based Natural Language
    Eloff, Kevin M.
    Engelbrecht, Herman A.
    2021 SOUTHERN AFRICAN UNIVERSITIES POWER ENGINEERING CONFERENCE/ROBOTICS AND MECHATRONICS/PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA (SAUPEC/ROBMECH/PRASA), 2021,
  • [28] An interactive food recommendation system using reinforcement learning
    Liu, Liangliang
    Guan, Yi
    Wang, Zi
    Shen, Rujia
    Zheng, Guowei
    Fu, Xuelian
    Yu, Xuehui
    Jiang, Jingchi
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 254
  • [29] Abstract then Play: A Skill-centric Reinforcement Learning Framework for Text-based Games
    Zhu, Anjie
    Zhang, Peng-Fei
    Zhang, Yi
    Huang, Zi
    Shao, Jie
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 13225 - 13236
  • [30] Efficient Text-based Reinforcement Learning by Jointly Leveraging State and Commonsense Graph Representations
    Murugesan, Keerthiram
    Atzeni, Mattia
    Kapanipathi, Pavan
    Talamadupula, Kartik
    Sachan, Mrinmaya
    Campbell, Murray
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 719 - 725