Reinforcement Recommendation with User Multi-aspect Preference

被引:12
|
作者
Chen, Xu [1 ]
Du, Yali [2 ]
Xia, Long [3 ]
Wang, Jun [2 ]
机构
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China
[2] UCL, Dept Comp Sci, London, England
[3] York Univ, Sch Informat Technol, Toronto, ON, Canada
基金
中国国家自然科学基金;
关键词
Recommender system; Reinforcement learning; Multi-objective optimization;
D O I
10.1145/3442381.3449846
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Formulating recommender system with reinforcement learning (RL) frameworks has attracted increasing attention from both academic and industry communities. While many promising results have been achieved, existing models mostly simulate the environment reward with a unified value, which may hinder the understanding of users' complex preferences and limit the model performance. In this paper, we consider how to model user multi-aspect preferences in the context of RL-based recommender system. More specifically, we base our model on the framework of deterministic policy gradient (DPG), which is effective in dealing with large action spaces. A major challenge for modeling user multi-aspect preferences lies in the fact that they may contradict with each other. To solve this problem, we introduce Pareto optimization into the DPG framework. We assign each aspect with a tailored critic, and all the critics share the same actor. The Pareto optimization is realized by a gradient-based method, which can be easily integrated into the actor and critic learning process. Based on the designed model, we theoretically analyze its gradient bias in the optimization process, and we design a weight-reuse mechanism to lower the upper bound of this bias, which is shown to be effective for improving the model performance. We conduct extensive experiments based on three real-world datasets to demonstrate our model's superiorities.
引用
收藏
页码:425 / 435
页数:11
相关论文
共 50 条
  • [21] CupMar: A deep learning model for personalized news recommendation based on contextual user-profile and multi-aspect article representation
    Dai Hoang Tran
    Quan Z. Sheng
    Wei Emma Zhang
    Nguyen H. Tran
    Nguyen Lu Dang Khoa
    World Wide Web, 2023, 26 : 713 - 732
  • [22] MGMASR: Multi-Graph and Multi-Aspect Neural Network for Service Recommendation in Internet of Services
    Jia, Zhixuan
    Fan, Yushun
    Zhang, Jia
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (03): : 2668 - 2681
  • [23] CupMar: A deep learning model for personalized news recommendation based on contextual user-profile and multi-aspect article representation
    Dai Hoang Tran
    Sheng, Quan Z.
    Zhang, Wei Emma
    Tran, Nguyen H.
    Nguyen Lu Dang Khoa
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (02): : 713 - 732
  • [24] Multi-Aspect Dense Retrieval
    Kong, Weize
    Khadanga, Swaraj
    Li, Cheng
    Gupta, Shaleen Kumar
    Zhang, Mingyang
    Xu, Wensong
    Bendersky, Michael
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3178 - 3186
  • [25] Comparison of Multi-Aspect Multi-Baseline SAR Interferometry and Multi-Aspect TomoSAR Reconstruction Results
    Schmitt, Michael
    Stilla, Uwe
    10TH EUROPEAN CONFERENCE ON SYNTHETIC APERTURE RADAR (EUSAR 2014), 2014,
  • [26] Multi-Aspect User Ontology for Intelligent Decision Support Based on Digital Footprints
    A. V. Smirnov
    T. V. Levashova
    Scientific and Technical Information Processing, 2022, 49 : 486 - 496
  • [27] Multi-aspect Knowledge-enhanced Hypergraph Attention Network for Conversational Recommendation Systems
    Li, Xiaokang
    Zhang, Yihao
    Huang, Yonghao
    Li, Kaibei
    Zhang, Yunjia
    Wang, Xibin
    KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [28] Multi-Aspect User Ontology for Intelligent Decision Support Based on Digital Footprints
    Smirnov, A. V.
    Levashova, T. V.
    SCIENTIFIC AND TECHNICAL INFORMATION PROCESSING, 2022, 49 (06) : 486 - 496
  • [29] Aspect-Driven User Preference and News Representation Learning for News Recommendation
    Lu, Wenpeng
    Wang, Rongyao
    Wang, Shoujin
    Peng, Xueping
    Wu, Hao
    Zhang, Qian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25297 - 25307
  • [30] Multi-aspect user interaction models for distributed robotic systems: Collaborative activity generation
    Sato, Keiichi
    Bruder, Ralph
    2007 RO-MAN: 16TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, VOLS 1-3, 2007, : 87 - +