Reinforcement Recommendation with User Multi-aspect Preference

被引:12
|
作者
Chen, Xu [1 ]
Du, Yali [2 ]
Xia, Long [3 ]
Wang, Jun [2 ]
机构
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China
[2] UCL, Dept Comp Sci, London, England
[3] York Univ, Sch Informat Technol, Toronto, ON, Canada
基金
中国国家自然科学基金;
关键词
Recommender system; Reinforcement learning; Multi-objective optimization;
D O I
10.1145/3442381.3449846
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Formulating recommender system with reinforcement learning (RL) frameworks has attracted increasing attention from both academic and industry communities. While many promising results have been achieved, existing models mostly simulate the environment reward with a unified value, which may hinder the understanding of users' complex preferences and limit the model performance. In this paper, we consider how to model user multi-aspect preferences in the context of RL-based recommender system. More specifically, we base our model on the framework of deterministic policy gradient (DPG), which is effective in dealing with large action spaces. A major challenge for modeling user multi-aspect preferences lies in the fact that they may contradict with each other. To solve this problem, we introduce Pareto optimization into the DPG framework. We assign each aspect with a tailored critic, and all the critics share the same actor. The Pareto optimization is realized by a gradient-based method, which can be easily integrated into the actor and critic learning process. Based on the designed model, we theoretically analyze its gradient bias in the optimization process, and we design a weight-reuse mechanism to lower the upper bound of this bias, which is shown to be effective for improving the model performance. We conduct extensive experiments based on three real-world datasets to demonstrate our model's superiorities.
引用
收藏
页码:425 / 435
页数:11
相关论文
共 50 条
  • [41] Automatic reconstruction of 3D objects from multi-aspect Part II: Multi-aspect reconstruction
    Key Laboratory of Wave Scattering and Remote Sensing Information, Fudan University, Shanghai 200433, China
    Dianbo Kexue Xuebao, 2008, 1 (23-33):
  • [42] Multi-Aspect Embedding of Dynamic Graphs
    Sun, Aimin
    Gong, Zhiguo
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4520 - 4524
  • [43] Multi-aspect synthetic aperture sonar
    Fernandez, JE
    Christoff, JT
    OCEANS 2000 MTS/IEEE - WHERE MARINE SCIENCE AND TECHNOLOGY MEET, VOLS 1-3, CONFERENCE PROCEEDINGS, 2000, : 177 - 180
  • [44] Leveraging Long Short-Term User Preference in Conversational Recommendation via Multi-agent Reinforcement Learning
    Deng, Yang
    Li, Yaliang
    Ding, Bolin
    Lam, Wai
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (11) : 11541 - 11555
  • [45] MULTIPLE: Multi-level User Preference Learning for List Recommendation
    Li, Beibei
    Jin, Beihong
    Dong, Xinzhou
    Zhuo, Wei
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2021, PT II, 2021, 13081 : 221 - 236
  • [46] Multi-Aspect Rating Inference with Aspect-Based Segmentation
    Zhu, Jingbo
    Zhang, Chunliang
    Ma, Matthew Y.
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2012, 3 (04) : 469 - 481
  • [47] A multi-objective framework for location recommendation based on user preference
    Wang, Shanfeng
    Gong, Maoguo
    Qin, Can
    Yang, Junwei
    2017 13TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2017, : 39 - 43
  • [48] Social recommendation via multi-view user preference learning
    Lu, Hanqing
    Chen, Chaochao
    Kong, Ming
    Zhang, Hanyi
    Zhao, Zhou
    NEUROCOMPUTING, 2016, 216 : 61 - 71
  • [49] Multi-view User Preference Learning with Knowledge Graph for Recommendation
    Zhang, Yiming
    Pang, Yitong
    Wei, Zhihuai
    2022 IEEE 2ND INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SOFTWARE ENGINEERING (ICICSE 2022), 2022, : 66 - 72
  • [50] Multi-Aspect Mining of Complex Sensor Sequences
    Honda, Takato
    Matsubara, Yasuko
    Neyama, Ryo
    Abe, Mutsumi
    Sakurai, Yasushi
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 299 - 308