Reinforcement Recommendation with User Multi-aspect Preference

被引:12
|
作者
Chen, Xu [1 ]
Du, Yali [2 ]
Xia, Long [3 ]
Wang, Jun [2 ]
机构
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China
[2] UCL, Dept Comp Sci, London, England
[3] York Univ, Sch Informat Technol, Toronto, ON, Canada
基金
中国国家自然科学基金;
关键词
Recommender system; Reinforcement learning; Multi-objective optimization;
D O I
10.1145/3442381.3449846
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Formulating recommender system with reinforcement learning (RL) frameworks has attracted increasing attention from both academic and industry communities. While many promising results have been achieved, existing models mostly simulate the environment reward with a unified value, which may hinder the understanding of users' complex preferences and limit the model performance. In this paper, we consider how to model user multi-aspect preferences in the context of RL-based recommender system. More specifically, we base our model on the framework of deterministic policy gradient (DPG), which is effective in dealing with large action spaces. A major challenge for modeling user multi-aspect preferences lies in the fact that they may contradict with each other. To solve this problem, we introduce Pareto optimization into the DPG framework. We assign each aspect with a tailored critic, and all the critics share the same actor. The Pareto optimization is realized by a gradient-based method, which can be easily integrated into the actor and critic learning process. Based on the designed model, we theoretically analyze its gradient bias in the optimization process, and we design a weight-reuse mechanism to lower the upper bound of this bias, which is shown to be effective for improving the model performance. We conduct extensive experiments based on three real-world datasets to demonstrate our model's superiorities.
引用
收藏
页码:425 / 435
页数:11
相关论文
共 50 条
  • [31] The multi-aspect tests in the presence of ties
    Yamaguchi, Hikaru
    Murakami, Hidetoshi
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 180
  • [32] Multi-aspect typology of philosophical terminology
    Kanichová, R
    FILOZOFIA, 2005, 60 (01): : 8 - 32
  • [33] Point-of-interest recommendation based on LBSN with multi-aspect fusion of social and individual features
    Zhang Y.
    Liu Y.
    Neural Computing and Applications, 2024, 36 (20) : 12163 - 12184
  • [34] MULTI-ASPECT MINING IN HEPATITIS DATA
    Ohshima, Muneaki
    Zhong, Ning
    Dong, Juzhen
    Yokoi, Hideto
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2009, 8 (03) : 445 - 472
  • [35] Temporal Graph Multi-Aspect Embeddings
    Sun, Aimin
    Gong, Zhiguo
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 7102 - 7114
  • [36] Keyphrase Generation: A Multi-Aspect Survey
    Cano, Erion
    Bojar, Ondrej
    PROCEEDINGS OF THE 2019 25TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2019, : 85 - 94
  • [37] Sentiment-Aware Representation Learning Framework Fusion With Multi-Aspect Information for POI Recommendation
    Gong, Weihua
    Shen, Genhang
    Yang, Lianghuai
    Liang, Haoran
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (05) : 2850 - 2861
  • [38] Multi-Aspect Streaming Tensor Completion
    Song, Qingquan
    Huang, Xiao
    Ge, Hancheng
    Caverlee, James
    Hu, Xia
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 435 - 443
  • [39] Multi-aspect renewable energy forecasting
    Corizzo, Roberto
    Ceci, Michelangelo
    Fanaee-T, Hadi
    Gama, Joao
    INFORMATION SCIENCES, 2021, 546 : 701 - 722
  • [40] Multi-aspect gene relation analysis
    Yamakawa, H
    Maruhashi, K
    Nakao, Y
    Yamaguchi, M
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2005, 2005, : 233 - 244