Reinforcement Recommendation with User Multi-aspect Preference

Cited by: 12

Authors:

Chen, Xu [1]

Du, Yali [2]

Xia, Long [3]

Wang, Jun [2]

Affiliations:

[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China

[2] UCL, Dept Comp Sci, London, England

[3] York Univ, Sch Informat Technol, Toronto, ON, Canada

Source:

PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021) | 2021

Funding:

National Natural Science Foundation of China;

Keywords:

Recommender system; Reinforcement learning; Multi-objective optimization;

DOI:

10.1145/3442381.3449846

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Formulating recommender systems within reinforcement learning (RL) frameworks has attracted increasing attention from both academia and industry. While many promising results have been achieved, existing models mostly simulate the environment reward as a single unified value, which may hinder the understanding of users' complex preferences and limit model performance. In this paper, we consider how to model users' multi-aspect preferences in the context of RL-based recommender systems. More specifically, we base our model on the deterministic policy gradient (DPG) framework, which is effective in dealing with large action spaces. A major challenge in modeling users' multi-aspect preferences is that they may contradict each other. To solve this problem, we introduce Pareto optimization into the DPG framework. We assign each aspect a tailored critic, and all the critics share the same actor. The Pareto optimization is realized by a gradient-based method, which can be easily integrated into the actor and critic learning process. Based on the designed model, we theoretically analyze its gradient bias in the optimization process, and we design a weight-reuse mechanism to lower the upper bound of this bias, which is shown to be effective in improving model performance. We conduct extensive experiments on three real-world datasets to demonstrate our model's superiority.
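
The abstract does not say which gradient-based method realizes the Pareto optimization. As a rough illustration only, the sketch below uses a common choice for two conflicting objectives, the min-norm (MGDA-style) combination of gradients, which has a closed form when there are exactly two aspects. The function names and the idea that `g1` and `g2` stand for the actor gradients produced by two aspect-specific critics are assumptions made for this example, not details taken from the paper.

```python
from typing import List, Sequence


def dot(u: Sequence[float], v: Sequence[float]) -> float:
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def min_norm_weight(g1: Sequence[float], g2: Sequence[float]) -> float:
    """Return alpha in [0, 1] minimizing ||alpha*g1 + (1-alpha)*g2||^2.

    Hypothetical helper: g1 and g2 play the role of the shared actor's
    gradients with respect to two aspect-specific critics.
    """
    diff = [a - b for a, b in zip(g1, g2)]
    denom = dot(diff, diff)
    if denom == 0.0:
        # Identical gradients: any weight gives the same direction.
        return 0.5
    # Unconstrained minimizer, then clipped to the valid range [0, 1].
    alpha = dot([b - a for a, b in zip(g1, g2)], g2) / denom
    return min(1.0, max(0.0, alpha))


def pareto_direction(g1: Sequence[float], g2: Sequence[float]) -> List[float]:
    """Combine two gradients into the min-norm common direction."""
    alpha = min_norm_weight(g1, g2)
    return [alpha * a + (1.0 - alpha) * b for a, b in zip(g1, g2)]


if __name__ == "__main__":
    # Two partially conflicting aspect gradients (made-up numbers).
    g_aspect_1 = [1.0, 0.0]
    g_aspect_2 = [-1.0, 1.0]
    d = pareto_direction(g_aspect_1, g_aspect_2)
    # The combined direction has a non-negative inner product with both
    # gradients, so a small step along it does not hurt either aspect
    # to first order.
    print(d, dot(d, g_aspect_1), dot(d, g_aspect_2))
```

With more than two aspects the weights no longer have a closed form and are usually found by solving a small quadratic program over the probability simplex; that case is not shown here.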


Pages: 425 - 435

Page count: 11

50 items in total

[21] CupMar: A deep learning model for personalized news recommendation based on contextual user-profile and multi-aspect article representation
Dai Hoang Tran
Quan Z. Sheng
Wei Emma Zhang
Nguyen H. Tran
Nguyen Lu Dang Khoa
World Wide Web, 2023, 26 : 713 - 732
[22] MGMASR: Multi-Graph and Multi-Aspect Neural Network for Service Recommendation in Internet of Services
Jia, Zhixuan
Fan, Yushun
Zhang, Jia
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (03) : 2668 - 2681
[23] CupMar: A deep learning model for personalized news recommendation based on contextual user-profile and multi-aspect article representation
Dai Hoang Tran
Sheng, Quan Z.
Zhang, Wei Emma
Tran, Nguyen H.
Nguyen Lu Dang Khoa
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (02) : 713 - 732
[24] Multi-Aspect Dense Retrieval
Kong, Weize
Khadanga, Swaraj
Li, Cheng
Gupta, Shaleen Kumar
Zhang, Mingyang
Xu, Wensong
Bendersky, Michael
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022 : 3178 - 3186
[25] Comparison of Multi-Aspect Multi-Baseline SAR Interferometry and Multi-Aspect TomoSAR Reconstruction Results
Schmitt, Michael
Stilla, Uwe
10TH EUROPEAN CONFERENCE ON SYNTHETIC APERTURE RADAR (EUSAR 2014), 2014
[26] Multi-Aspect User Ontology for Intelligent Decision Support Based on Digital Footprints
A. V. Smirnov
T. V. Levashova
Scientific and Technical Information Processing, 2022, 49 : 486 - 496
[27] Multi-aspect Knowledge-enhanced Hypergraph Attention Network for Conversational Recommendation Systems
Li, Xiaokang
Zhang, Yihao
Huang, Yonghao
Li, Kaibei
Zhang, Yunjia
Wang, Xibin
KNOWLEDGE-BASED SYSTEMS, 2024, 299
[28] Multi-Aspect User Ontology for Intelligent Decision Support Based on Digital Footprints
Smirnov, A. V.
Levashova, T. V.
SCIENTIFIC AND TECHNICAL INFORMATION PROCESSING, 2022, 49 (06) : 486 - 496
[29] Aspect-Driven User Preference and News Representation Learning for News Recommendation
Lu, Wenpeng
Wang, Rongyao
Wang, Shoujin
Peng, Xueping
Wu, Hao
Zhang, Qian
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25297 - 25307
[30] Multi-aspect user interaction models for distributed robotic systems: Collaborative activity generation
Sato, Keiichi
Bruder, Ralph
2007 RO-MAN: 16TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, VOLS 1-3, 2007, : 87 - +