DEN-DQL: Quick Convergent Deep Q-Learning with Double Exploration Networks for News Recommendation

被引：0

作者：

Song, Zhanghan ^{[1
]}

Zhang, Dian ^{[1
]}

Shi, Xiaochuan ^{[1
]}

Li, Wei ^{[2
]}

Ma, Chao ^{[1
]}

Wu, Libing ^{[1
]}

机构：

[1] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan, Peoples R China

[2] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi, Jiangsu, Peoples R China

来源：

2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2021年

关键词：

Reinforcement Learning; Deep Q-Learning; News Recommendation; Double Exploration Networks;

D O I：

10.1109/IJCNN52387.2021.9533818

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Due to the dynamic characteristics of news and user preferences, personalized recommendation is a challenging problem. Traditional recommendation methods simply focus on current reward, which just recommend items to maximize the number of current clicks. And this may reduce users' interest in similar items. Although the news recommendation framework based on deep reinforcement learning preciously proposed (i.e, DRL, based on deep Q-learning) has the advantages of focusing on future total rewards and dynamic interactive recommendation, it has two issues. First, its exploration method is slow to converge, which may bring new users a bad experience. Second, it is hard to train on off-line data set because the reward is difficult to be determined. In order to address the aforementioned issues, we propose a framework named DEN-DQL for news recommendation based on deep Q-learning with double exploration networks. Also, we develop a new method to calculate rewards and use an off-line data set to simulate the online news clicking environment to train DEN-DQL. Then, the well trained DEN-DQL is tested in the online environment of the same data set, which demonstrates at least 10% improvement of the proposed DEN-DQL.

引用

页数：8

共 50 条

[41] An Efficient Multimodal Emotion Identification Using FOX Optimized Double Deep Q-Learning
Selvi, R.
Vijayakumaran, C.
WIRELESS PERSONAL COMMUNICATIONS, 2023, 132 (04) : 2387 - 2406
[42] An Efficient Multimodal Emotion Identification Using FOX Optimized Double Deep Q-Learning
R. Selvi
C. Vijayakumaran
Wireless Personal Communications, 2023, 132 (4) : 2387 - 2406
[43] Constrained Double Deep Q-learning Network for EVs Charging Scheduling with Renewable Energy
Ming, Fangzhu
Gao, Feng
Liu, Kun
Wu, Jiang
Xu, Zhanbo
Li, Wenming
2020 IEEE 16TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2020, : 636 - 641
[44] Double Deep Q-Learning With Prioritized Experience Replay for Anomaly Detection in Smart Environments
Fahrmann, Daniel
Jorek, Nils
Damer, Naser
Kirchbuchner, Florian
Kuijper, Arjan
IEEE ACCESS, 2022, 10 : 60836 - 60848
[45] Improved residential energy management system using priority double deep Q-learning
Mathew, Alwyn
Jolly, Milan Jeetendra
Mathew, Jimson
SUSTAINABLE CITIES AND SOCIETY, 2021, 69
[46] Human-like Autonomous Vehicle Speed Control by Deep Reinforcement Learning with Double Q-Learning
Zhang, Yi
Sun, Ping
Yin, Yuhan
Lin, Lin
Wang, Xuesong
2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1251 - 1256
[47] Energy management based on reinforcement learning with double deep Q-learning for a hybrid electric tracked vehicle
Han, Xuefeng
He, Hongwen
Wu, Jingda
Peng, Jiankun
Li, Yuecheng
APPLIED ENERGY, 2019, 254
[48] Optimized Trajectory Design in UAV Based Cellular Networks: A Double Q-Learning Approach
Liu, Xuanlin
Chen, Mingzhe
Yin, Changchuan
PROCEEDINGS OF 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS (ICCS 2018), 2018, : 13 - 18
[49] Onboard Double Q-Learning for Airborne Data Capture in Wireless Powered IoT Networks
Li, Kai
Ni, Wei
Wei, Bo
Tovar, Eduardo
Li, Kai (kai@isep.ipp.pt), 1600, Institute of Electrical and Electronics Engineers Inc., United States (02): : 71 - 75
[50] WIP: Demand-Driven Power Allocation in Wireless Networks with Deep Q-Learning
Giannopoulos, A.
Spantideas, S.
Capsalis, N.
Gkonis, P.
Karkazis, P.
Sarakis, L.
Trakadas, P.
Capsalis, C.
2021 IEEE 22ND INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (WOWMOM 2021), 2021, : 248 - 251

← 1 2 3 4 5 →