DEN-DQL: Quick Convergent Deep Q-Learning with Double Exploration Networks for News Recommendation

被引:0
|
作者
Song, Zhanghan [1 ]
Zhang, Dian [1 ]
Shi, Xiaochuan [1 ]
Li, Wei [2 ]
Ma, Chao [1 ]
Wu, Libing [1 ]
机构
[1] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan, Peoples R China
[2] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi, Jiangsu, Peoples R China
来源
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2021年
关键词
Reinforcement Learning; Deep Q-Learning; News Recommendation; Double Exploration Networks;
D O I
10.1109/IJCNN52387.2021.9533818
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the dynamic characteristics of news and user preferences, personalized recommendation is a challenging problem. Traditional recommendation methods simply focus on current reward, which just recommend items to maximize the number of current clicks. And this may reduce users' interest in similar items. Although the news recommendation framework based on deep reinforcement learning preciously proposed (i.e, DRL, based on deep Q-learning) has the advantages of focusing on future total rewards and dynamic interactive recommendation, it has two issues. First, its exploration method is slow to converge, which may bring new users a bad experience. Second, it is hard to train on off-line data set because the reward is difficult to be determined. In order to address the aforementioned issues, we propose a framework named DEN-DQL for news recommendation based on deep Q-learning with double exploration networks. Also, we develop a new method to calculate rewards and use an off-line data set to simulate the online news clicking environment to train DEN-DQL. Then, the well trained DEN-DQL is tested in the online environment of the same data set, which demonstrates at least 10% improvement of the proposed DEN-DQL.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] An Efficient Multimodal Emotion Identification Using FOX Optimized Double Deep Q-Learning
    Selvi, R.
    Vijayakumaran, C.
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 132 (04) : 2387 - 2406
  • [42] An Efficient Multimodal Emotion Identification Using FOX Optimized Double Deep Q-Learning
    R. Selvi
    C. Vijayakumaran
    Wireless Personal Communications, 2023, 132 (4) : 2387 - 2406
  • [43] Constrained Double Deep Q-learning Network for EVs Charging Scheduling with Renewable Energy
    Ming, Fangzhu
    Gao, Feng
    Liu, Kun
    Wu, Jiang
    Xu, Zhanbo
    Li, Wenming
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2020, : 636 - 641
  • [44] Double Deep Q-Learning With Prioritized Experience Replay for Anomaly Detection in Smart Environments
    Fahrmann, Daniel
    Jorek, Nils
    Damer, Naser
    Kirchbuchner, Florian
    Kuijper, Arjan
    IEEE ACCESS, 2022, 10 : 60836 - 60848
  • [45] Improved residential energy management system using priority double deep Q-learning
    Mathew, Alwyn
    Jolly, Milan Jeetendra
    Mathew, Jimson
    SUSTAINABLE CITIES AND SOCIETY, 2021, 69
  • [46] Human-like Autonomous Vehicle Speed Control by Deep Reinforcement Learning with Double Q-Learning
    Zhang, Yi
    Sun, Ping
    Yin, Yuhan
    Lin, Lin
    Wang, Xuesong
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1251 - 1256
  • [47] Energy management based on reinforcement learning with double deep Q-learning for a hybrid electric tracked vehicle
    Han, Xuefeng
    He, Hongwen
    Wu, Jingda
    Peng, Jiankun
    Li, Yuecheng
    APPLIED ENERGY, 2019, 254
  • [48] Optimized Trajectory Design in UAV Based Cellular Networks: A Double Q-Learning Approach
    Liu, Xuanlin
    Chen, Mingzhe
    Yin, Changchuan
    PROCEEDINGS OF 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS (ICCS 2018), 2018, : 13 - 18
  • [49] Onboard Double Q-Learning for Airborne Data Capture in Wireless Powered IoT Networks
    Li, Kai
    Ni, Wei
    Wei, Bo
    Tovar, Eduardo
    Li, Kai (kai@isep.ipp.pt), 1600, Institute of Electrical and Electronics Engineers Inc., United States (02): : 71 - 75
  • [50] WIP: Demand-Driven Power Allocation in Wireless Networks with Deep Q-Learning
    Giannopoulos, A.
    Spantideas, S.
    Capsalis, N.
    Gkonis, P.
    Karkazis, P.
    Sarakis, L.
    Trakadas, P.
    Capsalis, C.
    2021 IEEE 22ND INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (WOWMOM 2021), 2021, : 248 - 251