Uncovering instabilities in variational-quantum deep Q-networks

被引:8
|
作者
Franz, Maja [1 ]
Wolf, Lucas [1 ]
Periyasamy, Maniraman [2 ]
Ufrecht, Christian [2 ]
Scherer, Daniel D. [2 ]
Plinge, Axel [2 ]
Mutschler, Christopher [2 ]
Mauerer, Wolfgang [1 ,3 ]
机构
[1] Tech Univ Appl Sci, Regensburg, Germany
[2] Fraunhofer Inst Integrated Circuits IIS, Fraunhofer IIS, Div Positioning & Networks, Nurnberg, Germany
[3] Siemens AG, Corp Res, Munich, Germany
关键词
REINFORCEMENT; GAME; GO;
D O I
10.1016/j.jfranklin.2022.08.021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Reinforcement Learning (RL) has considerably advanced over the past decade. At the same time, state-of-the-art RL algorithms require a large computational budget in terms of training time to converge. Recent work has started to approach this problem through the lens of quantum computing, which promises theoretical speed-ups for several traditionally hard tasks. In this work, we examine a class of hybrid quantum-classical RL algorithms that we collectively refer to as variational quantum deep Q-networks (VQ-DQN). We show that VQ-DQN approaches are subject to instabilities that cause the learned policy to diverge, study the extent to which this afflicts reproducibility of established results based on classical simulation, and perform systematic experiments to identify potential explanations for the observed instabilities. Additionally, and in contrast to most existing work on quantum reinforcement learning, we execute RL algorithms on an actual quantum processing unit (an IBM Quantum Device) and investigate differences in behaviour between simulated and physical quantum systems that suffer from implementation deficiencies. Our experiments show that, contrary to opposite claims in the literature, it cannot be conclusively decided if known quantum approaches, even if simulated without physical imperfections, can provide an advantage as compared to classical approaches. Finally, we provide a robust, universal and well-tested implementation of VQ-DQN as a reproducible testbed for future experiments. (c) 2022 The Author(s). Published by Elsevier Ltd on behalf of The Franklin Institute. This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/ )
引用
收藏
页码:13822 / 13844
页数:23
相关论文
共 50 条
  • [1] Deep Abstract Q-Networks
    Roderick, Melrose
    Grimm, Christopher
    Tellex, Stefanie
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 131 - 138
  • [2] Weakly Coupled Deep Q-Networks
    El Shar, Ibrahim
    Jiang, Daniel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Episodic Memory Deep Q-Networks
    Lin, Zichuan
    Zhao, Tianqi
    Yang, Guangwen
    Zhang, Lintao
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2433 - 2439
  • [4] Towards Better Interpretability in Deep Q-Networks
    Annasamy, Raghuram Mandyam
    Sycara, Katia
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4561 - 4569
  • [5] Detecting Malicious Websites by using Deep Q-Networks
    Khanh Nguyen
    Park, Younghee
    2024 SILICON VALLEY CYBERSECURITY CONFERENCE, SVCC 2024, 2024,
  • [6] Efficient Exploration through Bayesian Deep Q-Networks
    Azizzadenesheli, Kamyar
    Brunskill, Emma
    Anandkumar, Animashree
    2018 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2018,
  • [7] Social Attentive Deep Q-Networks for Recommender Systems
    Lei, Yu
    Wang, Zhitao
    Li, Wenjie
    Pei, Hongbin
    Dai, Quanyu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (05) : 2443 - 2457
  • [8] Multi-agent Double Deep Q-Networks
    Simoes, David
    Lau, Nuno
    Reis, Luis Paulo
    PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2017), 2017, 10423 : 123 - 134
  • [9] Historical Best Q-Networks for Deep Reinforcement Learning
    Yu, Wenwu
    Wang, Rui
    Li, Ruiying
    Gao, Jing
    Hu, Xiaohui
    2018 IEEE 30TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2018, : 6 - 11
  • [10] DQNViz: A Visual Analytics Approach to Understand Deep Q-Networks
    Wang, Junpeng
    Gou, Liang
    Shen, Han-Wei
    Yang, Hao
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (01) : 288 - 298