Trax Solver on Zynq with Deep Q-Network

被引:0
|
作者
Sugimoto, Naru [1 ]
Mitsuishi, Takuji [1 ]
Kaneda, Takahiro [1 ]
Tsuruta, Chiharu [1 ]
Sakai, Ryotaro [1 ]
Shimura, Hideki [1 ]
Amano, Hideharu [1 ]
机构
[1] Keio Univ, Yokohama, Kanagawa 2238522, Japan
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A software/hardware co-design system for a Trax solver is proposed. Implementation of Trax AI is challenging due to its complicated rules, so we adopted an embedded system called Zynq (Zynq-7000 AP SoC) and introduced a High Level Synthesis (HLS) design. We also added Deep Q-Network, a machine learning algorithm, to the system for use as an evaluation function. Our solver automatically optimizes its own evaluation function through games with humans or other AIs. The implemented solver works with a 150-MHz clock on the Xilinx XC7Z020-CLG484 of a Digilent ZedBoard. A part of the Deep Q-Network job can be executed on the FPGA of the Zynq board more than 26 times faster than with ARM Coretex-A9 650-MHz software.
引用
收藏
页码:272 / 275
页数:4
相关论文
共 50 条
  • [21] Manufacturing Resource Scheduling Based on Deep Q-Network
    ZHANG Yufei
    ZOU Yuanhao
    ZHAO Xiaodong
    Wuhan University Journal of Natural Sciences, 2022, 27 (06) : 531 - 538
  • [22] Inhomogeneous deep Q-network for time sensitive applications
    Chen, Xu
    Wang, Jun
    ARTIFICIAL INTELLIGENCE, 2022, 312
  • [23] Dynamic Parallel Machine Scheduling With Deep Q-Network
    Liu, Chien-Liang
    Tseng, Chun-Jan
    Huang, Tzu-Hsuan
    Wang, Jhih-Wun
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (11): : 6792 - 6804
  • [24] Multiagent Learning and Coordination with Clustered Deep Q-Network
    Pageaud, Simon
    Deslandres, Veronique
    Lehoux, Vassilissa
    Hassas, Salima
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2156 - 2158
  • [25] Deep Reinforcement Learning. Case Study: Deep Q-Network
    Vrejoiu, Mihnea Horia
    ROMANIAN JOURNAL OF INFORMATION TECHNOLOGY AND AUTOMATIC CONTROL-REVISTA ROMANA DE INFORMATICA SI AUTOMATICA, 2019, 29 (03): : 65 - 78
  • [26] Deep Reinforcement Learning Pairs Trading with a Double Deep Q-Network
    Brim, Andrew
    2020 10TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2020, : 222 - 227
  • [27] FPGA Trax Solver based on a Neural Network Design
    Fujimori, Takumi
    Akabe, Tomoya
    Ito, Yoshizumi
    Akagi, Kouta
    Furukawa, Shinya
    Shinba, Hiroki
    Tanibata, Aoi
    Watanabe, Minoru
    2015 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (FPT), 2015, : 260 - 263
  • [28] Deep Q-Network for Radar Task-Scheduling Problem
    George, Taylor
    Wagner, Kevin
    Rademacher, Paul
    2022 IEEE RADAR CONFERENCE (RADARCONF'22), 2022,
  • [29] Hierarchical Deep Q-Network from imperfect demonstrations in Minecraft
    Skrynnik, Alexey
    Staroverov, Aleksey
    Aitygulov, Ermek
    Aksenov, Kirill
    Davydov, Vasilii
    Panov, Aleksandr I.
    COGNITIVE SYSTEMS RESEARCH, 2021, 65 : 74 - 78
  • [30] Double Deep Q-Network by Fusing Contrastive Predictive Coding
    Liu, Jianfeng
    Pu, Jiexin
    Sun, Lifan
    Computer Engineering and Applications, 2023, 59 (06) : 162 - 170