Trax Solver on Zynq with Deep Q-Network

被引：0

作者：

Sugimoto, Naru ^{[1
]}

Mitsuishi, Takuji ^{[1
]}

Kaneda, Takahiro ^{[1
]}

Tsuruta, Chiharu ^{[1
]}

Sakai, Ryotaro ^{[1
]}

Shimura, Hideki ^{[1
]}

Amano, Hideharu ^{[1
]}

机构：

[1] Keio Univ, Yokohama, Kanagawa 2238522, Japan

来源：

2015 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (FPT) | 2015年

关键词：

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

A software/hardware co-design system for a Trax solver is proposed. Implementation of Trax AI is challenging due to its complicated rules, so we adopted an embedded system called Zynq (Zynq-7000 AP SoC) and introduced a High Level Synthesis (HLS) design. We also added Deep Q-Network, a machine learning algorithm, to the system for use as an evaluation function. Our solver automatically optimizes its own evaluation function through games with humans or other AIs. The implemented solver works with a 150-MHz clock on the Xilinx XC7Z020-CLG484 of a Digilent ZedBoard. A part of the Deep Q-Network job can be executed on the FPGA of the Zynq board more than 26 times faster than with ARM Coretex-A9 650-MHz software.

引用

页码：272 / 275

页数：4

共 50 条

[21] Manufacturing Resource Scheduling Based on Deep Q-Network
ZHANG Yufei
ZOU Yuanhao
ZHAO Xiaodong
Wuhan University Journal of Natural Sciences, 2022, 27 (06) : 531 - 538
[22] Inhomogeneous deep Q-network for time sensitive applications
Chen, Xu
Wang, Jun
ARTIFICIAL INTELLIGENCE, 2022, 312
[23] Dynamic Parallel Machine Scheduling With Deep Q-Network
Liu, Chien-Liang
Tseng, Chun-Jan
Huang, Tzu-Hsuan
Wang, Jhih-Wun
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (11): : 6792 - 6804
[24] Multiagent Learning and Coordination with Clustered Deep Q-Network
Pageaud, Simon
Deslandres, Veronique
Lehoux, Vassilissa
Hassas, Salima
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2156 - 2158
[25] Deep Reinforcement Learning. Case Study: Deep Q-Network
Vrejoiu, Mihnea Horia
ROMANIAN JOURNAL OF INFORMATION TECHNOLOGY AND AUTOMATIC CONTROL-REVISTA ROMANA DE INFORMATICA SI AUTOMATICA, 2019, 29 (03): : 65 - 78
[26] Deep Reinforcement Learning Pairs Trading with a Double Deep Q-Network
Brim, Andrew
2020 10TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2020, : 222 - 227
[27] FPGA Trax Solver based on a Neural Network Design
Fujimori, Takumi
Akabe, Tomoya
Ito, Yoshizumi
Akagi, Kouta
Furukawa, Shinya
Shinba, Hiroki
Tanibata, Aoi
Watanabe, Minoru
2015 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (FPT), 2015, : 260 - 263
[28] Deep Q-Network for Radar Task-Scheduling Problem
George, Taylor
Wagner, Kevin
Rademacher, Paul
2022 IEEE RADAR CONFERENCE (RADARCONF'22), 2022,
[29] Hierarchical Deep Q-Network from imperfect demonstrations in Minecraft
Skrynnik, Alexey
Staroverov, Aleksey
Aitygulov, Ermek
Aksenov, Kirill
Davydov, Vasilii
Panov, Aleksandr I.
COGNITIVE SYSTEMS RESEARCH, 2021, 65 : 74 - 78
[30] Double Deep Q-Network by Fusing Contrastive Predictive Coding
Liu, Jianfeng
Pu, Jiexin
Sun, Lifan
Computer Engineering and Applications, 2023, 59 (06) : 162 - 170

← 1 2 3 4 5 →