MapZero: Mapping for Coarse-grained Reconfigurable Architectures with Reinforcement Learning and Monte-Carlo Tree Search

被引:6
|
作者
Kong, Xiangyu [1 ]
Huang, Yi [1 ]
Zhu, Jianfeng [1 ]
Man, Xingchen [1 ]
Liu, Yang [2 ]
Feng, Chunyang [2 ]
Gou, Pengfei [3 ]
Tang, Minggui [3 ]
Wei, Shaojun [1 ]
Liu, Leibo [1 ]
机构
[1] Tsinghua Univ, BNRist, Sch Integrated Circuits, Beijing, Peoples R China
[2] GBA, Innovat Inst High Performance Server, Guangzhou, Guangdong, Peoples R China
[3] HEXIN Technol Co Ltd, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Coarse-Grained Reconfigurable Architecture; Compiler; Graph Neural Network; Reinforcement Learning; DATA-FLOW GRAPH; CGRA; ALGORITHM; FRAMEWORK; SHOGI; CHESS; GO;
D O I
10.1145/3579371.3589081
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Coarse-grained reconfigurable architecture (CGRA) has become a promising candidate for data-intensive computing due to its flexibility and high energy efficiency. CGRA compilers map data flow graphs (DFGs) extracted from applications onto CGRAs, playing a fundamental role in fully exploiting hardware resources for acceleration. Yet the existing compilers are time-demanding and cannot guarantee optimal results due to the traversal search of enormous search spaces brought about by the spatio-temporal flexibility of CGRA structures and the complexity of DFGs. Inspired by the amazing progress in reinforcement learning (RL) and Monte-Carlo tree search (MCTS) for real-world problems, we consider constructing a compiler that can learn from past experiences and comprehensively understand the target DFG and CGRA. In this paper, we propose an architecture-aware compiler for CGRAs based on RL and MCTS, called MapZero - a framework to automatically extract the characteristics of DFG and CGRA hardware and map operations onto varied CGRA fabrics. We apply Graph Attention Network to generate an adaptive embedding for DFGs and also model the functionality and interconnection status of the CGRA, aiming at training an RL agent to perform placement and routing intelligently. Experimental results show that MapZero can generate superior-quality mappings and reduce compilation time hundreds of times compared to state-of-the-art methods. MapZero can find high-quality mappings very quickly when the feasible solution space is rather small and all other compilers fail. We also demonstrate the scalability and broad applicability of our framework.
引用
收藏
页码:646 / 659
页数:14
相关论文
共 50 条
  • [31] REGIMap: Register-Aware Application Mapping on Coarse-Grained Reconfigurable Architectures (CGRAs)
    Hamzeh, Mahdi
    Shrivastava, Aviral
    Vrudhula, Sarma
    2013 50TH ACM / EDAC / IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2013,
  • [32] Automated Machine Learning with Monte-Carlo Tree Search
    Rakotoarison, Herilalaina
    Schoenauer, Marc
    Sebag, Michele
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3296 - 3303
  • [33] Still image processing on coarse-grained reconfigurable array architectures
    Hartmann, Matthias
    Pantazis, Vassilis
    Aa, Tom Vander
    Berekovic, Mladen
    Hochberger, Christian
    de Sutter, Bjorn
    2007 IEEE/ACM/IFIP WORKSHOP ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA, 2007, : 67 - +
  • [34] Selective Validations for Efficient Protections on Coarse-Grained Reconfigurable Architectures
    Kang, Jihoon
    Ko, Yohan
    Lee, Jongwon
    Kim, Yongjoo
    So, Hwisoo
    Lee, Kyoungwoo
    Paek, Yunheung
    PROCEEDINGS OF THE 2013 IEEE 24TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 13), 2013, : 95 - 98
  • [35] An Elastic Task Scheduling Scheme on Coarse-Grained Reconfigurable Architectures
    Chen, Longlong
    Zhu, Jianfeng
    Deng, Yangdong
    Li, Zhaoshi
    Chen, Jian
    Jiang, Xiaowei
    Yin, Shouyi
    Wei, Shaojun
    Liu, Leibo
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (12) : 3066 - 3080
  • [36] Efficient Execution of Stream Graphs on Coarse-Grained Reconfigurable Architectures
    Oh, Sangyun
    Lee, Hongsik
    Lee, Jongeun
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2017, 36 (12) : 1978 - 1988
  • [37] Improving Nested Loop Pipelining on Coarse-Grained Reconfigurable Architectures
    Yin, Shouyi
    Liu, Dajiang
    Peng, Yu
    Liu, Leibo
    Wei, Shaojun
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2016, 24 (02) : 507 - 520
  • [38] Still Image Processing on Coarse-Grained Reconfigurable Array Architectures
    Hartmann, Matthias
    Pantazis, Vasileios
    Vander Aa, Tom
    Berekovic, Mladen
    Hochberger, Christian
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2010, 60 (02): : 225 - 237
  • [39] Coarse-grained reconfigurable architectures for radio baseband processing: A survey
    Hassan, Zohaib
    Ometov, Aleksandr
    Lohan, Elena Simona
    Nurmi, Jari
    JOURNAL OF SYSTEMS ARCHITECTURE, 2024, 154
  • [40] A Data Prefetch and Reuse Strategy for Coarse-Grained Reconfigurable Architectures
    Ge, Wei
    Qi, Zhi
    Du, Yue
    Ma, Lu
    Shi, Longxing
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (03): : 616 - 623