High-efficiency Compressor Trees for Latest AMD FPGAs

Authors
Hossfeld, Konstantin J. [1]
Damsgaard, Hans Jakob [2]
Nurmi, Jari [3]
Blott, Michaela [4]
Preusser, Thomas B. [1]
Affiliations
[1] Regus Altmarkt, AMD, Altmarkt 10 b-d, D-01067 Dresden, Germany
[2] Tampere Univ, Tampere, Finland
[3] Tampere Univ, Fac Informat Technol & Commun Sci, Korkeakoulunkatu 1, Tampere 33720, Finland
[4] AMD, 2020 Bianconi Ave, Citywest Campus, Dublin D24 T683, Ireland
Funding
European Union Horizon 2020
Keywords
Compressor tree; matrix compression; parallel counters
DOI
10.1145/3645097
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
High-fan-in dot product computations are ubiquitous in highly relevant application domains such as signal processing and machine learning. In particular, the diverse set of data formats used in machine learning poses a challenge for flexible, efficient design solutions. Ideally, a dot product summation is composed of a carry-free compressor tree followed by a terminal carry-propagate addition. On FPGAs, these compressor trees are constructed from generalized parallel counters whose architecture is closely tied to the underlying reconfigurable fabric. This work reviews known counter designs and proposes new ones in the context of the new AMD Versal™ fabric. On this basis, we develop a compressor generator featuring variable-sized counters, novel counter composition heuristics, explicit clustering strategies, and case-specific optimizations like logic gate absorption. In comparison to the Vivado™ default implementation, the combination of such a compressor with a novel, highly efficient quaternary adder reduces the LUT footprint across different bit matrix input shapes by 45% for a plain summation and by 46% for a terminal accumulation, at a slight cost in critical path delay that still allows operation well above 500 MHz. We demonstrate the aptness of our solution using examples of low-precision integer dot product accumulation units.
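The structure described in the abstract, a carry-free compressor tree followed by one terminal carry-propagate addition, can be illustrated with a small software model. The sketch below is not the paper's generator: it assumes plain (3:2) counters (full adders) instead of the fabric-specific generalized parallel counters, and all function names (`to_bit_heap`, `compress`, `final_add`, `compressor_tree_sum`) are hypothetical names chosen for this illustration.

```python
from collections import defaultdict


def to_bit_heap(operands, width):
    """Spread non-negative integers into columns of bits, keyed by bit weight."""
    heap = defaultdict(list)
    for x in operands:
        for i in range(width):
            heap[i].append((x >> i) & 1)
    return heap


def compress(heap):
    """Apply (3:2) counters stage by stage until each column holds <= 2 bits.

    Assumption: only full adders are used; a real FPGA compressor would use
    larger generalized parallel counters matched to the LUT/carry fabric.
    """
    stages = 0
    while any(len(col) > 2 for col in heap.values()):
        nxt = defaultdict(list)
        for w, col in heap.items():
            i = 0
            while len(col) - i >= 3:
                a, b, c = col[i:i + 3]
                nxt[w].append(a ^ b ^ c)                          # sum bit, same weight
                nxt[w + 1].append((a & b) | (a & c) | (b & c))    # carry bit, next weight
                i += 3
            nxt[w].extend(col[i:])                                # leftover bits pass through
        heap = nxt
        stages += 1
    return heap, stages


def final_add(heap):
    """Terminal carry-propagate addition of the two remaining rows."""
    row0 = sum(col[0] << w for w, col in heap.items() if len(col) > 0)
    row1 = sum(col[1] << w for w, col in heap.items() if len(col) > 1)
    return row0 + row1


def compressor_tree_sum(operands, width):
    """Sum operands via carry-free compression plus one final addition."""
    heap, stages = compress(to_bit_heap(operands, width))
    return final_add(heap), stages
```

Within each stage no carry ripples across columns: every counter only emits one bit of the same weight and one bit of the next weight, so carry propagation is deferred to the single final addition. Calling `compressor_tree_sum([3, 5, 7, 9, 11], 4)` returns the ordinary sum together with the number of compression stages that were needed.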
Pages: 32