MiniFloats on RISC-V Cores: ISA Extensions With Mixed-Precision Short Dot Products

被引:2
|
作者
Bertaccini, Luca [1 ]
Paulin, Gianna [1 ]
Cavalcante, Matheus [1 ]
Fischer, Tim [1 ]
Mach, Stefan [2 ]
Benini, Luca [1 ,3 ]
机构
[1] Swiss Fed Inst Technol, Integrated Syst Lab IIS, CH-8092 Zurich, Switzerland
[2] Axelera AI, CH-8038 Zurich, Switzerland
[3] Univ Bologna, Dept Elect Elect & Informat Engn DEI, I-40126 Bologna, Italy
关键词
Training; Computer architecture; Hardware; Dynamic range; Artificial neural networks; Stochastic processes; Computational modeling; Transprecision computing; RISC-V; ISA extension; floating-point architectures; widening dot product; NN training; ARCHITECTURE; ALGORITHMS; ADD;
D O I
10.1109/TETC.2024.3365354
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Low-precision floating-point (FP) formats have recently been intensely investigated in the context of machine learning inference and training applications. While 16-bit formats are already widely used, 8-bit FP data types have lately emerged as a viable option for neural network training when employed in a mixed-precision scenario and combined with rounding methods increasing the precision in compound additions, such as stochastic rounding. So far, hardware implementations supporting FP8 are mostly implemented within domain-specific accelerators. We propose two RISC-V instruction set architecture (ISA) extensions, enhancing respectively scalar and vector general-purpose cores with low and mixed-precision capabilities. The extensions support two 8-bit and two 16-bit FP formats and are based on dot-product instructions accumulating at higher precision. We develop a hardware unit supporting mixed-precision dot products and stochastic rounding and integrate it into an open-source floating-point unit (FPU). Finally, we integrate the enhanced FPU into a cluster of scalar cores, as well as a cluster of vector cores, and implement them in a 12 nm FinFET technology. The former achieves 575 GFLOPS/W on FP8-to-FP16 matrix multiplications at 0.8 V, 1.26 GHz; the latter reaches 860 GFLOPS/W at 0.8 V, 1.08 GHz, 1.93x higher efficiency than computing on FP16-to-FP32.
引用
收藏
页码:1040 / 1055
页数:16
相关论文
共 18 条
  • [1] SLM ISA and Hardware Extensions for RISC-V Processors
    Ghasemi, S. Maryam
    Meschkov, Sergej
    Krautter, Jonas
    Gnad, Dennis R. E.
    Tahoori, Mehdi B.
    2023 IEEE 29TH INTERNATIONAL SYMPOSIUM ON ON-LINE TESTING AND ROBUST SYSTEM DESIGN, IOLTS, 2023,
  • [2] A Mixed-Precision RISC-V Processor for Extreme-Edge DNN Inference
    Ottavi, Gianmarco
    Garofalo, Angelo
    Tagliavini, Giuseppe
    Conti, Francesco
    Benini, Luca
    Rossi, Davide
    2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020), 2020, : 512 - 517
  • [3] CNN Specific ISA Extensions Based on RISC-V Processors
    Yu, Xiang
    Yang, Zhijie
    Peng, Linghui
    Lin, Bo
    Yang, Wenjing
    Wang, Lei
    2022 5TH INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS AND SIMULATION (ICCSS 2022), 2022, : 116 - 120
  • [4] Design and Evaluation of SmallFloat SIMD extensions to the RISC-V ISA
    Tagliavini, Giuseppe
    Mach, Stefan
    Rossi, Davide
    Marongiu, Andrea
    Benini, Luca
    2019 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2019, : 654 - 657
  • [5] Hardware Acceleration Method Using RISC-V Core with No ISA Extensions
    Wygrzywalski, Mateusz
    Skrzypiec, Pawel
    Szczygiel, Robert
    2024 31ST INTERNATIONAL CONFERENCE ON MIXED DESIGN OF INTEGRATED CIRCUITS AND SYSTEM, MIXDES 2024, 2024, : 265 - 269
  • [6] DBPS: Dynamic Block Size and Precision Scaling for Efficient DNN Training Supported by RISC-V ISA Extensions
    Lee, Seunghyun
    Choi, Jeik
    Noh, Seockhwan
    Koo, Jahyun
    Kung, Jaeha
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [7] Isa extensions for finite field arithmetic accelerating kyber and newhope on risc-v
    Alkim E.
    Evkan H.
    Lahr N.
    Niederhagen R.
    Petri R.
    1600, Ruhr-University of Bochum (2020): : 219 - 242
  • [8] XpulpNN: Accelerating Quantized Neural Networks on RISC-V Processors Through ISA Extensions
    Garofalo, Angelo
    Tagliavini, Giuseppe
    Conti, Francesco
    Rossi, Davide
    Benini, Luca
    PROCEEDINGS OF THE 2020 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2020), 2020, : 186 - 191
  • [9] SCAIE-V: An Open-Source SCAlable Interface for ISA Extensions for RISC-V Processors
    Damian, Mihaela
    Oppermann, Julian
    Spang, Christoph
    Koch, Andreas
    PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 169 - 174
  • [10] Xvpfloat: RISC-V ISA Extension for Variable Extended Precision Floating Point Computation
    Guthmuller, Eric
    Fuguet, Cesar
    Bocco, Andrea
    Fereyre, Jerome
    Alidori, Riccardo
    Tahir, Ihsane
    Durand, Yves
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (07) : 1683 - 1697