The vector floating-point unit in a synergistic processor element of a CELL processor

被引:29
|
作者
Mueller, SM
Jacobi, C
Oh, HJ
Tran, KD
Cottier, SR
Michael, BW
Nishikawa, H
Totsuka, Y
Namatame, T
Yano, N
Machida, T
Dhong, SH
机构
来源
17th IEEE Symposium on Computer Arithmetic, Proceedings | 2005年
关键词
D O I
10.1109/ARITH.2005.45
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The floating-point unit in the Synergistic Processor Element of the 1st generation multi-core CELL Processor is described. The FPU supports 4-way SIMD single precision and integer operations and 2-way SIMD double precision operations. The design required a high-frequency, low latency, power and area efficiency with primary application to the multimedia streaming workloads, such as 3D graphics. The FPU has 3 different latencies, optimizing the performance critical single precision FMA operations, which are executed with a 6-cycle latency at an 11FO4 cycle time. The latency includes the global forwarding of the result. These challenging performance, power and area goals were achieved through the co-design of architecture and implementation with optimizations at all levels of the design. This paper focuses on the logical and algorithmic aspects of the FPU we developed, to achieve these goals.
引用
收藏
页码:59 / 67
页数:9
相关论文
共 50 条
  • [32] Design of High Hardware Efficiency Approximate Floating-Point FFT Processor
    Yan, Chenggang
    Zhao, Xuan
    Zhang, Tingting
    Ge, Jipeng
    Wang, Chenghua
    Liu, Weiqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (11) : 4283 - 4294
  • [34] FLOATING-POINT PROCESSOR SERVES COPROCESSOR AND DATA-PATH MACHINES
    WILSON, R
    COMPUTER DESIGN, 1988, 27 (06): : 31 - 32
  • [35] 80-MHZ RISC PROCESSOR SCREAMS THROUGH FLOATING-POINT
    WILSON, D
    COMPUTER DESIGN, 1991, 30 (08): : 40 - +
  • [36] FLOATING-POINT SIGNAL PROCESSOR PACKS HIGH-LEVEL INSTRUCTIONS
    BURSKY, D
    ELECTRONIC DESIGN, 1987, 35 (22) : 55 - &
  • [37] ARCHITECTURE OF A FLOATING-POINT BUTTERFLY EXECUTION UNIT IN A 400-MFLOPS PROCESSOR VLSI AND ITS IMPLEMENTATION
    YAMAUCHI, H
    MIYANAGA, H
    IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (11): : 3852 - 3860
  • [38] A CMOS FLOATING-POINT VECTOR-ARITHMETIC UNIT
    TIMMERMANN, D
    RIX, B
    HAHN, H
    HOSTICKA, BJ
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1994, 29 (05) : 634 - 639
  • [39] The microarchitecture of the synergistic processor for a cell processor
    Flachs, B
    Asano, S
    Dhong, SH
    Hofstee, HP
    Gervais, G
    Kim, R
    Le, T
    Liu, PC
    Leenstra, J
    Liberty, J
    Michael, B
    Oh, HJ
    Mueller, SM
    Takahashi, O
    Hatakeyama, A
    Watanabe, Y
    Yano, N
    Brokenshire, DA
    Peyravian, M
    To, V
    Iwata, E
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2006, 41 (01) : 63 - 70
  • [40] AN INTEGRATED FLOATING POINT VECTOR PROCESSOR FOR DSP AND SCIENTIFIC COMPUTING
    SPADERNA, D
    GREEN, P
    TAM, K
    DATTA, T
    KUMAR, M
    PROCEEDINGS - IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN : VLSI IN COMPUTERS & PROCESSORS, 1989, : 8 - 13