New method for high performance multiply-accumulator design

Cited by: 10
Authors
Xia, Bing-jie [1]
Liu, Peng [1]
Yao, Qing-dong [1]
Affiliations
[1] Zhejiang Univ, Dept Informat Sci & Elect Engn, Hangzhou 310027, Zhejiang, Peoples R China
Source
JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A | 2009 / Vol. 10 / No. 07
Funding
National Natural Science Foundation of China;
Keywords
Multiply-accumulator (MAC); Pipeline; Compressor; Partial product reduction tree (PPRT); Split structure; LOW-POWER; ADDER; ARCHITECTURE; TREE;
DOI
10.1631/jzus.A0820566
Chinese Library Classification
T [Industrial Technology];
Discipline classification code
08;
Abstract
This study presents a new 4-pipelined, high-performance split multiply-accumulator (MAC) architecture that supports multiple precisions and was developed for media processors. To speed up the design further, a novel partial product compression circuit based on interleaved adders and a modified hybrid partial product reduction tree (PPRT) scheme are proposed. The MAC can perform 1-way 32-bit or 4-way 16-bit signed/unsigned multiply or multiply-accumulate operations, as well as 2-way parallel multiply-add (PMADD) operations, at 1.25 GHz under worst-case conditions and 1.67 GHz under typical-case conditions. Compared with the MAC in a 32-bit MIPS (microprocessor without interlocked pipeline stages) processor, the proposed design shows a great advantage in speed. Moreover, an improvement of up to 32% in throughput is achieved. The MAC design has been fabricated in Taiwan Semiconductor Manufacturing Company (TSMC) 90-nm CMOS standard cell technology and has passed a functional test.
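The general idea behind a split multiplier and a compressor-based reduction tree can be illustrated with a small behavioral sketch. This is not the circuit from the paper: the function names (`split16`, `compress_3_2`, `mul32_from_16`, `mul16_lanes`, `mac`) are hypothetical, only unsigned operands are handled, and a plain two-level 3:2 (carry-save) reduction stands in for the paper's interleaved-adder compression circuit and hybrid PPRT, whose details the abstract does not give.

```python
MASK16 = 0xFFFF


def split16(x):
    """Split an unsigned 32-bit value into (high, low) 16-bit halves."""
    return (x >> 16) & MASK16, x & MASK16


def compress_3_2(a, b, c):
    """Bitwise 3:2 compressor (carry-save adder): a + b + c == s + carry."""
    s = a ^ b ^ c
    carry = ((a & b) | (a & c) | (b & c)) << 1
    return s, carry


def mul32_from_16(a, b):
    """Unsigned 32x32 multiply assembled from four 16x16 sub-products."""
    ah, al = split16(a)
    bh, bl = split16(b)
    # Four 16x16 sub-products, each shifted to its binary weight.
    rows = [al * bl, (al * bh) << 16, (ah * bl) << 16, (ah * bh) << 32]
    # Reduce four rows to two with two levels of 3:2 compression.
    s1, c1 = compress_3_2(rows[0], rows[1], rows[2])
    s2, c2 = compress_3_2(s1, c1, rows[3])
    # A single carry-propagate addition produces the final product.
    return s2 + c2


def mul16_lanes(a_lanes, b_lanes):
    """Narrow mode (assumed lane layout): independent unsigned 16x16 products."""
    return [(x & MASK16) * (y & MASK16) for x, y in zip(a_lanes, b_lanes)]


def mac(acc, a, b):
    """Multiply-accumulate: acc + a * b, using the split multiplier above."""
    return acc + mul32_from_16(a, b)
```

The point of the sketch is that the same four 16x16 sub-multipliers can serve either as independent lanes or, with shifted weights and a reduction step, as one 32x32 multiplier; how the paper maps its 4-way and PMADD modes onto hardware is not stated in the abstract.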
Pages: 1067-1074
Page count: 8
Related papers
50 in total
  • [21] Multiply-Accumulator Using Modified Booth Encoders Designed for Application in 16-bit RISC Processor
    He Jing-Yu
    Li Li-Li
    Zhu Yan-Chao
    Yang Wen-Tao
    Yang Jian-Hong
    2013 2ND INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION AND MEASUREMENT, SENSOR NETWORK AND AUTOMATION (IMSNA), 2013, : 416 - 419
  • [22] FPGA Implementation of a Single-Precision Floating-Point Multiply-Accumulator with Single-Cycle Accumulation
    Paidimarri, Arun
    Cevrero, Alessandro
    Brisk, Philip
    Ienne, Paolo
    PROCEEDINGS OF THE 2009 17TH IEEE SYMPOSIUM ON FIELD PROGRAMMABLE CUSTOM COMPUTING MACHINES, 2009, : 267 - +
  • [23] Time-Domain Multiply-Accumulator using Digital-to-Time Multiplier for CNN Processors in 28-nm CMOS
    Wu, Xutong
    Siriburanon, Teerachot
    Staszewski, Robert Bogdan
    2020 31ST IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2020, : 177 - 180
  • [24] A novel multiply multiple accumulator component for low power PDSP design
    Sundararajan, V
    Parhi, KK
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3247 - 3250
  • [25] New design of strip accumulator
    Tyrtov, A.S.
    Baranov, V.N.
    Volkov, V.M.
    Metallurg, 2000, (03):
  • [26] Design of High Performance Multiply-Accumulate Computation Unit
    Ahish, S.
    Kumar, Y. B. N.
    Sharma, Dheeraj
    Vasantha, M. H.
    2015 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2015, : 915 - 918
  • [27] A 90mW/GFlop 3.4GHz Reconfigurable Fused/Continuous Multiply-Accumulator for Floating-point and Integer Operands in 65nm
    Jain, Shailendra
    Erraguntla, Vasantha
    Vangal, Sriram R.
    Hoskote, Yatin
    Borkar, Nitin
    Mandepudi, Tulasi
    Karthik, V. P.
    23RD INTERNATIONAL CONFERENCE ON VLSI DESIGN, 2010, : 252 - +
  • [28] NEW SOURCE OF HIGH-PERFORMANCE MULTIPLY CHARGED IONS
    BRIAND, P
    GELLER, R
    JACQUOT, B
    JACQUOT, C
    NUCLEAR INSTRUMENTS & METHODS, 1975, 131 (03): : 407 - 409
  • [29] A HIGH-PERFORMANCE BICMOS SOG WITH NEW DESIGN METHOD SGA
    KUMAGAI, K
    TAGAMI, Y
    AKAGI, K
    NAKAZATO, H
    MINE, H
    TSUTSUI, H
    MASUDA, H
    NEC RESEARCH & DEVELOPMENT, 1991, 32 (03): : 323 - 331
  • [30] DESIGN AND METHOD OF CALCULATION OF AN INDUCTIVE ACCUMULATOR FOR LASER PUMPING
    ARTAMONOV, II
    BARIKHIN, BA
    BOROVKOV, VV
    KASHINTSOV, VI
    KVANTOVAYA ELEKTRONIKA, 1979, 6 (01): : 127 - 132