An improved architecture for bit-level matrix multiplication

被引:0
|
作者
Grover, RS [1 ]
Shang, WJ [1 ]
Li, Q [1 ]
机构
[1] Santa Clara Univ, Dept Comp Engn, Santa Clara, CA 95053 USA
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V | 2000年
关键词
bit-level matrix multiplication; FPGA array; mapping algorithms to hardware; reconfigurable computing;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We present a novel bit-level architecture where each processing element does a simple operation of adding three to six bits to generate one partial sum bit and one to two carryout bits. We gain speedup over word-level because individual bits of a word do not have to be processed as a unit in a bit-level architecture. In [1], two bit-level architectures for fixed point matrix multiplication are proposed that are O(log p) times faster than the fastest word-level architecture where p is the word length. The architecture presented in this paper is even faster than the two in [1] by breaking the critical path in the dependence graph into half: We show basic ideas of how to gain speedup in our design, how to establish the dependence structure and how to derive the final design. We also show our design is time optimal for our dependence structure and has a speedup of 50% or more over the designs presented in [1]. We are implementing the design on a Xilinx FPGA chip, which shows a potential speedup over Xilinx multiplier macro. Our approach can be used to map algorithms to hardware.
引用
收藏
页码:2257 / 2264
页数:8
相关论文
共 50 条
  • [41] A BIT-LEVEL SYSTOLIC IMPLEMENTATION OF THE MEDIAN FILTER
    HU, Z
    KING, GA
    MICROPROCESSORS AND MICROSYSTEMS, 1995, 19 (04) : 185 - 186
  • [42] On Bit-Level Decoding of Nonbinary LDPC Codes
    Zhang, Mu
    Cai, Kui
    Huang, Qin
    Yuan, Shuai
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2018, 66 (09) : 3736 - 3748
  • [43] A BIT-LEVEL SYSTOLIC ARRAY FOR MEDIAN FILTER
    CHANG, LW
    LIN, JH
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (08) : 2079 - 2083
  • [44] 4-Bit Serial-Parallel Multiplier and Bit-Level Systolic Architecture for Implementation of Discrete Orthogonal Transforms
    Murty, M. N.
    Nayak, S. S.
    Padhy, B.
    Panda, S. N.
    GLOBAL TRENDS IN COMPUTING AND COMMUNICATION SYSTEMS, PT 1, 2012, 269 : 91 - +
  • [45] ON THE SPECIFIC EXPRESSION OF BIT-LEVEL ARITHMETIC CODING
    赵风光
    蒋尔雄
    倪兴芳
    "Numerical Mathematics A Journal of Chinese Universities(English Series) N", 1998, (02) : 211 - 220
  • [46] Effect of Bit-Level Correlation In Stochastic Computing
    Parhi, Megha
    Riedel, Marc D.
    Parhi, Keshab K.
    2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2015, : 463 - 467
  • [47] Bit-Level Probabilistically Shaped Coded Modulation
    Pikus, Marcin
    Xu, Wen
    IEEE COMMUNICATIONS LETTERS, 2017, 21 (09) : 1929 - 1932
  • [48] Bit Matrix Multiplication in Commodity Processors
    Hilewitz, Yedidya
    Lauradoux, Cedric
    Lee, Ruby B.
    2008 INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, 2008, : 7 - 12
  • [49] A PROGRAMMABLE FINITE RING BIT-LEVEL SYSTOLIC CELL
    JULLIEN, GA
    ERICKSON, B
    MILLER, WC
    SYSTOLIC ARRAY PROCESSORS, 1989, : 225 - 234
  • [50] An Application to ensure Security through Bit-level Encryption
    Ghosh, Mrinmoy
    Paul, Pranam
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (11): : 228 - 233