An efficient hardware algorithm for parallel prefix computation with resource constraints

被引:0
|
作者
Park, JH [1 ]
机构
[1] SUNY New Paltz, Dept Math & Comp Sci, New Paltz, NY 12561 USA
关键词
parallel prefix; dataflow; VLSI; stream; special purpose hardware; PRAM; systolic array;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an efficient parallel hardware algorithm far the prefix computation Since the proposed scheme is based on dataflow, it does not require any preprocessing time or memory to store the data to accomplish the task, and it is suitable for the VLSI implementation. A linear systolic array architecture with simple basic cells is presented. To control the degree of the parallelism, the design uses multiple sub-streams for input and output. The design receives multiple input streams of elements in parallel, and produces output streams in parallel. Since the degree of the parallelism is controllable, the design has a great advantage when we consider the resource constraints of the system. The time complexity of the design is O(d + (N-d) / d) where d and N are the parallelism degree and the stream size respectively. When the stream size is very big, the initial trigger time d in the time complexity can be ignored and we get O(N/d). In the case with enough resources, the optimal degree of the parallelism is found at N-1/2. The proposed design is able to work on infinite length input elements.
引用
收藏
页码:1545 / 1550
页数:6
相关论文
共 50 条
  • [41] Improved HW/SW Partitioning Algorithm on Efficient Use of Hardware Resource
    Wang, Hao
    Zhang, Hongquan
    2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 2, 2010, : 682 - 685
  • [42] Parallel prefix computation on extended multi-mesh network
    Jana, PK
    Naidu, BD
    Kumar, S
    Arora, M
    Sinha, BP
    INFORMATION PROCESSING LETTERS, 2002, 84 (06) : 295 - 303
  • [43] Matrix exponentials and parallel prefix computation in a quantum control problem
    Auckenthaler, T.
    Bader, M.
    Huckle, T.
    Spoerl, A.
    Waldherr, K.
    PARALLEL COMPUTING, 2010, 36 (5-6) : 359 - 369
  • [44] FAST COMPUTATION OF OPTIMAL PATHS USING A PARALLEL DIJKSTRA ALGORITHM WITH EMBEDDED CONSTRAINTS
    SOLKA, JL
    PERRY, JC
    POELLINGER, BR
    ROGERS, GW
    NEUROCOMPUTING, 1995, 8 (02) : 195 - 212
  • [45] An Efficient Computation Algorithm for Time Optimal Trajectory Planning With Physical Constraints
    Yuan, Mingxing
    Yao, Bin
    Zhu, Xiaocong
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 3100 - 3105
  • [46] An Efficient Algorithm for Resource Allocation in Parallel and Distributed Computing Systems
    El-Zoghdy, S. F.
    Nofal, M.
    Shohla, M. A.
    El-sawy, A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (02) : 251 - 259
  • [47] An Efficient High-Speed CORDIC Algorithm Using Parallel-Prefix Adders (PPA)
    Venkatesh, Vutukuri
    Yeswanth, Balaji
    Akhil, Repala
    Jatoth, Ravi Kumar
    ADVANCES IN VLSI, COMMUNICATION, AND SIGNAL PROCESSING, 2020, 587 : 805 - 813
  • [48] An efficient computation offloading and resource allocation algorithm in RIS empowered MEC?
    Zhang, Xiangjun
    Wu, Weiguo
    Liu, Song
    Wang, Jinyu
    COMPUTER COMMUNICATIONS, 2023, 197 : 113 - 123
  • [49] Resource Constraints on Computation and Communication in the Brain
    Varma, Sashank
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 611 - 616
  • [50] Hardware efficient architectures for eigenvalue computation
    Liu, Yang
    Bouganis, Christos-Savvas
    Cheung, Peter Y. K.
    Leong, Philip H. W.
    Motley, Stephen J.
    2006 DESIGN AUTOMATION AND TEST IN EUROPE, VOLS 1-3, PROCEEDINGS, 2006, : 951 - +