An efficient hardware algorithm for parallel prefix computation with resource constraints

被引:0
|
作者
Park, JH [1 ]
机构
[1] SUNY New Paltz, Dept Math & Comp Sci, New Paltz, NY 12561 USA
关键词
parallel prefix; dataflow; VLSI; stream; special purpose hardware; PRAM; systolic array;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an efficient parallel hardware algorithm far the prefix computation Since the proposed scheme is based on dataflow, it does not require any preprocessing time or memory to store the data to accomplish the task, and it is suitable for the VLSI implementation. A linear systolic array architecture with simple basic cells is presented. To control the degree of the parallelism, the design uses multiple sub-streams for input and output. The design receives multiple input streams of elements in parallel, and produces output streams in parallel. Since the degree of the parallelism is controllable, the design has a great advantage when we consider the resource constraints of the system. The time complexity of the design is O(d + (N-d) / d) where d and N are the parallelism degree and the stream size respectively. When the stream size is very big, the initial trigger time d in the time complexity can be ignored and we get O(N/d). In the case with enough resources, the optimal degree of the parallelism is found at N-1/2. The proposed design is able to work on infinite length input elements.
引用
收藏
页码:1545 / 1550
页数:6
相关论文
共 50 条
  • [21] Efficient parallel prefix algorithms on multicomputers
    Lin, YC
    Lin, CM
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2000, 16 (01) : 41 - 64
  • [22] An Improved Parallel Prefix Sums Algorithm
    Bahig, Hazem M.
    Fathy, Khaled A.
    PARALLEL PROCESSING LETTERS, 2022, 32 (03N04)
  • [23] Parallel Dynamics Computation Using Prefix Sum Operations
    Yang, Yajue
    Wu, Yuanqing
    Pan, Jia
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (03): : 1296 - 1303
  • [24] OPTIMAL SCHEDULES FOR PARALLEL PREFIX COMPUTATION WITH BOUNDED RESOURCES
    NICOLAU, A
    WANG, H
    SIGPLAN NOTICES, 1991, 26 (07): : 1 - 10
  • [25] Parallel Prefix Computation in the Recursive Dual-Net
    Li, Yamin
    Peng, Shietung
    Chu, Wanming
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PT 1, PROCEEDINGS, 2010, 6081 : 54 - +
  • [26] Parallel Prefix Sum Computation on Multi Mesh of Trees
    Rakesh, Nitin
    Nitin, D.
    2009 ANNUAL IEEE INDIA CONFERENCE (INDICON 2009), 2009, : 533 - 536
  • [27] An efficient hardware implementation of parallel EBCOT algorithm for JPEG 2000
    Saidani, Taoufik
    Atri, Mohamed
    Khriji, Lazhar
    Tourki, Rached
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2016, 11 (01) : 63 - 74
  • [28] An efficient hardware implementation of parallel EBCOT algorithm for JPEG 2000
    Taoufik Saidani
    Mohamed Atri
    Lazhar Khriji
    Rached Tourki
    Journal of Real-Time Image Processing, 2016, 11 : 63 - 74
  • [29] An efficient crossover architecture for hardware parallel implementation of genetic algorithm
    Faraji, Rasoul
    Naji, Hamid Reza
    NEUROCOMPUTING, 2014, 128 : 316 - 327
  • [30] Resource Efficient Hardware Architecture for Fast Computation of Running Max/Min Filters
    Torres-Huitzil, Cesar
    SCIENTIFIC WORLD JOURNAL, 2013,