An efficient hardware algorithm for parallel prefix computation with resource constraints

被引：0

作者：

Park, JH ^{[1
]}

机构：

[1] SUNY New Paltz, Dept Math & Comp Sci, New Paltz, NY 12561 USA

来源：

PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V | 2000年

关键词：

parallel prefix; dataflow; VLSI; stream; special purpose hardware; PRAM; systolic array;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents an efficient parallel hardware algorithm far the prefix computation Since the proposed scheme is based on dataflow, it does not require any preprocessing time or memory to store the data to accomplish the task, and it is suitable for the VLSI implementation. A linear systolic array architecture with simple basic cells is presented. To control the degree of the parallelism, the design uses multiple sub-streams for input and output. The design receives multiple input streams of elements in parallel, and produces output streams in parallel. Since the degree of the parallelism is controllable, the design has a great advantage when we consider the resource constraints of the system. The time complexity of the design is O(d + (N-d) / d) where d and N are the parallelism degree and the stream size respectively. When the stream size is very big, the initial trigger time d in the time complexity can be ignored and we get O(N/d). In the case with enough resources, the optimal degree of the parallelism is found at N-1/2. The proposed design is able to work on infinite length input elements.

引用

页码：1545 / 1550

页数：6

共 50 条

[21] Efficient parallel prefix algorithms on multicomputers
Lin, YC
Lin, CM
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2000, 16 (01) : 41 - 64
[22] An Improved Parallel Prefix Sums Algorithm
Bahig, Hazem M.
Fathy, Khaled A.
PARALLEL PROCESSING LETTERS, 2022, 32 (03N04)
[23] Parallel Dynamics Computation Using Prefix Sum Operations
Yang, Yajue
Wu, Yuanqing
Pan, Jia
IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (03): : 1296 - 1303
[24] OPTIMAL SCHEDULES FOR PARALLEL PREFIX COMPUTATION WITH BOUNDED RESOURCES
NICOLAU, A
WANG, H
SIGPLAN NOTICES, 1991, 26 (07): : 1 - 10
[25] Parallel Prefix Computation in the Recursive Dual-Net
Li, Yamin
Peng, Shietung
Chu, Wanming
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PT 1, PROCEEDINGS, 2010, 6081 : 54 - +
[26] Parallel Prefix Sum Computation on Multi Mesh of Trees
Rakesh, Nitin
Nitin, D.
2009 ANNUAL IEEE INDIA CONFERENCE (INDICON 2009), 2009, : 533 - 536
[27] An efficient hardware implementation of parallel EBCOT algorithm for JPEG 2000
Saidani, Taoufik
Atri, Mohamed
Khriji, Lazhar
Tourki, Rached
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2016, 11 (01) : 63 - 74
[28] An efficient hardware implementation of parallel EBCOT algorithm for JPEG 2000
Taoufik Saidani
Mohamed Atri
Lazhar Khriji
Rached Tourki
Journal of Real-Time Image Processing, 2016, 11 : 63 - 74
[29] An efficient crossover architecture for hardware parallel implementation of genetic algorithm
Faraji, Rasoul
Naji, Hamid Reza
NEUROCOMPUTING, 2014, 128 : 316 - 327
[30] Resource Efficient Hardware Architecture for Fast Computation of Running Max/Min Filters
Torres-Huitzil, Cesar
SCIENTIFIC WORLD JOURNAL, 2013,

← 1 2 3 4 5 →