FFTS IN EXTERNAL OR HIERARCHICAL MEMORY

被引:147
作者
BAILEY, DH [1 ]
机构
[1] NASA,AMES RES CTR,DIV NUMER AERODYNAM SIMULAT SYST,MOFFETT FIELD,CA 94035
关键词
D O I
10.1007/BF00162341
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Conventional algorithms for computing large one-dimensional fast Fourier transforms (FFTs), even those algorithms recently developed for vector and parallel computers, are largely unsuitable for systems with external or hierarchical memory. The principal reason for this is the fact that most FFT algorithms require at least m complete passes through the data set to compute a 2m-point FFT. This paper describes some advanced techniques for computing an ordered FFT on a computer with external or hierarchical memory. These algorithms (1) require as few as two passes through the external data set, (2) employ strictly unit stride, long vector transfers between main memory and external storage, (3) require only a modest amount of scratch space in main memory, and (4) are well suited for vector and parallel computation. Performance figures are included for implementations of some of these algorithms on Cray supercomputers. Of interest is the fact that a main memory version outperforms the current Cray library FFT routines on the CRAY-2, the CRAY X-MP, and the CRAY Y-MP systems. Using all eight processors on the CRAY Y-MP, this main memory routine runs at nearly two gigaflops. © 1990 Kluwer Academic Publishers.
引用
收藏
页码:23 / 35
页数:13
相关论文
共 10 条
[1]   FOURIER-TRANSFORM AND CONVOLUTION SUBROUTINES FOR THE IBM-3090 VECTOR FACILITY [J].
AGARWAL, RC ;
COOLEY, JW .
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1986, 30 (02) :145-162
[2]   THE INPUT OUTPUT COMPLEXITY OF SORTING AND RELATED PROBLEMS [J].
AGGARWAL, A ;
VITTER, JS .
COMMUNICATIONS OF THE ACM, 1988, 31 (09) :1116-1127
[3]  
Armstrong J., 1988, Journal of Supercomputing, V2, P415, DOI 10.1007/BF00156677
[4]   A SEGMENTED FFT ALGORITHM FOR VECTOR COMPUTERS [J].
ASHWORTH, M ;
LYNE, AG .
PARALLEL COMPUTING, 1988, 6 (02) :217-224
[5]  
Bailey D. H., 1987, Journal of Supercomputing, V1, P43, DOI 10.1007/BF00138605
[6]   A HIGH-PERFORMANCE FFT ALGORITHM FOR VECTOR SUPERCOMPUTERS [J].
BAILEY, DH .
INTERNATIONAL JOURNAL OF SUPERCOMPUTER APPLICATIONS AND HIGH PERFORMANCE COMPUTING, 1988, 2 (01) :82-87
[7]   ARRAY PERMUTATION BY INDEX-DIGIT PERMUTATION [J].
FRASER, D .
JOURNAL OF THE ACM, 1976, 23 (02) :298-309
[8]  
GENTLEMAN WM, 1966, AFIPS P, V29, P563
[9]   MULTIPROCESSOR FFTS [J].
SWARZTRAUBER, PN .
PARALLEL COMPUTING, 1987, 5 (1-2) :197-210
[10]   FFT ALGORITHMS FOR VECTOR COMPUTERS [J].
SWARZTRAUBER, PN .
PARALLEL COMPUTING, 1984, 1 (01) :45-63