共 40 条
- [1] Cipra BA., The best of the 20th century: Editors name top 10 algorithms, SIAM News, 33, 4, pp. 1-2, (2000)
- [2] Luszczek P, Dongarra JJ, Koester D, Et al., Introduction to the HPC challenge benchmark suite, Office of Scientific & Technical Information Technical Reports, (2005)
- [3] Fu H, Liao J, Yang J, Et al., The Sunway TaihuLight supercomputer: System and applications, Science China Information Sciences, 59, 7, (2016)
- [4] Frigo M, Johnson SG., The design and implementation of FFTW3, Proc. of the IEEE, 93, 2, pp. 216-231, (2005)
- [5] Frigo M, Johnson SG., FFTW: An adaptive software architecture for the FFT, Proc. of the IEEE Int'l Conf. on Acoustics, Speech and Signal Processing, pp. 1381-1384, (2002)
- [6] Ali A, Johnsson L, Subhlok J., Scheduling FFT computation on SMP and multicore systems, Proc. of the Int'l Conf. on Supercomputing, ICS 2007, pp. 293-301, (2007)
- [7] Puschel M, Moura JMF, Johnson JR, Et al., SPIRAL: Code generation for DSP transforms, Proc. of the IEEE, 93, 2, pp. 232-275, (2005)
- [8] Pekurovsky D., P3DFFT: A framework for parallel computations of Fourier transforms in three dimensions, SIAM Journal on Scientific Computing, 34, 4, pp. C192-C209, (2012)
- [9] Ayala O, Wang LP., Parallel implementation and scalability analysis of 3D fast Fourier transform using 2D domain decomposition, Parallel Computing, 39, 1, pp. 58-77, (2013)
- [10] Pippig M., PFFT: An extension of FFTW to massively parallel architectures, SIAM Journal on Scientific Computing, 35, 3, pp. C213-C236, (2013)