共 50 条
- [1] CPU-side High Performance BLAS Library Optimization in Heterogeneous HPL Algorithm Ruan Jian Xue Bao/Journal of Software, 2021, 32 (08): : 2289 - 2306
- [2] Toward a BLAS library truly portable across different accelerator types The Journal of Supercomputing, 2019, 75 : 7101 - 7124
- [3] Toward a BLAS library truly portable across different accelerator types JOURNAL OF SUPERCOMPUTING, 2019, 75 (11): : 7101 - 7124
- [5] CLBlast: A Tuned OpenCL BLAS Library IWOCL'18: PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON OPENCL, 2018, : 22 - 31
- [6] PORTABLE PARALLEL IMPLEMENTATION OF BLAS-3 CONCURRENCY-PRACTICE AND EXPERIENCE, 1994, 6 (05): : 411 - 459
- [7] FT-BLAS: A High Performance BLAS Implementation With Online Fault Tolerance PROCEEDINGS OF THE 2021 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ICS 2021, 2021, : 127 - 138
- [8] Superscalar GEMM-based level 3 BLAS -: The on-going evolution of a portable and high-performance library APPLIED PARALLEL COMPUTING: LARGE SCALE SCIENTIFIC AND INDUSTRIAL PROBLEMS, 1998, 1541 : 207 - 215