共 50 条
- [41] High Performance Matrix Multiplication on Many Cores EURO-PAR 2009: PARALLEL PROCESSING, PROCEEDINGS, 2009, 5704 : 948 - 959
- [42] High-Performance Tensor Contractions for GPUs INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 108 - 118
- [43] EXPLOITING FAST MATRIX MULTIPLICATION WITHIN THE LEVEL 3-BLAS ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1990, 16 (04): : 352 - 368
- [45] High-performance and Memory-saving Sparse General Matrix-Matrix Multiplication for NVIDIA Pascal GPU 2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2017, : 101 - 110
- [46] High-Performance Modular Multiplication on the Cell Processor ARITHMETIC OF FINITE FIELDS, PROCEEDINGS, 2010, 6087 : 7 - 24
- [47] Design Fast Matrix Algorithms on High-Performance Cloud Platforms 2012 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2012,
- [48] On the Performance Prediction of BLAS-based Tensor Contractions HIGH PERFORMANCE COMPUTING SYSTEMS: PERFORMANCE MODELING, BENCHMARKING, AND SIMULATION, 2015, 8966 : 193 - 212
- [49] HIGH PERFORMANCE REARRANGEMENT AND MULTIPLICATION ROUTINES FOR SPARSE TENSOR ARITHMETIC SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2018, 40 (02): : C258 - C281