- [31] An Input-Adaptive and In-Place Approach to Dense Tensor-Times-Matrix Multiply. PROCEEDINGS OF SC15: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2015
- [32] "Wide or Tall" and "Sparse Matrix Dense Matrix" Multiplications. HIGH PERFORMANCE COMPUTING SYMPOSIUM 2011 (HPC 2011) - 2011 SPRING SIMULATION MULTICONFERENCE - BK 6 OF 8, 2011, 43 (02): 159-165
- [34] Predicting optimal sparse general matrix-matrix multiplication algorithm on GPUs. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2024, 38 (03): 245-259
- [35] Advancing on an efficient sparse matrix multiplication kernel for modern GPUs. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (20)
- [36] RDMA-Based Algorithms for Sparse Matrix Multiplication on GPUs. PROCEEDINGS OF THE 38TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ACM ICS 2024, 2024: 225-235
- [37] Optimizing Sparse Matrix Operations on GPUs using Merge Path. 2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015: 407-416
- [39] A new approach for sparse matrix vector product on NVIDIA GPUs. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2011, 23 (08): 815-826
- [40] On Implementing Sparse Matrix Multi-Vector Multiplication on GPUs. 2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014: 1117-1124