50 records in total
- [2] Dense and Sparse Matrix-Vector Multiplication on Maxwell GPUs with PyCUDA. HIGH PERFORMANCE COMPUTING CARLA 2016, 2017, 697: 219-229
- [3] Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs. EURO-PAR 2020: PARALLEL PROCESSING WORKSHOPS, 2021, 12480: 83-95
- [4] Efficient Sparse-Dense Matrix-Matrix Multiplication on GPUs Using the Customized Sparse Storage Format. 2020 IEEE 26TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2020: 19-26
- [5] TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs. PROCEEDINGS OF THE 2023 USENIX ANNUAL TECHNICAL CONFERENCE, 2023: 149-164
- [6] Characterization of data movement requirements for sparse matrix computations on GPUs. 2017 IEEE 24TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2017: 283-293
- [7] A Sparse Tensor Benchmark Suite for CPUs and GPUs. 2020 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2020), 2020: 193-204
- [8] Automatic Data Layout Optimizations for GPUs. EURO-PAR 2015: PARALLEL PROCESSING, 2015, 9233: 263-274
- [9] A Unified Optimization Approach for Sparse Tensor Operations on GPUs. 2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017: 47-57
- [10] A Parallel Sparse Tensor Benchmark Suite on CPUs and GPUs. PROCEEDINGS OF THE 25TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING (PPOPP '20), 2020: 403-404