共 50 条
- [1] Understanding and Optimizing GPU Cache Memory Performance for Compute Workloads 2014 IEEE 13TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC), 2014, : 189 - 196
- [2] Optimizing Deep Learning Workloads on ARM GPU with TVM 1ST ACM REQUEST WORKSHOP/TOURNAMENT ON REPRODUCIBLE SOFTWARE/HARDWARE CO-DESIGN OF PARETO-EFFICIENT DEEP LEARNING, 2018,
- [3] Exploration of GPU sharing policies under GEMM workloads PROCEEDINGS OF THE 23RD INTERNATIONAL WORKSHOP ON SOFTWARE AND COMPILERS FOR EMBEDDED SYSTEMS (SCOPES 2020), 2020, : 66 - 69
- [4] An Evaluation of Cache Management Policies under Workloads with Malicious Requests 2017 IEEE SECOND ECUADOR TECHNICAL CHAPTERS MEETING (ETCM), 2017,
- [5] Optimizing GPU Kernels for Irregular Batch Workloads: A Case Study for Cholesky Factorization 2018 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2018,
- [7] Characterizing the impact of last-level cache replacement policies on big-data workloads 2020 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2020), 2020, : 134 - 144
- [8] Cache performance of video computation workloads THIRD INTERNATIONAL WORKSHOP ON DIGITAL AND COMPUTATIONAL VIDEO, PROCEEDINGS, 2002, : 169 - 175
- [9] Multilayer Cache Partitioning for Multiprogram Workloads EURO-PAR 2011 PARALLEL PROCESSING, PT 1, 2011, 6852 : 130 - 141
- [10] GPU Support for Batch Oriented Workloads 2009 IEEE 28TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCC 2009), 2009, : 231 - 238