共 50 条
- [2] On Improving the Performance of Multi-threaded CUDA Applications with Concurrent Kernel Execution by Kernel Reordering 2012 SYMPOSIUM ON APPLICATION ACCELERATORS IN HIGH PERFORMANCE COMPUTING (SAAHPC), 2012, : 74 - 83
- [3] High-Performance QR Decomposition for FPGAs PROCEEDINGS OF THE 2018 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'18), 2018, : 183 - 188
- [4] High-performance symmetric block ciphers on CUDA Proc. - Int. Conf. Networking Comput., ICNC, (221-227):
- [5] Predicting execution time of CUDA kernel using static analysis 2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS, 2018, : 948 - 955
- [6] High-performance and parameterized matrix factorization on FPGAS 2006 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, 2006, : 363 - 368
- [7] Integrating FPGAs in High-Performance Computing: Introduction FPGA 2007: FIFTEENTH ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2007, : 131 - 131
- [8] Optimizations for High-Performance IPsec Execution E-BUSINESS AND TELECOMMUNICATIONS, 2011, 130 : 199 - 211
- [9] Co-designing Trusted Execution Environment and Model Encryption for Secure High-Performance DNN Inference on FPGAs 2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,