50 entries in total
- [1] The design and implementation of OpenMP 4.5 and OpenACC backends for the RAJA C++ performance portability layer Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2018, 10732 LNCS : 63 - 82
- [2] The Design and Implementation of OpenMP 4.5 and OpenACC Backends for the RAJA C++ Performance Portability Layer ACCELERATOR PROGRAMMING USING DIRECTIVES, WACCPD 2017, 2018, 10732 : 63 - 82
- [3] Enhancing OpenMP Tasking Model: Performance and Portability OPENMP: ENABLING MASSIVE NODE-LEVEL PARALLELISM, IWOMP 2021, 2021, 12870 : 35 - 49
- [4] On the Performance Portability of OpenACC, OpenMP, Kokkos and RAJA ACM International Conference Proceeding Series, 2022 : 103 - 114
- [5] The Productivity, Portability and Performance of OpenMP 4.5 for Scientific Applications Targeting Intel CPUs, IBM CPUs, and NVIDIA GPUs SCALING OPENMP FOR EXASCALE PERFORMANCE AND PORTABILITY (IWOMP 2017), 2017, 10468 : 185 - 200
- [6] Pragmatic Performance Portability with OpenMP 4.x OPENMP: MEMORY, DEVICES, AND TASKS, 2016, 9903 : 253 - 267
- [7] Performance portability of sparse matrix-vector multiplication implemented using OpenMP, OpenACC and SYCL FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 170
- [8] Evaluating Performance Portability of OpenMP for SNAP on NVIDIA, Intel, and AMD GPUs Using the Roofline Methodology ACCELERATOR PROGRAMMING USING DIRECTIVES, WACCPD 2020, 2021, 12655 : 3 - 24
- [9] Evaluating the Impact of Proposed OpenMP 5.0 Features on Performance, Portability and Productivity PROCEEDINGS OF 2018 IEEE/ACM INTERNATIONAL WORKSHOP ON PERFORMANCE, PORTABILITY AND PRODUCTIVITY IN HPC (P3HPC 2018), 2018 : 37 - 46
- [10] A Performance Portability Study Using Tensor Contraction Benchmarks 2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW, 2023 : 591 - 600