Improved Parallel Cache-Oblivious Algorithms for Dynamic Programming [Extend Abstract]

被引:0
|
作者
Blelloch, Guy E. [1 ]
Gu, Yan [2 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Calif Riverside, Riverside, CA 92521 USA
基金
美国国家科学基金会;
关键词
COMMUNICATION; COMPLEXITY;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Emerging non-volatile main memory (NVRAM) technologies provide byte-addressability, low idle power, and improved memorydensity, and are likely to be a key component in the future memory hierarchy. However, a critical challenge in achieving high performance is in accounting for the asymmetry that NVRAM writes can be significantly more expensive than NVRAM reads. In this paper, we consider a large class of cache-oblivious algorithms for dynamic programming (DP) and try to reduce the writes in the asymmetric setting while maintaining high parallelism. To achieve that, our key approach is to show the correspondence between these problems and an abstraction for their computation, which is referred to as the k-d grids. Then by showing lower bound and new algorithms for computing k-d grids, we show a list of improved cache-oblivious algorithms of many DP recurrences in the asymmetric setting, both sequentially and in parallel. Surprisingly, even without considering the read-write asymmetry (i.e., setting the write cost to be the same as the read cost in the algorithms), the new algorithms improve the existing cache complexity of many problems. We believe the reason is that the extra level of abstraction of k-d grids helps us to better understand the complexity and difficulties of these problems. We believe that the novelty of our framework is of theoretical interest and leads to many new questions for future work.
引用
收藏
页码:105 / 119
页数:15
相关论文
共 50 条
  • [21] A locality-preserving cache-oblivious dynamic dictionary
    Bender, MA
    Duan, ZY
    Iacono, J
    Wu, J
    PROCEEDINGS OF THE THIRTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2002, : 29 - 38
  • [22] POSTER: Provably Efficient Scheduling of Cache-Oblivious Wavefront Algorithms
    Chowdhury, Rezaul
    Ganapathi, Pramod
    Tang, Yuan
    Tithi, Jesmin Jahan
    ACM SIGPLAN NOTICES, 2017, 52 (08) : 435 - 436
  • [23] Brief Announcement: Balanced Partitioning of Several Cache-Oblivious Algorithms
    Tang, Yuan
    PROCEEDINGS OF THE 32ND ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES (SPAA '20), 2020, : 575 - 577
  • [24] An optimal cache-oblivious priority queue and its application to graph algorithms
    Arge, Lars
    Bender, Michael A.
    Demaine, Erik D.
    Holland-Minkley, Bryan
    Munro, J. Ian
    SIAM JOURNAL ON COMPUTING, 2007, 36 (06) : 1672 - 1695
  • [25] Cache-Oblivious parallel SIMD Viterbi decoding for sequence search in HMMER
    Miguel Ferreira
    Nuno Roma
    Luis MS Russo
    BMC Bioinformatics, 15
  • [26] Cache oblivious algorithms for nonserial polyadic programming
    Guangming Tan
    Shengzhong Feng
    Ninghui Sun
    The Journal of Supercomputing, 2007, 39 : 227 - 249
  • [27] Cache oblivious algorithms for nonserial polyadic programming
    Tan, Guangming
    Feng, Shengzhong
    Sun, Ninghui
    JOURNAL OF SUPERCOMPUTING, 2007, 39 (02): : 227 - 249
  • [28] Cache-Oblivious parallel SIMD Viterbi decoding for sequence search in HMMER
    Ferreira, Miguel
    Roma, Nuno
    Russo, Luis M. S.
    BMC BIOINFORMATICS, 2014, 15
  • [29] Cache-oblivious data structures and algorithms for undirected breadth-first search and shortest paths
    Brodal, GS
    Fagerberg, R
    Meyer, U
    Zeh, N
    ALGORITHM THEORY- SWAT 2004, 2004, 3111 : 480 - 492
  • [30] Cache-efficient Dynamic Programming Algorithms for Multicores
    Chowdhury, Rezaul Alam
    Ramachandran, Vijaya
    SPAA'08: PROCEEDINGS OF THE TWENTIETH ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2008, : 207 - 216