Stride-directed prefetching for secondary caches

被引:8
|
作者
Kim, S
Veidenbaum, AV
机构
来源
PROCEEDINGS OF THE 1997 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING | 1997年
关键词
D O I
10.1109/ICPP.1997.622661
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper studies hardware prefetcheng for second-level (L2) caches. Previous work on prefetching has been extensive but largely directed at primary caches. In some cases only L2 prefetching is possible or is more appropriate. By studying L2 prefetching characteristics we show that existing stride-directed methods [1, 8] for L1 caches do not work as well in L2 caches. We propose a new stride-detection mechanism for L2 prefetching and combine it with stream buffers used in [16]. Our evaluation shows that this new prefetching scheme is more effective than stream buffer prefetching particularly for applications with long-stride accesses. Finally, we evaluate an L2 cache prefetching organization which combines a small L2 cache with our stride-directed prefetching scheme. Our results show that this system performs significantly better than stream buffer prefetching or a larger non-prefetching L2 cache without suffering from a significant increase in the memory traffic.
引用
收藏
页码:314 / 321
页数:8
相关论文
共 50 条
  • [31] Linear aggressive prefetching: A way to increase the performance of cooperative caches.
    Cortes, T
    Labarta, J
    IPPS/SPDP 1999: 13TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & 10TH SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1999, : 46 - 54
  • [32] Prefetching into smart-disk caches for high performance media servers
    Triantafillou, P
    Harizopoulos, S
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 1, 1999, : 500 - 505
  • [33] SLAP-CC: Set-Level Adaptive Prefetching for Compressed Caches
    AlBarakat, Laith M.
    Gratz, Paul, V
    Jimenez, Daniel A.
    2022 IEEE 40TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2022), 2022, : 50 - 58
  • [34] Exploring DMA-assisted Prefetching Strategies for Software Caches on Multicore Clusters
    Pinto, Christian
    Benini, Luca
    PROCEEDINGS OF THE 2014 IEEE 25TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2014), 2014, : 224 - 231
  • [35] Improvement of energy-efficiency in off-chip caches by selective prefetching
    Jalminger, J
    Stenström, P
    MICROPROCESSORS AND MICROSYSTEMS, 2002, 26 (03) : 107 - 121
  • [36] Fundamental Limits of Coded Caching With Multiple Antennas, Shared Caches and Uncoded Prefetching
    Parrinello, Emanuele
    Unsal, Ayse
    Elia, Petros
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2020, 66 (04) : 2252 - 2268
  • [37] Profiler and Compiler Assisted Adaptive I/O Prefetching for Shared Storage Caches
    Son, Seung Woo
    Muralidhara, Sai Prashanth
    Ozturk, Ozcan
    Kandemir, Mahmut
    Kolcu, Ibrahim
    Karakoy, Mustafa
    PACT'08: PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2008, : 112 - 121
  • [38] Adaptive compiler directed prefetching for EPIC processors
    Kim, J
    Rabbah, RM
    Palem, KV
    Wong, WF
    PDPTA '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-3, 2004, : 495 - 501
  • [39] Instruction cache prefetching directed by branch prediction
    Chiu, JC
    Shiu, RM
    Chi, SA
    Chung, CP
    IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1999, 146 (05): : 241 - 246
  • [40] Instruction cache prefetching directed by branch prediction
    Institute of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu 30050, Taiwan
    IEE Proc Comput Digital Tech, 5 (241-246):