Tackling Memory Access Latency Through DRAM Row Management

被引:8
|
作者
Srikanth, Sriseshan [1 ]
Subramanian, Lavanya [2 ]
Subramoney, Sreenivas [2 ]
Conte, Thomas M. [1 ]
Wang, Hong [2 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Intel Labs, Hillsboro, OR USA
关键词
D O I
10.1145/3240302.3240314
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Memory latency is a critical bottleneck in today's systems. The organization of the DRAM main memory necessitates sensing and reading an entire row (around 4KB) of data in order to access a single cache block. The benefit of this organization is that subsequent accesses to the same row can be served faster (row hits). However, accesses to other rows incur high latency to prepare the DRAM bank for a subsequent access and read the contents of the new row (row conflicts). Therefore, the decision on how long a row is held open for is a key factor that determines the access latency incurred by requests to memory. While prior work has tackled this problem, existing solutions are either complex or ineffective. Our goal, in this work, is to build a row management scheme that is simple yet effective. Towards this end, we first build a scoreboard scheme that determines how long to hold a row open, by i) predicting the number of row hits and row conflicts for different lengths of time rows are held open and ii) picking the time that maximizes row hits without increasing row conflicts significantly. We then observe that a small set of rows tend to experience a large number of back-to-back accesses. We build a row exclusion scheme that identifies such rows and prevents them from being closed until the next access to a different row arrives. Our evaluations show that our scoreboard and row exclusion policies together incur less than 0.4% of the additional storage cost of the most effective prior mechanism, while surpassing it in terms of performance.
引用
收藏
页码:137 / 147
页数:11
相关论文
共 50 条
  • [31] A Distributed Interleaving Scheme for Efficient Access to WideIO DRAM Memory
    Seiculescu, Ciprian
    Benini, Luca
    De Micheli, Giovanni
    CODES+ISSS'12:PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON HARDWARE/SOFTWARE-CODESIGN AND SYSTEM SYNTHESIS, 2012, : 103 - 112
  • [32] On the retention time distribution of dynamic random access memory (DRAM)
    Hamamoto, T
    Sugiura, S
    Sawada, S
    IEEE TRANSACTIONS ON ELECTRON DEVICES, 1998, 45 (06) : 1300 - 1309
  • [33] Saving Memory Movements Through Vector Processing in the DRAM
    Alves, Marco A. Z.
    Santos, Paulo C.
    Moreira, Francis B.
    Diener, Matthias
    Carro, Luigi
    2015 INTERNATIONAL CONFERENCE ON COMPILERS, ARCHITECTURE AND SYNTHESIS FOR EMBEDDED SYSTEMS (CASES), 2015, : 117 - 126
  • [34] Latency equalization of memory access in network-chips
    Li, Yang
    Chen, Xiao-Wen
    Zhao, Xiao-Hui
    Yang, Yong
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2015, 45 (05): : 1624 - 1630
  • [35] Tackling the shadow economy through inflation and access to credit
    Bansah, Marcus
    Mohsin, Mohammed
    JOURNAL OF INTERNATIONAL TRADE & ECONOMIC DEVELOPMENT, 2024, 33 (05): : 903 - 927
  • [36] Transparent Hardware Management of Stacked DRAM as Part of Memory
    Sim, Jaewoong
    Alameldeen, Alaa R.
    Chishti, Zeshan
    Wilkerson, Chris
    Kim, Hyesoon
    2014 47TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2014, : 13 - 24
  • [37] Software Thermal Management of DRAM Memory for Multicore Systems
    Lin, Jiang
    Zheng, Hongzhong
    Zhu, Zhichun
    Gorbatov, Eugene
    David, Howard
    Zhang, Zhao
    SIGMETRICS'08: PROCEEDINGS OF THE 2008 INTERNATIONAL CONFERENCE ON MEASUREMENT & MODELING OF COMPUTER SYSTEMS, 2008, 36 (01): : 337 - +
  • [38] A composable worst case latency analysis for multi-rank DRAM devices under open row policy
    Zheng Pei Wu
    Rodolfo Pellizzoni
    Danlu Guo
    Real-Time Systems, 2016, 52 : 761 - 807
  • [39] Efficient management of memory hierarchies in embedded DRAM systems
    Saulsbury, Ashley
    Huang, Su-Jaen
    Dahlgren, Fredrik
    Proceedings of the International Conference on Supercomputing, 1999, : 464 - 473
  • [40] A composable worst case latency analysis for multi-rank DRAM devices under open row policy
    Wu, Zheng Pei
    Pellizzoni, Rodolfo
    Guo, Danlu
    REAL-TIME SYSTEMS, 2016, 52 (06) : 761 - 807