Tackling Memory Access Latency Through DRAM Row Management

Cited by: 8
Authors
Srikanth, Sriseshan [1]
Subramanian, Lavanya [2]
Subramoney, Sreenivas [2]
Conte, Thomas M. [1]
Wang, Hong [2]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Intel Labs, Hillsboro, OR USA
DOI
10.1145/3240302.3240314
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
Memory latency is a critical bottleneck in today's systems. The organization of the DRAM main memory necessitates sensing and reading an entire row (around 4KB) of data in order to access a single cache block. The benefit of this organization is that subsequent accesses to the same row can be served faster (row hits). However, accesses to other rows incur high latency to prepare the DRAM bank for a subsequent access and read the contents of the new row (row conflicts). Therefore, the decision on how long a row is held open for is a key factor that determines the access latency incurred by requests to memory. While prior work has tackled this problem, existing solutions are either complex or ineffective. Our goal, in this work, is to build a row management scheme that is simple yet effective. Towards this end, we first build a scoreboard scheme that determines how long to hold a row open, by i) predicting the number of row hits and row conflicts for different lengths of time rows are held open and ii) picking the time that maximizes row hits without increasing row conflicts significantly. We then observe that a small set of rows tend to experience a large number of back-to-back accesses. We build a row exclusion scheme that identifies such rows and prevents them from being closed until the next access to a different row arrives. Our evaluations show that our scoreboard and row exclusion policies together incur less than 0.4% of the additional storage cost of the most effective prior mechanism, while surpassing it in terms of performance.
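The two ideas in the abstract can be illustrated with a toy model. The sketch below is not the authors' hardware design: it replays a hypothetical single-bank access trace of `(time, row)` pairs in software, scores each candidate hold-open time by the row hits and row conflicts it would produce, and picks the candidate with the most hits whose conflicts stay within an assumed `slack` of the minimum. The names `score_timeout`, `pick_timeout`, `hot_rows`, `slack`, and `min_run`, as well as the trace itself, are assumptions introduced only for illustration.

```python
def score_timeout(trace, timeout, excluded=frozenset()):
    """Return (hits, conflicts) for one bank when a row is closed once it
    has been idle for more than `timeout` time units. Rows in `excluded`
    are never closed by the timer; they stay open until a different row
    is accessed (a simplified reading of row exclusion)."""
    hits = conflicts = 0
    open_row = None      # row currently held in the row buffer
    last_time = None     # time of the most recent access
    for time, row in trace:
        if open_row is not None:
            timer_expired = (time - last_time) > timeout
            if timer_expired and open_row not in excluded:
                open_row = None          # row was closed before this access
        if open_row == row:
            hits += 1                    # row hit: same row still open
        elif open_row is not None:
            conflicts += 1               # row conflict: another row is open
        # otherwise the bank was closed: a plain row miss, not counted here
        open_row, last_time = row, time
    return hits, conflicts


def pick_timeout(trace, candidates, slack=0, excluded=frozenset()):
    """Pick the candidate with the most hits among those whose conflict
    count is within `slack` of the lowest conflict count (assumed rule)."""
    scores = {t: score_timeout(trace, t, excluded) for t in candidates}
    fewest_conflicts = min(c for _, c in scores.values())
    allowed = [t for t in candidates if scores[t][1] <= fewest_conflicts + slack]
    # Most hits wins; ties go to the shorter hold-open time.
    return max(allowed, key=lambda t: (scores[t][0], -t))


def hot_rows(trace, min_run=3):
    """Rows that see at least `min_run` back-to-back accesses."""
    hot = set()
    run_row, run_len = None, 0
    for _, row in trace:
        if row == run_row:
            run_len += 1
        else:
            run_row, run_len = row, 1
        if run_len >= min_run:
            hot.add(row)
    return hot


# Hypothetical trace: (time, row) pairs for a single bank.
trace = [(0, "A"), (2, "A"), (4, "A"), (20, "B"), (21, "B"), (40, "A"), (60, "C")]

best = pick_timeout(trace, candidates=[1, 5, 20])   # 5
exclude = hot_rows(trace)                           # {"A"}
with_exclusion = score_timeout(trace, 1, exclude)   # (3, 2)
```

In this trace, holding rows open for 20 time units yields no more hits than holding them for 5 but adds three conflicts, so the shorter time is chosen. Excluding the hot row `A` recovers its back-to-back hits even under the shortest timer, at the cost of conflicts whenever a different row follows it.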
Pages: 137-147
Page count: 11