Tackling Memory Access Latency Through DRAM Row Management

Cited by: 8

Authors:

Srikanth, Sriseshan [1]

Subramanian, Lavanya [2]

Subramoney, Sreenivas [2]

Conte, Thomas M. [1]

Wang, Hong [2]

Affiliations:

[1] Georgia Inst Technol, Atlanta, GA 30332 USA

[2] Intel Labs, Hillsboro, OR USA

Source:

PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS (MEMSYS 2018) | 2018

Keywords:

DOI:

10.1145/3240302.3240314

Chinese Library Classification:

TP301 [Theory, Methods];

Discipline classification code:

081202 ;

Abstract:

Memory latency is a critical bottleneck in today's systems. The organization of the DRAM main memory necessitates sensing and reading an entire row (around 4KB) of data in order to access a single cache block. The benefit of this organization is that subsequent accesses to the same row can be served faster (row hits). However, accesses to other rows incur high latency to prepare the DRAM bank for a subsequent access and read the contents of the new row (row conflicts). Therefore, the decision on how long a row is held open for is a key factor that determines the access latency incurred by requests to memory. While prior work has tackled this problem, existing solutions are either complex or ineffective. Our goal, in this work, is to build a row management scheme that is simple yet effective. Towards this end, we first build a scoreboard scheme that determines how long to hold a row open, by i) predicting the number of row hits and row conflicts for different lengths of time rows are held open and ii) picking the time that maximizes row hits without increasing row conflicts significantly. We then observe that a small set of rows tend to experience a large number of back-to-back accesses. We build a row exclusion scheme that identifies such rows and prevents them from being closed until the next access to a different row arrives. Our evaluations show that our scoreboard and row exclusion policies together incur less than 0.4% of the additional storage cost of the most effective prior mechanism, while surpassing it in terms of performance.
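The scoreboard idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract describes a scheme that predicts hits and conflicts, whereas this sketch replays a recorded access trace for one bank offline. The names `score_timeout` and `pick_timeout`, the trace format, and the `max_extra_conflicts` tolerance are all hypothetical choices made for illustration.

```python
from typing import Dict, Optional, Sequence, Tuple

Access = Tuple[int, int]  # (arrival cycle, row id) for a single bank; assumed format


def score_timeout(trace: Sequence[Access], timeout: int) -> Tuple[int, int]:
    """Return (row hits, row conflicts) if a row is closed once it has
    been idle for more than `timeout` cycles."""
    hits = 0
    conflicts = 0
    open_row: Optional[int] = None
    last_access = 0
    for cycle, row in trace:
        # The row was closed in the background if it sat idle past the timeout.
        if open_row is not None and cycle - last_access > timeout:
            open_row = None
        if open_row == row:
            hits += 1        # same row still open: row hit
        elif open_row is not None:
            conflicts += 1   # a different row is open: row conflict
        # Otherwise the bank was already closed: neither hit nor conflict.
        open_row, last_access = row, cycle
    return hits, conflicts


def pick_timeout(trace: Sequence[Access],
                 candidates: Sequence[int],
                 max_extra_conflicts: int) -> int:
    """Pick the candidate with the most hits among those whose conflict
    count exceeds the best (lowest) conflict count by at most the tolerance."""
    scores: Dict[int, Tuple[int, int]] = {
        t: score_timeout(trace, t) for t in candidates
    }
    baseline = min(conf for _, conf in scores.values())
    allowed = [t for t in candidates
               if scores[t][1] - baseline <= max_extra_conflicts]
    # Ties on hits are broken in favor of the shorter timeout (an assumption).
    return max(allowed, key=lambda t: (scores[t][0], -t))


if __name__ == "__main__":
    trace = [(0, 1), (5, 1), (40, 1), (100, 2), (105, 2), (400, 1)]
    for t in (8, 50, 200, 500):
        print(t, score_timeout(trace, t))
    print("chosen:", pick_timeout(trace, [8, 50, 200, 500], 0))
```

On this toy trace, a timeout long enough to cover the 35-cycle gap between accesses to row 1 gains a hit, while timeouts that keep row 1 open until row 2 arrives add conflicts without adding hits. The row exclusion scheme mentioned in the abstract is not modeled here.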

Citation

Pages: 137-147

Page count: 11

