Efficient Synonym Filtering and Scalable Delayed Translation for Hybrid Virtual Caching

Cited by: 5

Authors:

Park, Chang Hyun [1]

Heo, Taekyung [1]

Huh, Jaehyuk [1]

Affiliations:

[1] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon, South Korea

Source:

2016 ACM/IEEE 43RD ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA) | 2016

Keywords:

address translation; hybrid virtual cache; synonym detection; segmented translation

DOI:

10.1109/ISCA.2016.28

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

Conventional translation lookaside buffers (TLBs) must complete address translation with short latency, because translation is on the critical path of every memory access, even for L1 cache hits. This strict latency requirement limits TLB capacity: the added latency of a larger TLB may lower overall performance even if it reduces TLB misses. Furthermore, TLBs consume a significant amount of energy, as they are accessed for every instruction fetch and data access. To avoid the latency restriction and reduce energy consumption, virtual caching techniques have been proposed that defer translation until after L1 cache misses. However, the lack of an efficient solution to the synonym problem has been a critical issue hindering the wide adoption of virtual caching. Building on the virtual caching concept, this study proposes a hybrid virtual memory architecture that extends virtual caching to the entire cache hierarchy, aiming to improve both performance and energy consumption. Hybrid virtual caching uses virtual addresses augmented with address space identifiers (ASIDs) in the cache hierarchy for the common case of non-synonym addresses. For such non-synonyms, address translation occurs only after last-level cache (LLC) misses. For the uncommon case of synonym addresses, the addresses are translated to physical addresses with conventional TLBs before L1 cache accesses. To support this hybrid translation, we propose an efficient synonym detection mechanism based on Bloom filters, which can identify synonym candidates with few false positives. For large-memory applications, delayed translation alone cannot solve the address translation problem, as fixed-granularity delayed TLBs may not scale with increasing memory requirements. To mitigate this translation scalability problem, this study proposes a delayed many-segment translation designed for hybrid virtual caching. The experimental results show that our approach effectively lowers the number of TLB accesses, leading to significant power savings. In addition, the approach improves performance through scalable delayed translation with variable-length segments.
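
The access path described in the abstract can be illustrated with a minimal software sketch. It is not the authors' hardware design: the abstract does not give filter sizes, hash functions, or table layouts, so every name and parameter here (SynonymFilter, SegmentTable, cache_lookup_key, the 4 KiB page size, the BLAKE2 hashes) is a hypothetical chosen for illustration. The sketch shows a Bloom filter that flags synonym candidates, a routing step that sends candidates through a conventional TLB and leaves everything else virtually addressed, and a variable-length segment lookup standing in for the delayed translation after an LLC miss.

```python
import bisect
import hashlib

PAGE_SHIFT = 12  # assumption: 4 KiB pages


class SynonymFilter:
    """Bloom filter over (ASID, virtual page number) pairs (hypothetical layout)."""

    def __init__(self, num_bits=1024, num_hashes=2):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for clarity

    def _positions(self, asid, vpn):
        # Derive num_hashes bit positions; the hash choice is an assumption.
        for i in range(self.num_hashes):
            key = f"{i}:{asid}:{vpn}".encode()
            digest = hashlib.blake2b(key, digest_size=8).digest()
            yield int.from_bytes(digest, "little") % self.num_bits

    def mark_synonym(self, asid, vaddr):
        for pos in self._positions(asid, vaddr >> PAGE_SHIFT):
            self.bits[pos] = 1

    def maybe_synonym(self, asid, vaddr):
        # May return True for a non-synonym (false positive), never False
        # for a page that was marked.
        return all(self.bits[p] for p in self._positions(asid, vaddr >> PAGE_SHIFT))


def cache_lookup_key(synonym_filter, tlb, asid, vaddr):
    """Pick the address used to index the cache hierarchy."""
    if synonym_filter.maybe_synonym(asid, vaddr):
        # Synonym candidate: translate up front. `tlb` is a plain dict
        # standing in for a conventional TLB backed by the page table.
        frame = tlb[(asid, vaddr >> PAGE_SHIFT)]
        offset = vaddr & ((1 << PAGE_SHIFT) - 1)
        return ("phys", (frame << PAGE_SHIFT) | offset)
    # Common case: stay virtual, tagged with the ASID.
    return ("virt", asid, vaddr)


class SegmentTable:
    """Variable-length segments for one address space, used after an LLC miss."""

    def __init__(self, segments):
        # Each segment is (virtual_base, length, physical_base).
        self.segments = sorted(segments)
        self.bases = [seg[0] for seg in self.segments]

    def translate(self, vaddr):
        i = bisect.bisect_right(self.bases, vaddr) - 1
        if i >= 0:
            base, length, phys_base = self.segments[i]
            if vaddr < base + length:
                return phys_base + (vaddr - base)
        raise KeyError(f"no segment covers {vaddr:#x}")
```

Because a Bloom filter has no false negatives, a marked synonym page is always translated before the L1 access; a false positive only costs an unnecessary TLB lookup, which is why a low false-positive rate matters for the energy savings the abstract reports.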


Pages: 217-229

Page count: 13

