Enabling Scalable Chiplet-based Uniform Memory Architectures with Silicon Photonics

Cited by: 18
Authors
Fotouhi, Pouya [1]
Werner, Sebastian [1]
Lowe-Power, Jason [2]
Ben Yoo, S. J. [1]
Affiliations
[1] Univ Calif Davis, Dept Elect & Comp Engn, Davis, CA 95616 USA
[2] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
Funding
National Science Foundation (USA)
Keywords
Chiplets; Interconnects; Memory Architecture; Silicon Photonics; Interposer Technologies; Network
DOI
10.1145/3357526.3357564
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Chiplet-based systems have recently received much attention for scaling up processing power in HPC systems due to their high energy efficiency and low-cost manufacturing; however, large inter-chiplet NUMA latencies, distance-related energy overheads, and limited IO bandwidth caused by state-of-the-art packaging and interconnect technologies substantially limit their scalability. The large last-level caches (LLCs) of current systems (up to 16 MiB per chiplet and 40% of chiplet area) can only temporarily hide these limitations, and they come at the high cost and leakage power of SRAM cells. In this paper, we propose the use of integrated silicon-photonic (SiPh) interconnects on an organic package substrate, which combines low material costs with high IO bandwidth, distance-independent energy consumption, and a low-latency point-to-point interconnection fabric to effectively overcome current interconnect and packaging limitations. We exploit the properties of this fabric to propose a scalable uniform memory architecture (S-UMA) that overcomes all NUMA-related performance challenges. Moreover, we propose exploiting our low-latency SiPh fabric to remove the large LLCs from the processor chiplets and re-integrate them into separate chiplets. This increases manufacturing yield through smaller chiplets, allows the most efficient process to be used for SRAM circuits, and eases the integration of alternative memory technologies without performance hits. Compared to state-of-the-art architectures, S-UMA offers a 23% performance speed-up and 30% network power savings on average across HPC workloads for an 8-chiplet, 64-core system.
Pages: 222-234
Page count: 13
Related papers
31 in total
  • [21] Distributed Memory Guard: Enabling Secure Enclave Computing in NoC-based Architectures
    Dessouky, Ghada
    Isakov, Mihailo
    Kinsy, Michel A.
    Mahmoody, Pouya
    Mark, Miguel
    Sadeghi, Ahmad-Reza
    Stapf, Emmanuel
    Zeitouni, Shaza
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 985 - 990
  • [22] Integrated, scalable and reconfigurable Silicon Photonics based optical switch for colorless, directionless and contentionless operation
    Tondini, Stefano
    Chalyan, Astghik
    Fontana, Giorgio
    Pavesi, Lorenzo
    Zecevic, Nikola
    Hofbauer, Michael
    Goll, Bernhard
    Zimmermann, Horst
    Stracca, Stefano
    Bianchi, Alberto
    Manganelli, Costanza
    Velha, Philippe
    Pintus, Paolo
    Di Pasquale, Fabrizio
    Oton, Claudio J.
    Kopp, Christophe
    Adelmini, Laetitia
    Lemonnier, Olivier
    Pares, Gabriel
    Chiaretti, Guido
    Serrano, Aina
    Angel Ayucar, Jose
    Battista Preve, Giovan
    Kim, Minsu
    Lee, Jong Moo
    Testa, Francesco
    2018 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXPOSITION (OFC), 2018,
  • [23] Multi-Chiplet System Architecture With Shared Uniform Access Memory Based on Board-Level Optical Interconnects
    Sharma, Arastu
    Bamiedakis, Nikolaos
    Karinou, Fotini
    Penty, Richard
    2021 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXPOSITION (OFC), 2021,
  • [24] Covert Timing Channels Exploiting Non-Uniform Memory Access based Architectures
    Yao, Fan
    Venkataramani, Guru
    Doroslovacki, Milos
    PROCEEDINGS OF THE GREAT LAKES SYMPOSIUM ON VLSI 2017 (GLSVLSI '17), 2017, : 155 - 160
  • [25] Silicon-based all-optical memory elements for 1.54 μm photonics
    Forcales, M
    Gregorkiewicz, T
    Zavada, JM
    SOLID-STATE ELECTRONICS, 2003, 47 (01) : 165 - 168
  • [26] Enabling Wavelength-Dependent Adjoint-Based Methods for Process Variation Sensitivity Analysis in Silicon Photonics
    Zhang, Zhengxing
    El-Henawy, Sally I.
    Sadun, Allan
    Miller, Ryan
    Daniel, Luca
    White, Jacob K.
    Boning, Duane S.
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2021, 39 (06) : 1762 - 1769
  • [27] Silicon Photonics Interconnect Based on Ultra-small Scalable Components for Multi-channel Optical Transceivers
    Rhee, Hanjo
    Al-Saadi, Aws
    Kupijai, Sebastian
    Theiss, Christoph
    Otte, Sven
    Eichler, Hans J.
    Woggon, Ulrike
    Tillack, Bernd
    Zimmermann, Lars
    Richter, Harald H.
    Lischke, Stefan
    Mai, Christian
    Stolarek, David
    Meister, Stefan
    ECOC 2015 41ST EUROPEAN CONFERENCE ON OPTICAL COMMUNICATION, 2015,
  • [28] Scalable Memory Fabric for Silicon Interposer-Based Multi-Core Systems
    Akgun, Itir
    Zhan, Jia
    Wang, Yuangang
    Xie, Yuan
    PROCEEDINGS OF THE 34TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2016, : 33 - 40
  • [29] Scalable and Memory-efficient Spin Locks for Embedded Tile-based Many-core Architectures
    Awamoto, Shinichi
    Chishiro, Hiroyuki
    Kato, Shinpei
    2018 IEEE 21ST INTERNATIONAL SYMPOSIUM ON REAL-TIME DISTRIBUTED COMPUTING (ISORC 2018), 2018, : 25 - 32
  • [30] 1.2 Tbps/cm² Enabling Silicon Photonics IC Technology Based on 40-nm Generation Platform
    Mogami, Tohru
    Horikawa, Tsuyoshi
    Kinoshita, Keizo
    Hagihara, Yasuhiko
    Ushida, Jun
    Tokushima, Masatoshi
    Fujikata, Junichi
    Takahashi, Shigeki
    Shimizu, Takanori
    Ukita, Akio
    Takemura, Koichi
    Kurihara, Mitsuru
    Yashiki, Kenichiro
    Okamoto, Daisuke
    Suzuki, Yasuyuki
    Sobu, Yohei
    Jeong, Seok-Hwan
    Tanaka, Yu
    Nakamura, Takahiro
    Kurata, Kazuhiko
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2018, 36 (20) : 4701 - 4712