Optimizing Communication and Capacity in a 3D Stacked Reconfigurable Cache Hierarchy

被引:0
|
作者
Madan, Niti [1 ]
Zhao, Li [2 ]
Muralimanohar, Naveen [1 ]
Udipi, Aniruddha [1 ]
Balasubramonian, Rajeev [1 ]
Iyer, Ravishankar [2 ]
Makineni, Srihari [2 ]
Newell, Donald [2 ]
机构
[1] Univ Utah, Sch Comp, Salt Lake City, UT 84112 USA
[2] Intel Corp, Syst Technol Lab, Santa Clara, CA 95051 USA
关键词
multi-core processors; cache and memory hierarchy; non-uniform cache architecture (NUCA); page coloring; on-chip networks; SRAM/DRAM cache reconfiguration;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Cache hierarchies in future many-core processors are expected to grow in size and contribute a large fraction of overall processor power and performance. In this paper we postulate a 3D chip design that stacks SRAM and DRAM upon processing cores and employs OS-based page coloring to minimize horizontal communication of cache data. We then propose a heterogeneous reconfigurable cache design that takes advantage of the high density of DRAM and the superior power/delay characteristics of SRAM to efficiently, meet the working set demands of each individual core. Finally, we analyze the communication patterns for such a processor and show that a tree topology is an ideal fit that significantly reduces the power and latency requirements of the on-chip network. The above proposals art, synergistic: each proposal is made more compelling because of its combination with the other innovations described in this paper The proposed reconfigurable cache model improves performance by up to 19% along with 48% savings in network power.
引用
收藏
页码:262 / +
页数:3
相关论文
共 50 条
  • [31] 3D Stacked Microprocessor: Are We There Yet?
    Loh, Gabriel H.
    Xie, Yuan
    IEEE MICRO, 2010, 30 (03) : 60 - 63
  • [32] 3D Stacked Microfluidic Cooling for High-Performance 3D ICs
    Zhang, Yue
    Dembla, Ashish
    Joshi, Yogendra
    Bakir, Muhannad S.
    2012 IEEE 62ND ELECTRONIC COMPONENTS AND TECHNOLOGY CONFERENCE (ECTC), 2012, : 1644 - 1650
  • [33] A 3D Stacked High Performance Scalable Architecture for 3D Fourier Transform
    Voicu, George R.
    Enachescu, Marius
    Cotofana, Sorin D.
    2012 IEEE 30TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2012, : 498 - 499
  • [34] Path optimizing for 3D printing
    Zeng Feng
    PROCEEDINGS OF THE 2016 6TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS, ENVIRONMENT, BIOTECHNOLOGY AND COMPUTER (MMEBC), 2016, 88 : 1662 - 1665
  • [35] Exploring the Relation between Monolithic 3D Ll GPU Cache Capacity and Warp Scheduling Efficiency
    Cong Thuan Do
    Gong, Young-Ho
    Kim, Cheol Hong
    Kim, Seon Wook
    Chung, Sung Woo
    2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2019,
  • [36] Optimizing Test Architecture of 3D Stacked ICs for Partial Stack/Complete Stack using Hard SOCs
    Roy, Surajit Kumar
    Giri, Chandan
    Rahaman, Hafizur
    2013 8TH INTERNATIONAL DESIGN AND TEST SYMPOSIUM (IDT), 2013,
  • [37] RECONFIGURABLE ACCELERATION OF 3D IMAGE REGISTRATION
    Tsoi, Kuen Hung
    Rueckert, Daniel
    Ho, Chun Hok
    Luk, Wayne
    2009 5TH SOUTHERN CONFERENCE ON PROGRAMMABLE LOGIC, PROCEEDINGS, 2009, : 95 - 100
  • [38] 3D graphics coding in a reconfigurable environment
    Tulvan, Christian
    Preda, Marius
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (10) : 1239 - 1254
  • [39] Efficient and reconfigurable design with 3D parametrics
    Rudig, Christian
    Witwer, Gerhard
    Dierneder, Stefan
    RECONFIGURABLE MECHANISMS AND ROBOTS, 2009, : 728 - 737
  • [40] Efficient 3D imaging with reconfigurable sensors
    Feng, Xiaohua
    NATURE ELECTRONICS, 2025, 8 (01): : 11 - 12