PIMPR: PIM-based Personalized Recommendation with Heterogeneous Memory Hierarchy

被引:0
|
作者
Yang, Tao [1 ,2 ]
Ma, Hui [1 ]
Zhao, Yilong [1 ]
Liu, Fangxin [1 ]
He, Zhezhi [1 ]
Sun, Xiaoli [4 ]
Jiang, Li [1 ,2 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Shanghai Qi Zhi Inst, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
[4] Zhejiang Inst Sci & Technol Informat, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Recommendation System; PIM; Embedding; Acceleration; Architecture Design;
D O I
10.23919/DATE56975.2023.10137249
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning-based personalized recommendation models (DLRMs) are dominating AI tasks in data centers. The performance bottleneck of typical DLRMs mainly lies in the memory-bounded embedding layers. Resistive Random Access Memory (ReRAM)-based Processing-in-memory (PIM) architecture is a natural fit for DLRMs thanks to its in-situ computation and high computational density. However, it remains two challenges before DLRMs fully embrace ReRAM-based PIM architectures: 1) The size of DLRM's embedding tables can reach tens of GBs, far beyond the memory capacity of typical ReRAM chips. 2) The irregular sparsity conveyed in the embedding layers is difficult to exploit in ReRAM crossbars architecture. In this paper, we present a PIM-based DLRM accelerator named PIMPR. PIMPR has a heterogeneous memory hierarchy-ReRAM crossbar-based PIM modules serve as the computing caches with high computing parallelism, while DIMM modules are able to hold the entire embedding table-leveraging the data locality of DLRM's embedding layers. Moreover, we propose a runtime strategy to skip the useless calculation induced by the sparsity and an offline strategy to balance the workload of each ReRAM crossbar. Compared to the state-of-the-art DLRM accelerator SPACE and TRiM, PIMPR achieves on average 2.02x and 1.79x speedup, 5.6x, and 5.1x energy reduction, respectively.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] SemRec: a personalized semantic recommendation method based on weighted heterogeneous information networks
    Shi, Chuan
    Zhang, Zhiqiang
    Ji, Yugang
    Wang, Weipeng
    Yu, Philip S.
    Shi, Zhiping
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (01): : 153 - 184
  • [32] Enabling Highly Efficient Capsule Networks Processing Through A PIM-Based Architecture Design
    Zhang, Xingyao
    Song, Shuaiwen Leon
    Xie, Chenhao
    Wang, Jing
    Zhang, Weigong
    Fu, Xin
    2020 IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2020), 2020, : 542 - 555
  • [33] Personalized Entity Recommendation: A Heterogeneous Information Network Approach
    Yu, Xiao
    Ren, Xiang
    Sun, Yizhou
    Gu, Quanquan
    Sturt, Bradley
    Khandelwal, Urvashi
    Norick, Brandon
    Han, Jiawei
    WSDM'14: PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2014, : 283 - 292
  • [34] A Personalized Recommendation Algorithm via Heterogeneous Heat Conduction
    Chen, Guang
    Qiu, Tian
    Zhong, Lixin
    Zhang, Xiaolin
    Ye, Aihua
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 602 - 606
  • [35] PSQ: An Automatic Search Framework for Data-Free Quantization on PIM-based Architecture
    Liu, Fangxin
    Yang, Ning
    Jiang, Li
    2023 IEEE 41ST INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, ICCD, 2023, : 507 - 514
  • [36] LerGAN: A Zero-free, Low Data Movement and PIM-based GAN Architecture
    Mao, Haiyu
    Song, Mingcong
    Li, Tao
    Dai, Yuting
    Shu, Jiwu
    2018 51ST ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2018, : 669 - 681
  • [37] NGPR: A comprehensive personalized point-of-interest recommendation method based on heterogeneous graphs
    Dongjin Yu
    Ting Yu
    Dongjing Wang
    Yi Shen
    Multimedia Tools and Applications, 2022, 81 : 39207 - 39228
  • [38] HRec: Heterogeneous Graph Embedding-Based Personalized Point-of-Interest Recommendation
    Su, Yijun
    Li, Xiang
    Zha, Daren
    Tang, Wei
    Jiang, Yiwen
    Xiang, Ji
    Gao, Neng
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT III, 2019, 11955 : 37 - 49
  • [39] NGPR: A comprehensive personalized point-of-interest recommendation method based on heterogeneous graphs
    Yu, Dongjin
    Yu, Ting
    Wang, Dongjing
    Shen, Yi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (27) : 39207 - 39228
  • [40] Ontology-based personalized couple clustering for heterogeneous product recommendation in mobile marketing
    Yuan, ST
    Cheng, CS
    EXPERT SYSTEMS WITH APPLICATIONS, 2004, 26 (04) : 461 - 476