Occamy: Memory-efficient GPU Compiler for DNN Inference

Cited by: 5
Authors
Lee, Jaeho [1]
Jeong, Shinnung [1]
Song, Seungbin [1]
Kim, Kunwoo [1]
Choi, Heelim [1]
Kim, Youngsok [1]
Kim, Hanjun [1]
Affiliation
[1] Yonsei Univ, Seoul, South Korea
DOI
10.1109/DAC56929.2023.10247839
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This work proposes Occamy, a new memory-efficient DNN compiler that reduces the memory usage of a DNN model without affecting its accuracy. For each DNN operation, Occamy analyzes the dimensions of input and output tensors, and their liveness within the operation. Across all the operations, Occamy analyzes liveness of all the tensors, generates a memory pool after calculating the maximum required memory size, and schedules when and where to place each tensor in the memory pool. Compared to PyTorch, on an integrated embedded GPU for six DNNs, Occamy reduces the memory usage by 34.6% and achieves a geometric mean speedup of 1.25x.
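The abstract's pipeline (tensor liveness, a single memory pool sized up front, and a placement for each tensor) can be illustrated with a minimal sketch. This is not Occamy's algorithm, which the abstract does not specify. It is a generic greedy first-fit planner, and the `Tensor` type, the `plan_pool` function, and the example sizes and lifetimes are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tensor:
    name: str
    size: int   # bytes
    first: int  # index of the operation that produces the tensor
    last: int   # index of the last operation that reads it


def overlaps(a: Tensor, b: Tensor) -> bool:
    """True if the two tensors are live at the same time."""
    return a.first <= b.last and b.first <= a.last


def plan_pool(tensors):
    """Greedy first-fit placement (an assumed heuristic, not Occamy's).

    Larger tensors are placed first, each at the lowest offset that is
    free for its whole lifetime. Returns (offsets, pool_size).
    """
    offsets = {}
    placed = []
    for t in sorted(tensors, key=lambda t: (-t.size, t.first)):
        # Byte ranges held by already-placed tensors that are live with t.
        busy = sorted(
            (offsets[p.name], offsets[p.name] + p.size)
            for p in placed
            if overlaps(t, p)
        )
        offset = 0
        for start, end in busy:
            if offset + t.size <= start:
                break  # t fits in the gap before this range
            offset = max(offset, end)
        offsets[t.name] = offset
        placed.append(t)
    pool_size = max((offsets[t.name] + t.size for t in tensors), default=0)
    return offsets, pool_size


# Hypothetical four-tensor chain: x -> a -> b -> y.
tensors = [
    Tensor("x", 4, 0, 1),
    Tensor("a", 8, 1, 2),
    Tensor("b", 4, 2, 3),
    Tensor("y", 4, 3, 3),
]
offsets, pool_size = plan_pool(tensors)
naive_size = sum(t.size for t in tensors)
print(offsets, pool_size, naive_size)  # pool of 12 bytes versus 20 without reuse
```

In this toy chain, `x` and `b` share offset 8 because their lifetimes never overlap, and `y` reuses the space freed by `a`. A real compiler would also need to account for alignment and for buffers that an operation uses internally.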
Pages: 6
Related papers
50 in total
  • [31] Gerbil: a fast and memory-efficient k-mer counter with GPU-support
    Erbert, Marius
    Rechner, Steffen
    Mueller-Hannemann, Matthias
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2017, 12
  • [32] Gerbil: a fast and memory-efficient k-mer counter with GPU-support
    Marius Erbert
    Steffen Rechner
    Matthias Müller-Hannemann
    Algorithms for Molecular Biology, 12
  • [33] Memory-Efficient Parallelization of 3D Lattice Boltzmann Flow Solver on a GPU
    Nhat-Phuong Tran
    Lee, Myungho
    Choi, Dong Hoon
    2015 IEEE 22ND INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2015, : 315 - 324
  • [34] FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices
    Li, Xiangyu
    Li, Yuanchun
    Li, Yuanzhe
    Cao, Ting
    Liu, Yunxin
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING, ACM MOBICOM 2024, 2024, : 709 - 723
  • [35] Memory-efficient interconnect optimization
    Lai, MH
    Wong, DF
    PROCEEDINGS OF THE ASP-DAC 2001: ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE 2001, 2001, : 198 - 202
  • [36] Memory-efficient DRASiW Models
    Napoli, Otavio Oliveira
    de Almeida, Ana Maria
    Borin, Edson
    Breternitz Jr, Mauricio
    NEUROCOMPUTING, 2024, 610
  • [37] Memory-efficient fingerprint verification
    Beleznai, C
    Ramoser, H
    Wachmann, B
    Birchbauer, J
    Bischof, H
    Kropatsch, W
    2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2001, : 463 - 466
  • [38] Memory-efficient fixpoint computation
    Sung Kook Kim
    Arnaud J. Venet
    Aditya V. Thakur
    Formal Methods in System Design, 2025, 65 (1) : 133 - 162
  • [39] Memory-Efficient Hash Joins
    Barber, R.
    Lohman, G.
    Pandis, I.
    Raman, V.
    Sidle, R.
    Attaluri, G.
    Chainani, N.
    Lightstone, S.
    Sharpe, D.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 8 (04): : 353 - 364
  • [40] Memory-Efficient Fixpoint Computation
    Kim, Sung Kook
    Venet, Arnaud J.
    Thakur, Aditya, V
    STATIC ANALYSIS (SAS 2020), 2020, 12389 : 35 - 64