MSCA: A Multi-grained Sparse Convolution Accelerator for DNN Training

被引:0
|
作者
Mao, Yingchang [1 ]
Liu, Qiang [1 ]
Cheung, Ray C. C. [2 ]
机构
[1] Tianjin Univ, Sch Microelect, Tianjin, Peoples R China
[2] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep neural networks (DNNs); sparsity; training; hardware accelerator; FPGA;
D O I
10.1109/ASAP61560.2024.00019
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Training deep neural networks (DNNs) on edge devices is appealing for its adaptability and privacy benefits, but it faces challenges due to the limited resources and energy available on edge devices. In this paper, we propose MSCA, a Multi-grained Sparsity Convolution Accelerator. MSCA exploits both coarse-grained and fine-grained sparsity during the DNN training phases through two types of well-designed units. Experimental results show that MSCA implemented on FPGA achieves 218.03 GOPS throughput, 39.8 GOPS/W energy efficiency, and 4.0-6.2x speedup over dense accelerators for training VGG-8 and ResNet-10 on the CIFAR-10 and SVHN datasets.
引用
收藏
页码:34 / 35
页数:2
相关论文
共 50 条
  • [41] Towards Multi-Grained Explainability for Graph Neural Networks
    Wang, Xiang
    Wu, Ying-Xin
    Zhang, An
    He, Xiangnan
    Chua, Tat-Seng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [42] MCPS: a mapping method for MAERI accelerator base on Cartesian Product based Convolution for DNN layers with sparse input feature map
    Narimanjahan, Babak
    Reshadi, Midia
    Khademzadeh, Ahmad
    Reza, Akram
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (05): : 3213 - 3230
  • [43] Modeling Multi-Grained User Preference in Location Visitation
    Qin, Yingrong
    Gao, Chen
    Tu, Zhen
    Wu, Hongsheng
    Wei, Shuangqing
    Wang, Yue
    Zhang, Lin
    Li, Yong
    31ST ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS, ACM SIGSPATIAL GIS 2023, 2023, : 436 - 445
  • [44] MULTI-GRAINED DEEP FEATURE LEARNING FOR PEDESTRIAN DETECTION
    Lin, Chunze
    Lu, Jiwen
    Zhou, Jie
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [45] Multi-Grained Guaranteeable Requirement Analysis for Iterative Adaptation
    Li, Jialong
    Yamauchi, Takuto
    Hirano, Takanori
    Cai, Jinyu
    Tei, Kenji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2025, E108D (04) : 360 - 370
  • [46] A Silicon Photonic Multi-DNN Accelerator
    Li, Yuan
    Louri, Ahmed
    Karanth, Avinash
    2023 32ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PACT, 2023, : 238 - 249
  • [47] Multi-grained Aspect Fusion for Review Response Generation
    Yuan, Yun
    Gong, Chen
    Kong, Dexin
    Yu, Nan
    Fu, Guohong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IX, 2023, 14262 : 25 - 37
  • [48] Exploiting multi-grained parallelism in reconfigurable SBC architectures
    Zambreno, J
    Honbo, D
    Choudhary, A
    FCCM 2005: 13TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2005, : 333 - 334
  • [49] mmLayout: Multi-grained MultiModal Transformer for Document Understanding
    Wang, Wenjin
    Huang, Zhengjie
    Luo, Bin
    Chen, Qianglong
    Peng, Qiming
    Pan, Yinxu
    Yin, Weichong
    Feng, Shikun
    Sun, Yu
    Yu, Dianhai
    Zhang, Yin
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4877 - 4886
  • [50] A Multi-grained Dataset for News Event Triggered Knowledge Update
    Lee, Yu-Ting
    Tang, Ying-Jhe
    Cheng, Yu-Chung
    Chen, Pai-Lin
    Li, Tsai-Yen
    Huang, Hen-Hsen
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4158 - 4162