MSCA: A Multi-grained Sparse Convolution Accelerator for DNN Training

Authors
Mao, Yingchang [1]
Liu, Qiang [1]
Cheung, Ray C. C. [2]
Affiliations
[1] Tianjin Univ, Sch Microelect, Tianjin, Peoples R China
[2] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Deep neural networks (DNNs); sparsity; training; hardware accelerator; FPGA;
DOI
10.1109/ASAP61560.2024.00019
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Training deep neural networks (DNNs) on edge devices is appealing for its adaptability and privacy benefits, but it faces challenges due to the limited resources and energy available on edge devices. In this paper, we propose MSCA, a Multi-grained Sparsity Convolution Accelerator. MSCA exploits both coarse-grained and fine-grained sparsity during the DNN training phases through two types of well-designed units. Experimental results show that MSCA implemented on FPGA achieves 218.03 GOPS throughput, 39.8 GOPS/W energy efficiency, and 4.0-6.2x speedup over dense accelerators for training VGG-8 and ResNet-10 on the CIFAR-10 and SVHN datasets.
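The abstract does not describe how the two unit types work, so the following is only a minimal sketch of the general distinction between coarse-grained and fine-grained sparsity, not the MSCA design. It assumes coarse-grained sparsity means whole all-zero channels and fine-grained sparsity means individual zero values. The function name `mac_counts`, the parameter `macs_per_element`, and the sample data are hypothetical.

```python
def mac_counts(channels, macs_per_element):
    """Count multiply-accumulates under three execution models.

    Assumption for illustration: each activation value feeds
    `macs_per_element` MACs (e.g. 9 for a 3x3 kernel).
    """
    # Dense execution: every value is processed, zero or not.
    dense = sum(len(ch) for ch in channels) * macs_per_element

    # Coarse-grained skipping: drop channels that are entirely zero.
    live = [ch for ch in channels if any(v != 0 for v in ch)]
    coarse = sum(len(ch) for ch in live) * macs_per_element

    # Fine-grained skipping: within the surviving channels,
    # also drop each individual zero value.
    fine = sum(1 for ch in live for v in ch if v != 0) * macs_per_element

    return dense, coarse, fine


# Hypothetical activations: one all-zero channel, two partly sparse, one dense.
channels = [
    [0, 0, 0, 0],
    [1, 0, 2, 0],
    [0, 0, 0, 3],
    [4, 5, 6, 7],
]

dense, coarse, fine = mac_counts(channels, macs_per_element=9)
print(dense, coarse, fine)  # 144 108 63
```

In this toy case, skipping at both granularities cuts the work from 144 to 63 MACs. The figure has no relation to the speedups reported in the abstract, which depend on the actual sparsity of the trained networks and on the hardware.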
Pages: 34-35
Number of pages: 2