An Efficient Deep-Learning-Based Super-Resolution Accelerating SoC With Heterogeneous Accelerating and Hierarchical Cache

被引:6
|
作者
Li, Zhiyong [1 ]
Kim, Sangjin [1 ]
Im, Dongseok [1 ]
Han, Donghyeon [1 ]
Yoo, Hoi-Jun [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon 34141, South Korea
关键词
Convolutional neural networks; System-on-chip; Image reconstruction; Hardware; Superresolution; Optimization; Feature extraction; Convolutional neural network (CNN); depth-first layer fusion; heterogeneous caching; heterogeneous processing; hierarchical cache; hybrid-precision; super-resolution (SR); system-on-chip (SoC); IMAGE SUPERRESOLUTION;
D O I
10.1109/JSSC.2022.3224964
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article presents an energy-efficient accelerating system-on-chip (SoC) for super-resolution (SR) image reconstruc-tion on a mobile platform. With the rise of contactless commu-nication and streaming services, the need for SR is growing. As one of the most basic low-level image processing algorithms, SR can reconstruct high-quality images from low-quality images which are noisy, compressed, or with damaged pixels. However, a massive amount of computation and considerable precision of pixel data pose challenges for acceleration in a resource and bandwidth constrained platform. SR has high energy consump-tion and long latency. While previous neural processing units (NPUs) reduced the precision to increase the efficiency and accelerate convolutional neural network (CNN) computation, few of them concentrated on both the output image quality and the performance of the entire system. The proposed SR SoC restores the high-quality image using a precision-optimized SR algorithm on an energy-efficient accelerating architecture and cache subsystem. It contributes three algorithm-hardware co-optimized features: 1) heterogeneous accelerating architecture (HAA) with only 8-bit floating-point (FP)-and-fixed-point (FXP) hybrid-precision for SR task; 2) tile-based hierarchical cache (THC) subsystem for the low energy and small footprint cost layer fusion; and 3) heterogeneous L1 data lifetime-aware optimized cache (DLOC) for the energy-efficient on-chip memory access. The prototype of SR SoC is fabricated in 65-nm technology and occupies a 10.0-mm2 die area. The proposed SR SoC can maintain the high reconstruction quality while consuming only 19% of the energy of an FXP16 system with homogeneous NPU. As a result, the SR SoC presents 2.6x higher energy efficiency than the previous SR targeting NPU and achieves 107-frame-per-second (fps) framerates running 4x SR image generation to full high definition (FHD) scale at only 0.92-mJ/frame energy
引用
收藏
页码:614 / 623
页数:10
相关论文
共 50 条
  • [21] A deep-learning-based compact method for accelerating the electrowetting lattice Boltzmann simulations
    Zhuang, Zijian
    Xu, Qin
    Zeng, Hanxian
    Pan, Yongcai
    Wen, Binghai
    PHYSICS OF FLUIDS, 2024, 36 (04)
  • [22] Deep Learning based Frameworks for Image Super-Resolution and Noise-Resilient Super-Resolution
    Sharma, Manoj
    Chaudhury, Santanu
    Lall, Brejesh
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 744 - 751
  • [23] Super-resolution photoacoustic microscopy based on deep learning
    Wang, Zhuangzhuang
    Li, Sihang
    Song, Xianlin
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2021, 2021, 11736
  • [24] Accelerating Image Super-Resolution Networks with Pixel-Level Classification
    Jeong, Jinho
    Kim, Jinwoo
    Jo, Younghyun
    Joo, Seon
    COMPUTER VISION - ECCV 2024, PT III, 2025, 15061 : 236 - 251
  • [25] Accelerating Image Super-Resolution Regression by a Hybrid Implementation in Mobile Devices
    Amanatiadis, Angelos
    Bampis, Loukas
    Gasteratos, Antonios
    2014 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2014, : 337 - 338
  • [26] Deep learning for image super-resolution
    Yang, Wenming
    Zhou, Fei
    Zhu, Rui
    Fukui, Kazuhiro
    Wang, Guijin
    Xue, Jing-Hao
    NEUROCOMPUTING, 2020, 398 (398) : 291 - 292
  • [27] Efficient Look-Up Table from Expanded Convolutional Network for Accelerating Image Super-resolution
    Yin, Kai
    Shen, Jie
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6720 - 6728
  • [28] Deep-Learning-Based Super-Resolution of Video Satellite Imagery by the Coupling of Multiframe and Single-Frame Models
    Shen, Huanfeng
    Qiu, Zhonghang
    Yue, Linwei
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [29] Accelerating Super-Resolution Network Inference via Sensitivity-Based Weight Sparsity Allocation
    Nguyen, Tuan Nghia
    Nguyen, Xuan Truong
    Lee, Kyujoong
    Lee, Hyuk-Jae
    IEEE ACCESS, 2023, 11 : 122962 - 122973
  • [30] Accelerating Neural Style-Transfer Using Contrastive Learning for Unsupervised Satellite Image Super-Resolution
    Mishra, Divya
    Hadar, Ofer
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61