A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge

被引:1
|
作者
Mahmud, Hasanul [1 ]
Kang, Peng [1 ]
Desai, Kevin [1 ]
Lama, Palden [1 ]
Prasad, Sushil K. [1 ]
机构
[1] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
关键词
Energy-efficiency; Deep Neural Networks; Edge Computing; Early-exit DNNs; Converting Autoencoder;
D O I
10.1109/IPDPSW63119.2024.00117
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Reducing inference time and energy usage while maintaining prediction accuracy has become a significant concern for deep neural networks (DNN) inference on resourcecon-strained edge devices. To address this problem, we propose a novel approach based on "converting" autoencoder and lightweight DNNs. This improves upon recent work such as early-exiting framework and DNN partitioning. Early-exiting frameworks spend different amounts of computation power for different input data depending upon their complexity. However, they can be inefficient in real-world scenarios that deal with many hard image samples. On the other hand, DNN partitioning algorithms that utilize the computation power of both the cloud and edge devices can be affected by network delays and intermittent connections between the cloud and the edge. We present CBNet, a low-latency and energy-efficient DNN inference framework tailored for edge devices. It utilizes a "converting" autoencoder to efficiently transform hard images into easy ones, which are subsequently processed by a lightweight DNN for inference. To the best of our knowledge, such autoencoder has not been proposed earlier. Our experimental results using three popular image-classification datasets on a Raspberry Pi 4, a Google Cloud instance, and an instance with Nvidia Tesla K80 GPU show that CBNet achieves up to 4.8 x speedup in inference latency and 79% reduction in energy usage compared to competing techniques while maintaining similar or higher accuracy.
引用
收藏
页码:592 / 599
页数:8
相关论文
共 50 条
  • [41] Energy-Efficient Approximate Edge Inference Systems
    Ghosh, Soumendu Kumar
    Raha, Arnab
    Raghunathan, Vijay
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (04)
  • [42] Low-Latency and Energy-Efficient Data Preservation Mechanism in Low-Duty-Cycle Sensor Networks
    Jiang, Chan
    Li, Tao-Shen
    Liang, Jun-Bin
    Wu, Heng
    SENSORS, 2017, 17 (05):
  • [43] Energy-efficient, low-latency, and non-contact eye blink detection with capacitive sensing
    Liu, Mengxi
    Bian, Sizhen
    Zhao, Zimin
    Zhou, Bo
    Lukowicz, Paul
    FRONTIERS IN COMPUTER SCIENCE, 2024, 6
  • [44] A Low-Latency and Energy-Efficient Multimetric Routing Protocol Based on Network Connectivity in VANET Communication
    Wang, Xiaobo
    Weng, Yu
    Gao, Honghao
    IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2021, 5 (04): : 1761 - 1776
  • [45] A Buffer-Aware Finite Blocklength Coding Scheme for Low-Latency Energy-Efficient Communications
    Liu, Yuanrui
    Zhao, Xiaoyu
    Chen, Wei
    Zhang, Ying-Jun Angela
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 4734 - 4739
  • [46] Distributed loss-compensation techniques for energy-efficient low-latency on-chip communication
    Jose, Anup P.
    Shepard, Kenneth L.
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2007, 42 (06) : 1415 - 1424
  • [47] Tree-Model Based Mapping for Energy-Efficient and Low-Latency Network-on-Chip
    Yang, Bo
    Xu, Thomas Canhao
    Santti, Tero
    Plosila, Juha
    PROCEEDINGS OF THE 13TH IEEE SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS AND SYSTEMS, 2010, : 189 - 192
  • [48] Scalable Energy-Efficient, Low-Latency Implementations of Trained Spiking Deep Belief Networks on SpiNNaker
    Stromatias, Evangelos
    Neil, Daniel
    Galluppi, Francesco
    Pfeiffer, Michael
    Liu, Shih-Chii
    Furber, Steve
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [49] Fedab: A Low-Latency Energy-Efficient Proactive Neighbor Discovery Protocol in MLDC-WSN
    Sun, Haibin
    Meng, Ziran
    Wang, Dong
    Li, Hongxing
    IEEE ACCESS, 2023, 11 : 22843 - 22854
  • [50] A Co-Design-Based Reliable Low-Latency and Energy-Efficient Transmission Protocol for UWSNs
    Wei, Xiaohui
    Guo, Hao
    Wang, Xingwang
    Wang, Xiaonan
    Wang, Chu
    Guizani, Mohsen
    Du, Xiaojiang
    SENSORS, 2020, 20 (21) : 1 - 22