A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge

被引:1
|
作者
Mahmud, Hasanul [1 ]
Kang, Peng [1 ]
Desai, Kevin [1 ]
Lama, Palden [1 ]
Prasad, Sushil K. [1 ]
机构
[1] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
关键词
Energy-efficiency; Deep Neural Networks; Edge Computing; Early-exit DNNs; Converting Autoencoder;
D O I
10.1109/IPDPSW63119.2024.00117
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Reducing inference time and energy usage while maintaining prediction accuracy has become a significant concern for deep neural networks (DNN) inference on resourcecon-strained edge devices. To address this problem, we propose a novel approach based on "converting" autoencoder and lightweight DNNs. This improves upon recent work such as early-exiting framework and DNN partitioning. Early-exiting frameworks spend different amounts of computation power for different input data depending upon their complexity. However, they can be inefficient in real-world scenarios that deal with many hard image samples. On the other hand, DNN partitioning algorithms that utilize the computation power of both the cloud and edge devices can be affected by network delays and intermittent connections between the cloud and the edge. We present CBNet, a low-latency and energy-efficient DNN inference framework tailored for edge devices. It utilizes a "converting" autoencoder to efficiently transform hard images into easy ones, which are subsequently processed by a lightweight DNN for inference. To the best of our knowledge, such autoencoder has not been proposed earlier. Our experimental results using three popular image-classification datasets on a Raspberry Pi 4, a Google Cloud instance, and an instance with Nvidia Tesla K80 GPU show that CBNet achieves up to 4.8 x speedup in inference latency and 79% reduction in energy usage compared to competing techniques while maintaining similar or higher accuracy.
引用
收藏
页码:592 / 599
页数:8
相关论文
共 50 条
  • [21] A Low-Latency and Energy-Efficient MAC Protocol for Cooperative Wireless Sensor Networks
    Duc-Long Nguyen
    Le Quang Vinh Tran
    Berder, Olivier
    Sentieys, Olivier
    2013 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2013, : 3826 - 3831
  • [22] An Adaptive Energy-Efficient and Low-Latency MAC Protocol for Wireless Sensor Networks
    Liu, Hao
    Yao, Guoliang
    Wu, Jianhui
    Shi, Longxing
    JOURNAL OF COMMUNICATIONS AND NETWORKS, 2010, 12 (05) : 510 - 517
  • [23] A Low-Latency and Energy-Efficient Neighbor Discovery Algorithm for Wireless Sensor Networks
    Gu, Zhaoquan
    Cao, Zhen
    Tian, Zhihong
    Wang, Yuexuan
    Du, Xiaojiang
    Mohsen, Guizani
    SENSORS, 2020, 20 (03)
  • [24] ELECTION: Energy-efficient and Low-latEncy sCheduling Technique for wIreless sensOr Networks
    Begum, S
    Wang, SC
    Krishnamachari, B
    Helmy, A
    LCN 2004: 29TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON LOCAL COMPUTER NETWORKS, PROCEEDINGS, 2004, : 60 - 67
  • [25] An energy-efficient and low-latency sink positioning approach for wireless sensor networks
    Kong, Fanrui
    Li, Chunwen
    Zhao, Xuedong
    Ding, Qingqing
    Jiao, Fei
    Gu, Qibin
    MOBILE AD-HOC AND SENSOR NETWORKS, PROCEEDINGS, 2007, 4864 : 123 - +
  • [26] Low-latency and energy-efficient scheduling in fog-based IoT applications
    Rahbari, Dadmehr
    Nickray, Mohsen
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (02) : 1406 - 1427
  • [27] On energy-efficient and low-latency medium access control in wireless sensor network
    Wan, Zhiwen
    Zhang, Jinsong
    Zhu, Hao
    Makki, Kia
    Pissinou, Niki
    WCNC 2008: IEEE WIRELESS COMMUNICATIONS & NETWORKING CONFERENCE, VOLS 1-7, 2008, : 1905 - +
  • [28] A Novel Low-Latency and Energy-Efficient Task Scheduling Framework for Internet of Medical Things in an Edge Fog Cloud System
    Alatoun, Kholoud
    Matrouk, Khaled
    Mohammed, Mazin Abed
    Nedoma, Jan
    Martinek, Radek
    Zmij, Petr
    SENSORS, 2022, 22 (14)
  • [29] Energy-Efficient Mapping for a Network of DNN Models at the Edge
    Ghasemi, Mehdi
    Heidari, Soroush
    Kim, Young Geun
    Lamb, Aaron
    Wu, Carole-Jean
    Vrudhula, Sarma
    2021 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2021), 2021, : 25 - 30
  • [30] FEECA: Design Space Exploration for Low-Latency and Energy-Efficient Capsule Network Accelerators
    Marchisio, Alberto
    Mrazek, Vojtech
    Hanif, Muhammad Abdullah
    Shafique, Muhammad
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2021, 29 (04) : 716 - 729