Impact of Joint Heat and Memory Constraints of Mobile Device in Edge-Assisted On-Device Artificial Intelligence

被引:2
|
作者
Choi, Pyeongjun [1 ]
Kim, Jeongsoo [1 ]
Kwak, Jeongho [1 ]
机构
[1] DGIST, Daegu, South Korea
基金
新加坡国家研究基金会;
关键词
On-device AI; Offloaded analytics; Thermal and memory aware control; DVFS;
D O I
10.1145/3662004.3663555
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, consumer demand for artificial intelligence (AI) applications using deep neural network (DNN) model such as large language model (LLM), miXed Reality (XR), and AI assistants has been steadily increasing. Hitherto, on-device AI and offloaded analytics with the help of mobile edge computing (MEC) have been extensively studied to realize AI services on top of mobile devices. However, both technologies suffer from the limited resources of mobile devices, such as thermal resilience, battery capacity, and memory size. To tackle this problem, we first extensively examine the impact of heat and memory constraints of a mobile device when networking and processing resources and multi-dimensional DNN model sizes are dynamically managed for AI applications via motivating measurement. From the experimental results, we conjecture that the threshold-based approach for joint consideration of heat and memory constraints would increase the performance of AI applications in terms of energy, frames per second (FPS), and inference accuracy. Hence, we propose a threshold-based H&M algorithm that jointly adjusts offloading, Dynamic Voltage and Frequency Scaling (DVFS), and DNN model size, aiming to maximize inference accuracy while keeping target FPS with memory and heat constraints in various environments. Finally, we implement the proposed scheme on a mobile device and an MEC server and evaluate its performance and adaptability via extensive experiments.
引用
收藏
页码:31 / 36
页数:6
相关论文
共 47 条
  • [41] A newly developed transparent and flexible one-transistor memory device using advanced nanomaterials for medical and artificial intelligence applications
    Dai, Mingzhi
    Hu, Yongbin
    Huo, Changhe
    Webster, Thomas J.
    Guo, Liqiang
    INTERNATIONAL JOURNAL OF NANOMEDICINE, 2019, 14 : 5691 - 5696
  • [42] Multi-Agent Deep Reinforcement Learning Based Dynamic Task Offloading in a Device-to-Device Mobile-Edge Computing Network to Minimize Average Task Delay with Deadline Constraints
    He, Huaiwen
    Yang, Xiangdong
    Mi, Xin
    Shen, Hong
    Liao, Xuefeng
    SENSORS, 2024, 24 (16)
  • [43] Clinical validation of an artificial intelligence-assisted algorithm for automated quantification of left ventricular ejection fraction in real time by a novel handheld ultrasound device
    Papadopoulou, Stella-Lida
    Sachpekidis, Vasileios
    Kantartzi, Vasiliki
    Styliadis, Ioannis
    Nihoyannopoulos, Petros
    EUROPEAN HEART JOURNAL - DIGITAL HEALTH, 2022, 3 (01): : 29 - 37
  • [44] Read Disturb Evaluations of 3D NAND Flash for Highly Read-Intensive Edge-Computing Inference Device for Artificial Intelligence Applications
    Du, Pei-Ying
    Lue, Hang-Ting
    Hsu, Tzu-Hsuan
    Hsieh, Chih-Chang
    Chen, Wei-Chen
    Chang, Kuo-Ping
    Wang, Keh-Chung
    Lu, Chih-Yuan
    2019 IEEE 11TH INTERNATIONAL MEMORY WORKSHOP (IMW 2019), 2019, : 12 - 15
  • [45] A 28nm 128TFLOPS/W Computing-In-Memory Engine Supporting One-Shot Floating-Point NN Inference and On-Device Fine-Tuning for Edge AI
    Diao, Haikang
    Luo, Haoyang
    Song, Jiahao
    Xu, Bocheng
    Wang, Runsheng
    Wang, Yuan
    Tang, Xiyuan
    2024 IEEE CUSTOM INTEGRATED CIRCUITS CONFERENCE, CICC, 2024,
  • [46] Artificial intelligence-assisted remote detection of ST-elevation myocardial infarction using a mini-12-lead electrocardiogram device in prehospital ambulance care
    Chen, Ke-Wei
    Wang, Yu-Chen
    Liu, Meng-Hsuan
    Tsai, Being-Yuah
    Wu, Mei-Yao
    Hsieh, Po-Hsin
    Wei, Jung-Ting
    Shih, Edward S. C.
    Shiao, Yi-Tzone
    Hwang, Ming-Jing
    Wu, Ya-Lun
    Hsu, Kai-Cheng
    Chang, Kuan-Cheng
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2022, 9
  • [47] Corrigendum: Artificial intelligence-assisted remote detection of ST-elevation myocardial infarction using a mini-12-lead electrocardiogram device in prehospital ambulance care (vol 9, 1001982, 2022)
    Chen, Ke-Wei
    Wang, Yu-Chen
    Liu, Meng-Hsuan
    Tsai, Being-Yuah
    Wu, Mei-Yao
    Hsieh, Po-Hsin
    Wei, Jung-Ting
    Shih, Edward S. C.
    Shiao, Yi-Tzone
    Hwang, Ming-Jing
    Wu, Ya-Lun
    Hsu, Kai-Cheng
    Chang, Kuan-Cheng
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2022, 9