NVM-Enhanced Machine Learning Inference in 6G Edge Computing

被引:4
|
作者
Shang, Xiaojun [1 ]
Huang, Yaodong [1 ]
Liu, Zhenhua [1 ]
Yang, Yuanyuan [1 ]
机构
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
基金
美国国家科学基金会;
关键词
Nonvolatile memory; Servers; Machine learning; Memory management; Cloud computing; Random access memory; Real-time systems; AI-based edge computing; 6G network; Machine learning inference; STATISTICAL DELAY; M-MIMO; INTELLIGENCE; NETWORKS; VISION;
D O I
10.1109/TNSE.2021.3109538
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
With the increasing popularization of smart terminals and real-time interactive applications, fast growing technical requirements push both academia and industry to look beyond 5G and conceptualize the sixth generation (6G) mobile network. Artificial intelligence (AI) with machine learning capacities at the edge is one crucial component of a 6G mobile network which makes various time-sensitive and high-stake services possible, e.g., smart security, virtual reality, self-driving vehicles. However, resource constraints, especially the memory limitation of edge servers, become major obstacles to deploying machine learning services at the edge. Fortunately, the new generation of non-volatile memory (NVM) provides new affordable memory resources that can be easily attached to existing edge servers. In this paper, we propose a novel machine learning application placement scheme using the NVM technology at the edge to reduce the end-to-end latency. Specifically, the proposed NVM-enhanced placement scheme takes into consideration the latency of various machine learning applications over NVM devices and the network. The corresponding optimization problem is exceedingly challenging, i.e., NP-hard. Therefore, we developed a novel approximation algorithm with both low computational complexity and theoretical guarantees. Experiments and extensive simulations using real-world applications highlight that our scheme provides significantly lower end-to-end latency compared with existing baselines.
引用
收藏
页码:5615 / 5626
页数:12
相关论文
共 50 条
  • [1] Aerial edge computing for 6G
    Sun, Mao
    Yan, Zhang
    Journal of China Universities of Posts and Telecommunications, 2022, 29 (01): : 50 - 63
  • [2] Aerial edge computing for 6G
    Mao Sun
    Zhang Yan
    TheJournalofChinaUniversitiesofPostsandTelecommunications, 2022, 29 (01) : 50 - 63
  • [3] Learning IoV in 6G: Intelligent Edge Computing for Internet of Vehicles in 6G Wireless Communications
    Li, He
    Ota, Kaoru
    Dong, Mianxiong
    IEEE WIRELESS COMMUNICATIONS, 2023, 30 (06) : 96 - 101
  • [4] Reinforcement Learning-Empowered Mobile Edge Computing for 6G Edge Intelligence
    Wei, Peng
    Guo, Kun
    Li, Ye
    Wang, Jue
    Feng, Wei
    Jin, Shi
    Ge, Ning
    Liang, Ying-Chang
    IEEE ACCESS, 2022, 10 : 65156 - 65192
  • [5] Personalized Vehicular Edge Computing in 6G
    Hui, Yilong
    Cheng, Nan
    Huang, Yuanhao
    Chen, Rui
    Xiao, Xiao
    Li, Changle
    Mao, Guoqiang
    IEEE NETWORK, 2021, 35 (06): : 278 - 284
  • [6] Edge Computing in the Internet of Things: A 6G Perspective
    Ishtiaq, Mariam
    Saeed, Nasir
    Khan, Muhammad Asif
    IT PROFESSIONAL, 2024, 26 (05) : 62 - 70
  • [7] Enhancing Edge Computing with Unikernels in 6G Networks
    Yazdani, Syed
    Ramzan, Naeem
    Olivier, Pierre
    2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
  • [8] Collaborative Machine Learning for Energy-Efficient Edge Networks in 6G
    Huang, Xiaoyan
    Zhang, Ke
    Wu, Fan
    Leng, Supeng
    IEEE NETWORK, 2021, 35 (06): : 12 - 19
  • [9] Split Learning in 6G Edge Networks
    Lin, Zheng
    Qu, Guanqiao
    Chen, Xianhao
    Huang, Kaibin
    IEEE WIRELESS COMMUNICATIONS, 2024, 31 (04) : 170 - 176
  • [10] Machine learning and quantum computing for 5G/6G communication networks - A survey
    M S.
    International Journal of Intelligent Networks, 2022, 3 : 197 - 203