NVM-Enhanced Machine Learning Inference in 6G Edge Computing

Cited by: 4
Authors
Shang, Xiaojun [1]
Huang, Yaodong [1]
Liu, Zhenhua [1]
Yang, Yuanyuan [1]
Affiliation
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
Funding
US National Science Foundation
Keywords
Nonvolatile memory; Servers; Machine learning; Memory management; Cloud computing; Random access memory; Real-time systems; AI-based edge computing; 6G network; Machine learning inference; STATISTICAL DELAY; M-MIMO; INTELLIGENCE; NETWORKS; VISION;
DOI
10.1109/TNSE.2021.3109538
Chinese Library Classification number
T [Industrial Technology];
Discipline classification code
08 ;
Abstract
With the increasing popularization of smart terminals and real-time interactive applications, fast-growing technical requirements push both academia and industry to look beyond 5G and conceptualize the sixth generation (6G) mobile network. Artificial intelligence (AI) with machine learning capabilities at the edge is a crucial component of a 6G mobile network, making possible various time-sensitive and high-stakes services such as smart security, virtual reality, and self-driving vehicles. However, resource constraints, especially the limited memory of edge servers, are major obstacles to deploying machine learning services at the edge. Fortunately, the new generation of non-volatile memory (NVM) provides affordable memory resources that can be easily attached to existing edge servers. In this paper, we propose a novel machine learning application placement scheme that uses NVM technology at the edge to reduce end-to-end latency. Specifically, the proposed NVM-enhanced placement scheme takes into consideration the latency of various machine learning applications over both NVM devices and the network. The corresponding optimization problem is NP-hard. Therefore, we develop a novel approximation algorithm with both low computational complexity and theoretical guarantees. Experiments and extensive simulations using real-world applications show that our scheme provides significantly lower end-to-end latency than existing baselines.
Pages: 5615 - 5626
Page count: 12
Related papers
50 in total
  • [31] Enhancing the robustness of object detection via 6G vehicular edge computing
    Chen Chen
    Guorun Yao
    Chenyu Wang
    Sotirios Goudos
    Shaohua Wan
    Digital Communications and Networks, 2022, 8 (06) : 923 - 931
  • [32] Guest Editorial The Nexus Between Edge Computing and AI for 6G Networks
    Zhou, Zhi
    Niyato, Dusit
    Xiong, Zehui
    Gong, Xiaowen
    Saad, Walid
    Fu, Xiaoming
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (03) : 1186 - 1189
  • [33] Secure and Personalized Edge Computing Services in 6G Heterogeneous Vehicular Networks
    Hui, Yilong
    Cheng, Nan
    Su, Zhou
    Huang, Yuanhao
    Zhao, Pincan
    Luan, Tom H.
    Li, Changle
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (08) : 5920 - 5931
  • [34] Application of Cybertwin for Offloading in Mobile Multiaccess Edge Computing for 6G Networks
    Rodrigues, Tiago Koketsu
    Liu, Jiajia
    Kato, Nei
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (22) : 16231 - 16242
  • [35] FedCPF: An Efficient-Communication Federated Learning Approach for Vehicular Edge Computing in 6G Communication Networks
    Liu, Su
    Yu, Jiong
    Deng, Xiaoheng
    Wan, Shaohua
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) : 1616 - 1629
  • [36] Overview of Distributed Machine Learning Techniques for 6G Networks
    Muscinelli, Eugenio
    Shinde, Swapnil Sadashiv
    Tarchi, Daniele
    ALGORITHMS, 2022, 15 (06)
  • [37] When Machine Learning Meets Privacy in 6G: A Survey
    Sun, Yuanyuan
    Liu, Jiajia
    Wang, Jiadai
    Cao, Yurui
    Kato, Nei
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2020, 22 (04) : 2694 - 2724
  • [38] Waveform Management Approach With Machine Learning for 6G Systems
    Islam Demir, Yusuf
    Yazar, Ahmet
    Arslan, Huseyin
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (05) : 5432 - 5444
  • [39] Orbital Edge Computing: Machine Inference in Space
    Denby, Bradley
    Lucia, Brandon
    IEEE COMPUTER ARCHITECTURE LETTERS, 2019, 18 (01) : 59 - 62
  • [40] Edge Computing Platform with Efficient Migration Scheme for 5G/6G Networks
    Ateya A.A.
    Alhussan A.A.
    Abdallah H.A.
    Al duailij M.A.
    Khakimov A.
    Muthanna A.
    Computer Systems Science and Engineering, 2023, 45 (02) : 1775 - 1787