NVM-Enhanced Machine Learning Inference in 6G Edge Computing

被引:4
|
作者
Shang, Xiaojun [1 ]
Huang, Yaodong [1 ]
Liu, Zhenhua [1 ]
Yang, Yuanyuan [1 ]
机构
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
基金
美国国家科学基金会;
关键词
Nonvolatile memory; Servers; Machine learning; Memory management; Cloud computing; Random access memory; Real-time systems; AI-based edge computing; 6G network; Machine learning inference; STATISTICAL DELAY; M-MIMO; INTELLIGENCE; NETWORKS; VISION;
D O I
10.1109/TNSE.2021.3109538
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
With the increasing popularization of smart terminals and real-time interactive applications, fast growing technical requirements push both academia and industry to look beyond 5G and conceptualize the sixth generation (6G) mobile network. Artificial intelligence (AI) with machine learning capacities at the edge is one crucial component of a 6G mobile network which makes various time-sensitive and high-stake services possible, e.g., smart security, virtual reality, self-driving vehicles. However, resource constraints, especially the memory limitation of edge servers, become major obstacles to deploying machine learning services at the edge. Fortunately, the new generation of non-volatile memory (NVM) provides new affordable memory resources that can be easily attached to existing edge servers. In this paper, we propose a novel machine learning application placement scheme using the NVM technology at the edge to reduce the end-to-end latency. Specifically, the proposed NVM-enhanced placement scheme takes into consideration the latency of various machine learning applications over NVM devices and the network. The corresponding optimization problem is exceedingly challenging, i.e., NP-hard. Therefore, we developed a novel approximation algorithm with both low computational complexity and theoretical guarantees. Experiments and extensive simulations using real-world applications highlight that our scheme provides significantly lower end-to-end latency compared with existing baselines.
引用
收藏
页码:5615 / 5626
页数:12
相关论文
共 50 条
  • [41] Service Migration Algorithm for Distributed Edge Computing in 5G/6G Networks
    Kuznetsov, Konstantin
    Kuzmina, Ekaterina
    Lapteva, Tatiana
    Volkov, Artem
    Muthanna, Ammar
    Aziz, Ahmed
    INTERNET OF THINGS, SMART SPACES, AND NEXT GENERATION NETWORKS AND SYSTEMS, PT I, NEW2AN 2023, RUSMART 2023, 2024, 14542 : 320 - 337
  • [42] ECBA-MLI: Edge Computing Benchmark Architecture for Machine Learning Inference
    Schneider, Mathias
    Prokscha, Ruben
    Saadani, Seifeddine
    Hoess, Alfred
    2022 IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING & COMMUNICATIONS (IEEE EDGE 2022), 2022, : 23 - 32
  • [43] Ambient Computing for 6G Era
    Tenny, Nathan
    Kim, Bongho
    Shariat, Mehrdad
    Hsu, Alex
    Calin, Doru
    IEEE WIRELESS COMMUNICATIONS, 2025, 32 (02) : 10 - 12
  • [44] Edge Intelligence for 6G Networks
    Zheng, Haifeng
    Gao, Lin
    Chen, Zhiyong
    Xiao, Liang
    CHINA COMMUNICATIONS, 2022, 19 (08) : III - V
  • [45] FedRelay: Federated Relay Learning for 6G Mobile Edge Intelligence
    Li, Peichun
    Zhong, Yupei
    Zhang, Chaorui
    Wu, Yuan
    Yu, Rong
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (04) : 5125 - 5138
  • [46] Edge Intelligence for 6G Networks
    Haifeng Zheng
    Lin Gao
    Zhiyong Chen
    Liang Xiao
    China Communications, 2022, 19 (08) : 3 - 5
  • [47] 6G IoT Tracking- and Machine Learning-Enhanced Blockchained Supply Chain Management
    Liang, Wei
    Zhang, Lei
    Kadoch, Michel
    ELECTRONICS, 2023, 12 (01)
  • [48] Edge Cloud Server Deployment With Transmission Power Control Through Machine Learning for 6G Internet of Things
    Rodrigues, Tiago Koketsu
    Suto, Katsuya
    Kato, Nei
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (04) : 2099 - 2108
  • [49] IMPLICIT CHANNEL LEARNING FOR MACHINE LEARNING APPLICATIONS IN 6G WIRELESS NETWORKS
    Elbir, Ahmet M.
    Shi, Wei
    Mishra, Kumar Vijay
    Papazafeiropoulos, Anastasios K.
    Chatzinotas, Symeon
    2023 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW, 2023,
  • [50] Machine Learning and Enhanced Encryption for Edge Computing in IoT and Wireless Networks
    Hardas, Bhalchandra M.
    Raut, Vaishali
    Palsodkar, Prasanna
    Aush, Mithun G.
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (01) : 200 - 210