NVM-Enhanced Machine Learning Inference in 6G Edge Computing

Cited by: 4
Authors
Shang, Xiaojun [1]
Huang, Yaodong [1]
Liu, Zhenhua [1]
Yang, Yuanyuan [1]
Affiliation
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
Funding
U.S. National Science Foundation
Keywords
Nonvolatile memory; Servers; Machine learning; Memory management; Cloud computing; Random access memory; Real-time systems; AI-based edge computing; 6G network; Machine learning inference; Statistical delay; M-MIMO; Intelligence; Networks; Vision
DOI
10.1109/TNSE.2021.3109538
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
With the growing popularity of smart terminals and real-time interactive applications, fast-growing technical requirements are pushing both academia and industry to look beyond 5G and conceptualize the sixth-generation (6G) mobile network. Artificial intelligence (AI) with machine learning capabilities at the edge is a crucial component of a 6G mobile network, making possible various time-sensitive and high-stakes services such as smart security, virtual reality, and self-driving vehicles. However, resource constraints, especially the limited memory of edge servers, are major obstacles to deploying machine learning services at the edge. Fortunately, the new generation of non-volatile memory (NVM) provides affordable memory resources that can be easily attached to existing edge servers. In this paper, we propose a novel machine learning application placement scheme that uses NVM technology at the edge to reduce end-to-end latency. Specifically, the proposed NVM-enhanced placement scheme takes into account the latency of various machine learning applications over both NVM devices and the network. The corresponding optimization problem is NP-hard. We therefore develop a novel approximation algorithm with both low computational complexity and theoretical guarantees. Experiments and extensive simulations using real-world applications show that our scheme achieves significantly lower end-to-end latency than existing baselines.
Pages: 5615-5626
Page count: 12