NVM-Enhanced Machine Learning Inference in 6G Edge Computing

Cited by: 4

Authors:

Shang, Xiaojun [1]

Huang, Yaodong [1]

Liu, Zhenhua [1]

Yang, Yuanyuan [1]

Affiliation:

[1] SUNY Stony Brook, Stony Brook, NY 11794 USA

Source:

IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2024 / Vol. 11 / Issue 06

Funding:

U.S. National Science Foundation

Keywords:

Nonvolatile memory; Servers; Machine learning; Memory management; Cloud computing; Random access memory; Real-time systems; AI-based edge computing; 6G network; Machine learning inference; STATISTICAL DELAY; M-MIMO; INTELLIGENCE; NETWORKS; VISION;

DOI:

10.1109/TNSE.2021.3109538

Chinese Library Classification:

T [Industrial Technology]

Discipline classification code:

08

Abstract:

With the increasing popularization of smart terminals and real-time interactive applications, fast-growing technical requirements push both academia and industry to look beyond 5G and conceptualize the sixth generation (6G) mobile network. Artificial intelligence (AI) with machine learning capabilities at the edge is one crucial component of a 6G mobile network, making various time-sensitive and high-stakes services possible, e.g., smart security, virtual reality, and self-driving vehicles. However, resource constraints, especially the memory limitations of edge servers, are major obstacles to deploying machine learning services at the edge. Fortunately, the new generation of non-volatile memory (NVM) provides affordable memory resources that can be easily attached to existing edge servers. In this paper, we propose a novel machine learning application placement scheme that uses NVM technology at the edge to reduce end-to-end latency. Specifically, the proposed NVM-enhanced placement scheme takes into consideration the latency of various machine learning applications over NVM devices and the network. The corresponding optimization problem is NP-hard. Therefore, we develop a novel approximation algorithm with both low computational complexity and theoretical guarantees. Experiments and extensive simulations using real-world applications show that our scheme provides significantly lower end-to-end latency than existing baselines.


Pages: 5615 - 5626

Page count: 12

50 items in total

[1] Aerial edge computing for 6G
Sun, Mao
Yan, Zhang
Journal of China Universities of Posts and Telecommunications, 2022, 29 (01) : 50 - 63
[2] Aerial edge computing for 6G
Mao Sun
Zhang Yan
The Journal of China Universities of Posts and Telecommunications, 2022, 29 (01) : 50 - 63
[3] Learning IoV in 6G: Intelligent Edge Computing for Internet of Vehicles in 6G Wireless Communications
Li, He
Ota, Kaoru
Dong, Mianxiong
IEEE WIRELESS COMMUNICATIONS, 2023, 30 (06) : 96 - 101
[4] Reinforcement Learning-Empowered Mobile Edge Computing for 6G Edge Intelligence
Wei, Peng
Guo, Kun
Li, Ye
Wang, Jue
Feng, Wei
Jin, Shi
Ge, Ning
Liang, Ying-Chang
IEEE ACCESS, 2022, 10 : 65156 - 65192
[5] Personalized Vehicular Edge Computing in 6G
Hui, Yilong
Cheng, Nan
Huang, Yuanhao
Chen, Rui
Xiao, Xiao
Li, Changle
Mao, Guoqiang
IEEE NETWORK, 2021, 35 (06) : 278 - 284
[6] Edge Computing in the Internet of Things: A 6G Perspective
Ishtiaq, Mariam
Saeed, Nasir
Khan, Muhammad Asif
IT PROFESSIONAL, 2024, 26 (05) : 62 - 70
[7] Enhancing Edge Computing with Unikernels in 6G Networks
Yazdani, Syed
Ramzan, Naeem
Olivier, Pierre
2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
[8] Collaborative Machine Learning for Energy-Efficient Edge Networks in 6G
Huang, Xiaoyan
Zhang, Ke
Wu, Fan
Leng, Supeng
IEEE NETWORK, 2021, 35 (06) : 12 - 19
[9] Split Learning in 6G Edge Networks
Lin, Zheng
Qu, Guanqiao
Chen, Xianhao
Huang, Kaibin
IEEE WIRELESS COMMUNICATIONS, 2024, 31 (04) : 170 - 176
[10] Machine learning and quantum computing for 5G/6G communication networks - A survey
M S.
International Journal of Intelligent Networks, 2022, 3 : 197 - 203
