Mobile Generative AI: Opportunities and Challenges

被引:0
|
作者
Zhang, Ye [1 ]
Zhang, Jinrui [2 ]
Yue, Sheng [2 ]
Lu, Wei [1 ]
Ren, Ju [2 ]
Shen, Xuemin [3 ]
机构
[1] Beijing Jiaotong Univ, Sch Software Engn, Beijing, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[3] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON, Canada
基金
国家重点研发计划;
关键词
Privacy; Costs; Generative AI; Memory management; Chatbots; Mobile handsets; Explosions;
D O I
10.1109/MWC.006.2300576
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, generative artificial intelligence (GenAI) has gained significant interest on a global scale, particularly with the explosion of some killer GenAl applications, like ChatGPT. However, due to the excessively large sizes of generative models, most current GenAl applications are deployed in the cloud, easily causing high cost, long delay, and potential risk of privacy leakage, thereby greatly impeding GenAl's further expansion and development. In this article, we explore mobile GenAl - deploying large generative models on mobile devices, aiming to bring the GenAl capability to the physical proximity to users. First, we analyze the benefits and opportunities of mobile GenAl in terms of cost, delay, privacy, personalization, and application. Then, we test various large generative models on the mobile testbed, and reveal mobile GenAl's key bottlenecks in inference latency and memory consumption. Accordingly, we propose a weight occupancy strategy for model compression during inference, and discuss the pros and cons thereof. Finally future directions are pointed out to foster continued research efforts.
引用
收藏
页码:58 / 64
页数:7
相关论文
共 50 条
  • [21] Using ChatGPT and other forms of generative AI in systematic reviews: Challenges and opportunities
    Hossain, M. Mahbub
    JOURNAL OF MEDICAL IMAGING AND RADIATION SCIENCES, 2024, 55 (01) : 11 - 12
  • [22] Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges
    Franceschelli, Giorgio
    Musolesi, Mirco
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 79 : 417 - 446
  • [23] Mentor's Musings on Concerns, Challenges & Opportunities for Generative AI at the Edge in IoT
    Narang N.K.
    IEEE Internet of Things Magazine, 2024, 7 (03): : 6 - 11
  • [24] Strategies for Integrating Generative AI into Higher Education: Navigating Challenges and Leveraging Opportunities
    Kurtz, Gila
    Amzalag, Meital
    Shaked, Nava
    Zaguri, Yanay
    Kohen-Vacs, Dan
    Gal, Eran
    Zailer, Gideon
    Barak-Medina, Eran
    EDUCATION SCIENCES, 2024, 14 (05):
  • [25] The impending disruption of creative industries by generative AI: Opportunities, challenges, and research agenda
    Amankwah-Amoah, Joseph
    Abdalla, Samar
    Mogaji, Emmanuel
    Elbanna, Amany
    Dwivedi, Yogesh K.
    International Journal of Information Management, 2024, 79
  • [26] Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges
    Franceschelli, Giorgio
    Musolesi, Mirco
    Journal of Artificial Intelligence Research, 2024, 79 : 417 - 446
  • [27] Generative AI in banking: empirical insights on integration, challenges and opportunities in a regulated industry
    Moharrak, Moayad
    Mogaji, Emmanuel
    INTERNATIONAL JOURNAL OF BANK MARKETING, 2025, 43 (04) : 871 - 896
  • [28] ChatGPT and generative AI chatbots: challenges and opportunities for science, medicine and medical leaders
    Loh, Erwin
    BMJ LEADER, 2024, 8 (01) : 51 - 54
  • [29] Integrating Generative AI into Legal Education: From Casebooks to Code, Opportunities and Challenges
    Prakash, G. Aswathy
    Nair, Vishnu
    LAW TECHNOLOGY AND HUMANS, 2024, 6 (03): : 60 - 79
  • [30] The Generative AI Landscape in Education: Mapping the Terrain of Opportunities, Challenges, and Student Perception
    Ahmed, Zishan
    Shanto, Shakib Sadat
    Rime, Most. Humayra Khanom
    Morol, Md. Kishor
    Fahad, Nafiz
    Hossen, Md. Jakir
    Abdullah-Al-Jubair, Md.
    IEEE ACCESS, 2024, 12 : 147023 - 147050