Survey of Different Large Language Model Architectures: Trends, Benchmarks, and Challenges

被引:1
|
作者
Shao, Minghao [1 ]
Basit, Abdul [2 ]
Karri, Ramesh [1 ]
Shafique, Muhammad [2 ]
机构
[1] NYU, Tandon Sch Engn, New York, NY 10012 USA
[2] New York Univ Abu Dhabi, Abu Dhabi Engn Div, Abu Dhabi, U Arab Emirates
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Surveys; Transformers; Benchmark testing; Encoding; Large language models; Adaptation models; Market research; Decoding; Training; Computational modeling; Large language models (LLMs); Transformer architecture; generative models; survey; multimodal learning; deep learning; natural language processing (NLP); GENERATIVE ADVERSARIAL NETWORKS;
D O I
10.1109/ACCESS.2024.3482107
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large Language Models (LLMs) represent a class of deep learning models adept at understanding natural language and generating coherent responses to various prompts or queries. These models far exceed the complexity of conventional neural networks, often encompassing dozens of neural network layers and containing billions to trillions of parameters. They are typically trained on vast datasets, utilizing architectures based on transformer blocks. Present-day LLMs are multi-functional, capable of performing a range of tasks from text generation and language translation to question answering, as well as code generation and analysis. An advanced subset of these models, known as Multimodal Large Language Models (MLLMs), extends LLM capabilities to process and interpret multiple data modalities, including images, audio, and video. This enhancement empowers MLLMs with capabilities like video editing, image comprehension, and captioning for visual content. This survey provides a comprehensive overview of the recent advancements in LLMs. We begin by tracing the evolution of LLMs and subsequently delve into the advent and nuances of MLLMs. We analyze emerging state-of-the-art MLLMs, exploring their technical features, strengths, and limitations. Additionally, we present a comparative analysis of these models and discuss their challenges, potential limitations, and prospects for future development.
引用
收藏
页码:188664 / 188706
页数:43
相关论文
共 50 条
  • [1] Large Language Model Instruction Following: A Survey of Progresses and Challenges
    Lou, Renze
    Zhang, Kai
    Yin, Wenpeng
    COMPUTATIONAL LINGUISTICS, 2024, 50 (03) : 1053 - 1095
  • [2] A survey on augmenting knowledge graphs (KGs) with large language models (LLMs): models, evaluation metrics, benchmarks, and challenges
    Ibrahim, Nourhan
    Aboulela, Samar
    Ibrahim, Ahmed
    Kashef, Rasha
    Discover Artificial Intelligence, 2024, 4 (01):
  • [3] Comparative Study of Large Language Model Architectures on Frontier
    Yin, Junqi
    Bose, Avishek
    Cong, Guojing
    Lyngaas, Isaac
    Anthony, Quentin
    PROCEEDINGS 2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS 2024, 2024, : 556 - 569
  • [4] Large Language Model for Medical Images: A Survey of Taxonomy, Systematic Review, and Future Trends
    Wang, Peng
    Lu, Wenpeng
    Lu, Chunlin
    Zhou, Ruoxi
    Li, Min
    Qin, Libo
    BIG DATA MINING AND ANALYTICS, 2025, 8 (02): : 496 - 517
  • [5] When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards
    Alzahrani, Norah A.
    Alyahya, Hisham Abdullah
    Alnumay, Yazeed
    Alrashed, Sultan
    Alsubaie, Shaykhah Z.
    Almushayqih, Yousef
    Mirza, Faisal Abdulrahman
    Alotaibi, Nouf M.
    Al-Twairesh, Nora
    Alowisheq, Areeb
    Bari, M. Saiful
    Khan, Haidar
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 13787 - 13805
  • [6] Security and Privacy Challenges of Large Language Models: A Survey
    Das, Badhan chandra
    Amini, M. hadi
    Wu, Yanzhao
    ACM COMPUTING SURVEYS, 2025, 57 (06)
  • [7] Blockchain for securing electronic voting systems: a survey of architectures, trends, solutions, and challenges
    Ohize, Henry O.
    Onumanyi, Adeiza James
    Umar, Buhari U.
    Ajao, Lukman A.
    Isah, Rabiu O.
    Dogo, Eustace M.
    Nuhu, Bello K.
    Olaniyi, Olayemi M.
    Ambafi, James G.
    Sheidu, Vincent B.
    Ibrahim, Muhammad M.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2025, 28 (02):
  • [8] Vehicular Fog Computing: A Survey of Architectures, Resource Management, Challenges and Emerging Trends
    Husain, Mohammed Hassan
    Ahmadi, Mahmood
    Mardukhi, Farhad
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 136 (04) : 2243 - 2273
  • [9] A Review on Large Language Models: Architectures, Applications, Taxonomies, Open Issues and Challenges
    Raiaan, Mohaimenul Azam Khan
    Mukta, Md. Saddam Hossain
    Fatema, Kaniz
    Fahad, Nur Mohammad
    Sakib, Sadman
    Mim, Most Marufatul Jannat
    Ahmad, Jubaer
    Ali, Mohammed Eunus
    Azam, Sami
    IEEE ACCESS, 2024, 12 : 26839 - 26874