Survey of Different Large Language Model Architectures: Trends, Benchmarks, and Challenges

被引：1

作者：

Shao, Minghao ^{[1
]}

Basit, Abdul ^{[2
]}

Karri, Ramesh ^{[1
]}

Shafique, Muhammad ^{[2
]}

机构：

[1] NYU, Tandon Sch Engn, New York, NY 10012 USA

[2] New York Univ Abu Dhabi, Abu Dhabi Engn Div, Abu Dhabi, U Arab Emirates

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Surveys; Transformers; Benchmark testing; Encoding; Large language models; Adaptation models; Market research; Decoding; Training; Computational modeling; Large language models (LLMs); Transformer architecture; generative models; survey; multimodal learning; deep learning; natural language processing (NLP); GENERATIVE ADVERSARIAL NETWORKS;

D O I：

10.1109/ACCESS.2024.3482107

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Large Language Models (LLMs) represent a class of deep learning models adept at understanding natural language and generating coherent responses to various prompts or queries. These models far exceed the complexity of conventional neural networks, often encompassing dozens of neural network layers and containing billions to trillions of parameters. They are typically trained on vast datasets, utilizing architectures based on transformer blocks. Present-day LLMs are multi-functional, capable of performing a range of tasks from text generation and language translation to question answering, as well as code generation and analysis. An advanced subset of these models, known as Multimodal Large Language Models (MLLMs), extends LLM capabilities to process and interpret multiple data modalities, including images, audio, and video. This enhancement empowers MLLMs with capabilities like video editing, image comprehension, and captioning for visual content. This survey provides a comprehensive overview of the recent advancements in LLMs. We begin by tracing the evolution of LLMs and subsequently delve into the advent and nuances of MLLMs. We analyze emerging state-of-the-art MLLMs, exploring their technical features, strengths, and limitations. Additionally, we present a comparative analysis of these models and discuss their challenges, potential limitations, and prospects for future development.

引用

页码：188664 / 188706

页数：43

共 50 条

[1] Large Language Model Instruction Following: A Survey of Progresses and Challenges
Lou, Renze
Zhang, Kai
Yin, Wenpeng
COMPUTATIONAL LINGUISTICS, 2024, 50 (03) : 1053 - 1095
[2] A survey on augmenting knowledge graphs (KGs) with large language models (LLMs): models, evaluation metrics, benchmarks, and challenges
Ibrahim, Nourhan
Aboulela, Samar
Ibrahim, Ahmed
Kashef, Rasha
Discover Artificial Intelligence, 2024, 4 (01):
[3] Comparative Study of Large Language Model Architectures on Frontier
Yin, Junqi
Bose, Avishek
Cong, Guojing
Lyngaas, Isaac
Anthony, Quentin
PROCEEDINGS 2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS 2024, 2024, : 556 - 569
[4] Large Language Model for Medical Images: A Survey of Taxonomy, Systematic Review, and Future Trends
Wang, Peng
Lu, Wenpeng
Lu, Chunlin
Zhou, Ruoxi
Li, Min
Qin, Libo
BIG DATA MINING AND ANALYTICS, 2025, 8 (02): : 496 - 517
[5] When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards
Alzahrani, Norah A.
Alyahya, Hisham Abdullah
Alnumay, Yazeed
Alrashed, Sultan
Alsubaie, Shaykhah Z.
Almushayqih, Yousef
Mirza, Faisal Abdulrahman
Alotaibi, Nouf M.
Al-Twairesh, Nora
Alowisheq, Areeb
Bari, M. Saiful
Khan, Haidar
PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 13787 - 13805
[6] Security and Privacy Challenges of Large Language Models: A Survey
Das, Badhan chandra
Amini, M. hadi
Wu, Yanzhao
ACM COMPUTING SURVEYS, 2025, 57 (06)
[7] Blockchain for securing electronic voting systems: a survey of architectures, trends, solutions, and challenges
Ohize, Henry O.
Onumanyi, Adeiza James
Umar, Buhari U.
Ajao, Lukman A.
Isah, Rabiu O.
Dogo, Eustace M.
Nuhu, Bello K.
Olaniyi, Olayemi M.
Ambafi, James G.
Sheidu, Vincent B.
Ibrahim, Muhammad M.
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2025, 28 (02):
[8] Vehicular Fog Computing: A Survey of Architectures, Resource Management, Challenges and Emerging Trends
Husain, Mohammed Hassan
Ahmadi, Mahmood
Mardukhi, Farhad
WIRELESS PERSONAL COMMUNICATIONS, 2024, 136 (04) : 2243 - 2273
[9] A Review on Large Language Models: Architectures, Applications, Taxonomies, Open Issues and Challenges
Raiaan, Mohaimenul Azam Khan
Mukta, Md. Saddam Hossain
Fatema, Kaniz
Fahad, Nur Mohammad
Sakib, Sadman
Mim, Most Marufatul Jannat
Ahmad, Jubaer
Ali, Mohammed Eunus
Azam, Sami
IEEE ACCESS, 2024, 12 : 26839 - 26874
[10] A survey of multimodal hybrid deep learning for computer vision: Architectures, applications, trends, and challenges
Bayoudh, Khaled
INFORMATION FUSION, 2024, 105

← 1 2 3 4 5 →