A Comprehensive Review of AI Advancement Using testFAILS and testFAILS-2 for the Pursuit of AGI

被引:0
|
作者
Kumar, Yulia [1 ,2 ]
Lin, Mengtian [1 ]
Paredes, Christopher [1 ]
Li, Dan [1 ]
Yang, Guohao [1 ]
Kruger, Dov [2 ]
Li, J. Jenny [1 ]
Morreale, Patricia [1 ]
机构
[1] Kean Univ, Dept Comp Sci & Technol, Union, NJ 07083 USA
[2] Rutgers State Univ, Dept Elect & Comp Engn, Piscataway, NJ 08854 USA
来源
ELECTRONICS | 2024年 / 13卷 / 24期
关键词
AI evaluation; testFAILS-2; artificial general intelligence; multimodal AI; AI linguistic systems;
D O I
10.3390/electronics13244991
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In a previous paper we defined testFAILS, a set of benchmarks for measuring the efficacy of Large Language Models in various domains. This paper defines a second-generation framework, testFAILS-2 to measure how current AI engines are progressing towards Artificial General Intelligence (AGI). The testFAILS-2 framework offers enhanced evaluation metrics that address the latest developments in Artificial Intelligence Linguistic Systems (AILS). A key feature of this re-view is the "Chat with Alan" project, a Retrieval-Augmented Generation (RAG)-based AI bot inspired by Alan Turing, designed to distinguish between human and AI generated interactions, thereby emulating Turing's original vision. We assess a variety of models, including ChatGPT-4o-mini and other Small Language Models (SLMs), as well as prominent Large Language Models (LLMs), utilizing expanded criteria that encompass result relevance, accessibility, cost, multimodality, agent creation capabilities, emotional AI attributes, AI search capacity, and LLM-robot integration. The analysis reveals that testFAILS-2 significantly enhances the evaluation of model robustness and user productivity, while also identifying critical areas for improvement in multimodal processing and emotional reasoning. By integrating rigorous evaluation standards and novel testing methodologies, testFAILS-2 advances the assessment of AILS, providing essential insights that contribute to the ongoing development of more effective and resilient AI systems towards achieving AGI.
引用
收藏
页数:50
相关论文
共 29 条
  • [1] Future Trends for Human-AI Collaboration: A Comprehensive Taxonomy of AI/AGI Using Multiple Intelligences and Learning Styles
    Cichocki, Andrzej
    Kuleshov, Alexander P.
    Computational Intelligence and Neuroscience, 2021, 2021
  • [2] Future Trends for Human-AI Collaboration: A Comprehensive Taxonomy of AI/AGI Using Multiple Intelligences and Learning Styles
    Cichocki, Andrzej
    Kuleshov, Alexander P.
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [3] Recent Advancement in Accent Conversion Using Deep Learning Techniques: A Comprehensive Review
    Chandra, Sabyasachi
    Bharati, Puja
    Prasad, G. Satya
    Pramanik, Debolina
    Das Mandal, Shyamal Kumar
    PROCEEDINGS OF 27TH INTERNATIONAL SYMPOSIUM ON FRONTIERS OF RESEARCH IN SPEECH AND MUSIC, FRSM 2023, 2024, 1455 : 61 - 73
  • [4] Recent advancement in cancer diagnosis using machine learning and deep learning techniques: A comprehensive review
    Painuli, Deepak
    Bhardwaj, Suyash
    Kose, Utku
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 146
  • [5] AI-Enabled Learning Architecture Using Network Traffic Traces over IoT Network: A Comprehensive Review
    Aneja N.
    Aneja S.
    Bhargava B.
    Wireless Communications and Mobile Computing, 2023, 2023
  • [6] An AI-Based Newly Developed Analytical Formulation for Discharging Behavior of Supercapacitors with the Integration of a Review of Supercapacitor Challenges and Advancement Using Quantum Dots
    Satpathy, Sambit
    Misra, Neeraj Kumar
    Goyal, Vishal
    Das, Sanchali
    Sharma, Vishnu
    Ali, Shabir
    SYMMETRY-BASEL, 2023, 15 (04):
  • [7] Recent advancement in NiFe2O4-based nanocomposites for the photocatalytic degradation of pollutants in aqueous solutions: a comprehensive systematic review
    Derakhshani, Elham
    Naghizadeh, Ali
    AQUA-WATER INFRASTRUCTURE ECOSYSTEMS AND SOCIETY, 2023, 72 (08) : 1629 - 1645
  • [8] Hydrogen Production Using TiO2-Based Photocatalysts: A Comprehensive Review
    Rafique, Muhammad
    Hajra, Syeda
    Irshad, Muneeb
    Usman, Muhammad
    Imran, Muhammad
    Assiri, Mohammad A.
    Ashraf, Waqar Muhammad
    ACS OMEGA, 2023, 8 (29): : 25640 - 25648
  • [9] A critical review on advancement and challenges in using TiO2 as electron transport layer for perovskite solar cell
    Fatima, Qawareer
    Haidry, Azhar Ali
    Zhang, Haiqian
    El Jery, Atef
    Aldrdery, Moutaz
    MATERIALS TODAY SUSTAINABILITY, 2024, 27
  • [10] A comprehensive systematic review of photocatalytic degradation of pesticides using nano TiO2
    Hadei, Mostafa
    Mesdaghinia, Alireza
    Nabizadeh, Ramin
    Mahvi, Amir Hossein
    Rabbani, Shahram
    Naddafi, Kazem
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2021, 28 (11) : 13055 - 13071