Evaluation of Open-Source Large Language Models for Metal-Organic Frameworks Research

被引:13
|
作者
Bai, Xuefeng [1 ,2 ]
Xie, Yabo [1 ,2 ]
Zhang, Xin [1 ,2 ]
Han, Honggui [3 ,4 ]
Li, Jian-Rong [1 ,2 ]
机构
[1] Beijing Univ Technol, Coll Mat Sci & Engn, Beijing Key Lab Green Catalysis & Separat, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Coll Mat Sci & Engn, Dept Chem Engn, Beijing 100124, Peoples R China
[3] Beijing Univ Technol, Fac Informat Technol, Engn Res Ctr Digital Community, Beijing Lab Urban Mass Transit,Minist Educ, Beijing 100124, Peoples R China
[4] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Compendex;
D O I
10.1021/acs.jcim.4c00065
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Along with the development of machine learning, deep learning, and large language models (LLMs) such as GPT-4 (GPT: Generative Pre-Trained Transformer), artificial intelligence (AI) tools have been playing an increasingly important role in chemical and material research to facilitate the material screening and design. Despite the exciting progress of GPT-4 based AI research assistance, open-source LLMs have not gained much attention from the scientific community. This work primarily focused on metal-organic frameworks (MOFs) as a subdomain of chemistry and evaluated six top-rated open-source LLMs with a comprehensive set of tasks including MOFs knowledge, basic chemistry knowledge, in-depth chemistry knowledge, knowledge extraction, database reading, predicting material property, experiment design, computational scripts generation, guiding experiment, data analysis, and paper polishing, which covers the basic units of MOFs research. In general, these LLMs were capable of most of the tasks. Especially, Llama2-7B and ChatGLM2-6B were found to perform particularly well with moderate computational resources. Additionally, the performance of different parameter versions of the same model was compared, which revealed the superior performance of higher parameter versions.
引用
收藏
页码:4958 / 4965
页数:8
相关论文
共 50 条
  • [21] Highly tunable metal-organic frameworks with open metal centers
    Choi, Eun-Young
    Wray, Curtis A.
    Hu, Chunhua
    Choe, Wonyoung
    CRYSTENGCOMM, 2009, 11 (04): : 553 - 555
  • [22] Property and Research Process of Metal-organic Frameworks
    Ma, Yuanhui
    Tang, Chengchun
    Zhang, Lei
    ADVANCE IN ECOLOGICAL ENVIRONMENT FUNCTIONAL MATERIALS AND ION INDUSTRY III, 2012, 427 : 119 - +
  • [23] Research Progress of Hollow Metal-organic Frameworks
    Lu, Yu
    Wang, Tie
    CHEMICAL JOURNAL OF CHINESE UNIVERSITIES-CHINESE, 2023, 44 (01):
  • [24] Reproducibility in research into metal-organic frameworks in nanomedicine
    Ross S. Forgan
    Communications Materials, 5
  • [25] Reproducibility in research into metal-organic frameworks in nanomedicine
    Forgan, Ross S.
    COMMUNICATIONS MATERIALS, 2024, 5 (01)
  • [26] Enhancing Code Security Through Open-Source Large Language Models: A Comparative Study
    Ridley, Norah
    Branca, Enrico
    Kimber, Jadyn
    Stakhanova, Natalia
    FOUNDATIONS AND PRACTICE OF SECURITY, PT I, FPS 2023, 2024, 14551 : 233 - 249
  • [27] Automatic structuring of radiology reports with on-premise open-source large language models
    Woznicki, Piotr
    Laqua, Caroline
    Fiku, Ina
    Hekalo, Amar
    Truhn, Daniel
    Engelhardt, Sandy
    Kather, Jakob
    Foersch, Sebastian
    D'Antonoli, Tugba Akinci
    dos Santos, Daniel Pinto
    Baessler, Bettina
    Laqua, Fabian Christopher
    EUROPEAN RADIOLOGY, 2025, 35 (04) : 2018 - 2029
  • [28] Iterative Refactoring of Real-World Open-Source Programs with Large Language Models
    Choi, Jinsu
    An, Gabin
    Yoo, Shin
    SEARCH-BASED SOFTWARE ENGINEERING, SSBSE 2024, 2024, 14767 : 49 - 55
  • [29] Fine-Tuning and Evaluating Open-Source Large Language Models for the Army Domain
    Ruiz, Maj Daniel C.
    Sell, John
    arXiv,
  • [30] Comparing Commercial and Open-Source Large Language Models for Labeling Chest Radiograph Reports
    Dorfner, Felix J.
    Juergensen, Liv
    Donle, Leonhard
    Al Mohamad, Fares
    Bodenmann, Tobias R.
    Cleveland, Mason C.
    Busch, Felix
    Adams, Lisa C.
    Sato, James
    Schultz, Thomas
    Kim, Albert E.
    Merkow, Jameson
    Bressem, Keno K.
    Bridge, Christopher P.
    RADIOLOGY, 2024, 313 (01)