Image and data mining in reticular chemistry powered by GPT-4V

Cited by: 13
Authors
Zheng, Zhiling [1 ,2 ,3 ]
He, Zhiguo [1 ,2 ]
Khattab, Omar [4 ]
Rampal, Nakul [1 ,2 ,3 ]
Zaharia, Matei A. [5 ]
Borgs, Christian [3 ,5 ]
Chayes, Jennifer T. [3 ,5 ,6 ,7 ,8 ]
Yaghi, Omar M. [1 ,2 ,3 ,9 ]
Affiliations
[1] Univ Calif Berkeley, Dept Chem, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Kavli Energy Nanosci Inst, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Bakar Inst Digital Mat Planet, Berkeley, CA 94720 USA
[4] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[5] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[6] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA
[7] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[8] Univ Calif Berkeley, Sch Informat, Berkeley, CA 94720 USA
[9] KACST UC Berkeley Ctr Excellence Nanomat Clean Ene, King Abdulaziz City Sci & Technol, Riyadh 11442, Saudi Arabia
Source
DIGITAL DISCOVERY | 2024 / Volume 3 / Issue 03
Keywords
METAL-ORGANIC FRAMEWORKS; SURFACE-AREAS; BET METHOD; ADSORPTION; FUTURE; TOOL;
DOI
10.1039/d3dd00239j
Chinese Library Classification
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The integration of artificial intelligence into scientific research opens new avenues with the advent of GPT-4V, a large language model equipped with vision capabilities. In this study, we demonstrate that GPT-4V, accessible through the ChatGPT web user interface or an API, offers promising possibilities in navigating and mining complex data for metal-organic frameworks (MOFs) especially from graphical sources (e.g. sorption isotherms, powder X-ray diffraction patterns, thermogravimetric analysis graphs, etc.). Our approach involved an automated process of converting 346 scholarly articles into 6240 images, which represents a benchmark dataset in this task, followed by deploying GPT-4V to categorize and analyze these images using natural language prompts, which can be written by chemists or materials scientists with minimal prior coding knowledge. This methodology enabled GPT-4V to accurately identify and interpret key plots integral to MOF characterization, such as nitrogen isotherms, PXRD patterns, and TGA curves, among others, with accuracy and recall above 93%. The model's proficiency in extracting critical information from these plots not only underscores its capability in data mining but also highlights its potential to aid in the digitalization of experimental data and the creation of datasets for reticular chemistry. In addition, the trends and values of nitrogen isotherm data from the selected literature allowed for a comparison between theoretical and experimental porosity values for over 200 compounds, highlighting certain discrepancies and underscoring the importance of integrating computational and experimental data. This work highlights the potential of AI in accelerating scientific discovery by bridging the gap between computational tools and experimental research.
Pages: 491-501
Page count: 11