Image and data mining in reticular chemistry powered by GPT-4V

Cited by: 13
Authors
Zheng, Zhiling [1, 2, 3]
He, Zhiguo [1, 2]
Khattab, Omar [4]
Rampal, Nakul [1, 2, 3]
Zaharia, Matei A. [5]
Borgs, Christian [3, 5]
Chayes, Jennifer T. [3, 5, 6, 7, 8]
Yaghi, Omar M. [1, 2, 3, 9]
Affiliations
[1] Univ Calif Berkeley, Dept Chem, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Kavli Energy Nanosci Inst, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Bakar Inst Digital Mat Planet, Berkeley, CA 94720 USA
[4] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[5] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[6] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA
[7] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[8] Univ Calif Berkeley, Sch Informat, Berkeley, CA 94720 USA
[9] KACST UC Berkeley Ctr Excellence Nanomat Clean Ene, King Abdulaziz City Sci & Technol, Riyadh 11442, Saudi Arabia
Source
DIGITAL DISCOVERY | 2024 / Vol. 3 / Issue 03
Keywords
METAL-ORGANIC FRAMEWORKS; SURFACE-AREAS; BET METHOD; ADSORPTION; FUTURE; TOOL;
DOI
10.1039/d3dd00239j
Chinese Library Classification (CLC) number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The integration of artificial intelligence into scientific research opens new avenues with the advent of GPT-4V, a large language model equipped with vision capabilities. In this study, we demonstrate that GPT-4V, accessible through the ChatGPT web user interface or an API, offers promising possibilities in navigating and mining complex data for metal-organic frameworks (MOFs), especially from graphical sources (e.g., sorption isotherms, powder X-ray diffraction patterns, and thermogravimetric analysis graphs). Our approach involved an automated process of converting 346 scholarly articles into 6240 images, which represents a benchmark dataset for this task, followed by deploying GPT-4V to categorize and analyze these images using natural language prompts, which can be written by chemists or materials scientists with minimal prior coding knowledge. This methodology enabled GPT-4V to accurately identify and interpret key plots integral to MOF characterization, such as nitrogen isotherms, PXRD patterns, and TGA curves, among others, with accuracy and recall above 93%. The model's proficiency in extracting critical information from these plots not only underscores its capability in data mining but also highlights its potential to aid in the digitalization of experimental data and the creation of datasets for reticular chemistry. In addition, the trends and values of nitrogen isotherm data from the selected literature allowed for a comparison between theoretical and experimental porosity values for over 200 compounds, highlighting certain discrepancies and underscoring the importance of integrating computational and experimental data. This work highlights the potential of AI in accelerating scientific discovery by bridging the gap between computational tools and experimental research.
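The abstract reports accuracy and recall for plot categorization but does not spell out how they are computed. As a minimal sketch, assuming a one-vs-rest definition per plot category (an assumption, not necessarily the authors' evaluation protocol), the two metrics could be computed from human labels and model predictions as follows. The function name, category names, and toy labels are hypothetical and purely illustrative.

```python
from typing import Sequence


def accuracy_and_recall(
    truth: Sequence[str], predicted: Sequence[str], target: str
) -> tuple[float, float]:
    """One-vs-rest accuracy and recall for a single plot category.

    Assumption: each page image has exactly one human label and one
    predicted label; the source record does not give the exact definition.
    """
    if len(truth) != len(predicted) or not truth:
        raise ValueError("truth and predicted must be non-empty and equal length")
    pairs = list(zip(truth, predicted))
    tp = sum(t == target and p == target for t, p in pairs)  # true positives
    tn = sum(t != target and p != target for t, p in pairs)  # true negatives
    fn = sum(t == target and p != target for t, p in pairs)  # false negatives
    accuracy = (tp + tn) / len(pairs)
    recall = tp / (tp + fn) if (tp + fn) else float("nan")
    return accuracy, recall


# Hypothetical toy labels, not data from the paper.
truth = ["isotherm", "pxrd", "tga", "other", "isotherm", "other"]
predicted = ["isotherm", "pxrd", "other", "other", "isotherm", "isotherm"]
acc, rec = accuracy_and_recall(truth, predicted, "isotherm")
```

In this toy case one "other" image is misread as an isotherm, which lowers accuracy for the "isotherm" category while leaving its recall unaffected, since no true isotherm was missed.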
Pages: 491-501
Page count: 11