Image and data mining in reticular chemistry powered by GPT-4V

被引:13
|
作者
Zheng, Zhiling [1 ,2 ,3 ]
He, Zhiguo [1 ,2 ]
Khattab, Omar [4 ]
Rampal, Nakul [1 ,2 ,3 ]
Zaharia, Matei A. [5 ]
Borgs, Christian [3 ,5 ]
Chayes, Jennifer T. [3 ,5 ,6 ,7 ,8 ]
Yaghi, Omar M. [1 ,2 ,3 ,9 ]
机构
[1] Univ Calif Berkeley, Dept Chem, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Kavli Energy Nanosci Inst, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Bakar Inst Digital Mat Planet, Berkeley, CA 94720 USA
[4] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[5] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[6] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA
[7] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[8] Univ Calif Berkeley, Sch Informat, Berkeley, CA 94720 USA
[9] KACST UC Berkeley Ctr Excellence Nanomat Clean Ene, King Abdulaziz City Sci & Technol, Riyadh 11442, Saudi Arabia
来源
DIGITAL DISCOVERY | 2024年 / 3卷 / 03期
关键词
METAL-ORGANIC FRAMEWORKS; SURFACE-AREAS; BET METHOD; ADSORPTION; FUTURE; TOOL;
D O I
10.1039/d3dd00239j
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The integration of artificial intelligence into scientific research opens new avenues with the advent of GPT-4V, a large language model equipped with vision capabilities. In this study, we demonstrate that GPT-4V, accessible through the ChatGPT web user interface or an API, offers promising possibilities in navigating and mining complex data for metal-organic frameworks (MOFs) especially from graphical sources (e.g. sorption isotherms, powder X-ray diffraction patterns, thermogravimetric analysis graphs, etc.). Our approach involved an automated process of converting 346 scholarly articles into 6240 images, which represents a benchmark dataset in this task, followed by deploying GPT-4V to categorize and analyze these images using natural language prompts, which can be written by chemists or materials scientists with minimal prior coding knowledge. This methodology enabled GPT-4V to accurately identify and interpret key plots integral to MOF characterization, such as nitrogen isotherms, PXRD patterns, and TGA curves, among others, with accuracy and recall above 93%. The model's proficiency in extracting critical information from these plots not only underscores its capability in data mining but also highlights its potential to aid in the digitalization of experimental data and the creation of datasets for reticular chemistry. In addition, the trends and values of nitrogen isotherm data from the selected literature allowed for a comparison between theoretical and experimental porosity values for over 200 compounds, highlighting certain discrepancies and underscoring the importance of integrating computational and experimental data. This work highlights the potential of AI in accelerating scientific discovery by bridging the gap between computational tools and experimental research. The integration of artificial intelligence into scientific research opens new avenues with the advent of GPT-4V, a large language model equipped with vision capabilities.
引用
收藏
页码:491 / 501
页数:11
相关论文
共 33 条
  • [21] Performance of GPT-4V in Answering the Japanese Otolaryngology Board Certification Examination Questions: Evaluation Study
    Noda, Masao
    Ueno, Takayoshi
    Koshu, Ryota
    Takaso, Yuji
    Shimada, Mari Dias
    Saito, Chizu
    Sugimoto, Hisashi
    Fushiki, Hiroaki
    Ito, Makoto
    Nomura, Akihiro
    Yoshizaki, Tomokazu
    JMIR MEDICAL EDUCATION, 2024, 10
  • [22] Is it safe to cross? Interpretable Risk Assessment with GPT-4V for Safety-Aware Street Crossing
    Hwang, Hochul
    Kwon, Sunjae
    Kim, Yekyung
    Kim, Donghyun
    2024 21ST INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS, UR 2024, 2024, : 281 - 288
  • [23] Evaluating GPT-4V (GPT-4 with Vision) on Detection of Radiologic Findings on Chest Radiographs (May 7, 10.1148/radiol.233270, 2024)
    Zhou, Yiliang
    Ong, Hanley
    Kennedy, Patrick
    Wu, Carol C.
    Kazam, Jacob
    Hentel, Keith
    Flanders, Adam
    Shih, George
    Peng, Yifan
    RADIOLOGY, 2024, 311 (02)
  • [24] Unveiling GPT-4V's hidden challenges behind high accuracy on USMLE questions: Observational Study
    Yang, Zhichao
    Yao, Zonghai
    Tasmin, Mahbuba
    Vashisht, Parth
    Jang, Won Seok
    Ouyang, Feiyun
    Wang, Beining
    Mcmanus, David
    Berlowitz, Dan
    Yu, Hong
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2025, 27
  • [25] How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites
    Chen, Zhe
    Wang, Weiyun
    Tian, Hao
    Ye, Shenglong
    Gao, Zhangwei
    Cui, Erfei
    Tong, Wenwen
    Hu, Kongzhi
    Luo, Jiapeng
    Ma, Zheng
    Ma, Ji
    Wang, Jiaqi
    Dong, Xiaoyi
    Yan, Hang
    Guo, Hewei
    He, Conghui
    Shi, Botian
    Jin, Zhenjiang
    Xu, Chao
    Wang, Bin
    Wei, Xingjian
    Li, Wei
    Zhang, Wenjian
    Zhang, Bo
    Cai, Pinlong
    Wen, Licheng
    Yan, Xiangchao
    Dou, Min
    Lu, Lewei
    Zhu, Xizhou
    Lu, Tong
    Lin, Dahua
    Qiao, Yu
    Dai, Jifeng
    Wang, Wenhai
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (12)
  • [26] How far are we to GPT-4V?Closing the gap to commercial multimodal models with open-source suites
    Zhe CHEN
    Weiyun WANG
    Hao TIAN
    Shenglong YE
    Zhangwei GAO
    Erfei CUI
    Wenwen TONG
    Kongzhi HU
    Jiapeng LUO
    Zheng MA
    Ji MA
    Jiaqi WANG
    Xiaoyi DONG
    Hang YAN
    Hewei GUO
    Conghui HE
    Botian SHI
    Zhenjiang JIN
    Chao XU
    Bin WANG
    Xingjian WEI
    Wei LI
    Wenjian ZHANG
    Bo ZHANG
    Pinlong CAI
    Licheng WEN
    Xiangchao YAN
    Min DOU
    Lewei LU
    Xizhou ZHU
    Tong LU
    Dahua LIN
    Yu QIAO
    Jifeng DAI
    Wenhai WANG
    Science China(Information Sciences), 2024, 67 (12) : 5 - 22
  • [27] A decision-making model for self-driving vehicles based on GPT-4V, federated reinforcement learning, and blockchain
    Alam, Tanweer
    Gupta, Ruchi
    Ahamed, N. Nasurudeen
    Ullah, Arif
    Neural Computing and Applications, 2024, 36 (34) : 21545 - 21560
  • [28] Single and Multi-Hop Question-Answering Datasets for Reticular Chemistry with GPT-4-Turbo
    Rampal, Nakul
    Wang, Kaiyu
    Burigana, Matthew
    Hou, Lingxiang
    Al-Johani, Juri
    Sackmann, Anna
    Murayshid, Hanan S.
    AlSumari, Walaa A.
    AlAbdulkarim, Arwa M.
    Alhazmi, Nahla E.
    Alawad, Majed O.
    Borgs, Christian
    Chayes, Jennifer T.
    Yaghi, Omar M.
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2024, 20 (20) : 9128 - 9137
  • [29] Boosting GPT-4V's accuracy in dermoscopic classification with few-shot learning. Comment on "can ChatGPT vision diagnose melanoma? An exploratory diagnostic accuracy study"
    Wang, Jinge
    Hu, Gangqing
    JOURNAL OF THE AMERICAN ACADEMY OF DERMATOLOGY, 2024, 91 (06) : e165 - e166
  • [30] Potential of ChatGPT and GPT-4 for Data Mining of Free-Text CT Reports on Lung Cancer
    Fink, Matthias A.
    Bischoff, Arved
    Fink, Christoph A.
    Moll, Martin
    Kroschke, Jonas
    Dulz, Luca
    Heussel, Claus Peter
    Kauczor, Hans-Ulrich
    Weber, Tim F.
    RADIOLOGY, 2023, 308 (03)