Image and data mining in reticular chemistry powered by GPT-4V

Cited by: 13

Authors:

Zheng, Zhiling ^{[1,2,3]}

He, Zhiguo ^{[1,2]}

Khattab, Omar ^{[4]}

Rampal, Nakul ^{[1,2,3]}

Zaharia, Matei A. ^{[5]}

Borgs, Christian ^{[3,5]}

Chayes, Jennifer T. ^{[3,5,6,7,8]}

Yaghi, Omar M. ^{[1,2,3,9]}

Affiliations:

[1] Univ Calif Berkeley, Dept Chem, Berkeley, CA 94720 USA

[2] Univ Calif Berkeley, Kavli Energy Nanosci Inst, Berkeley, CA 94720 USA

[3] Univ Calif Berkeley, Bakar Inst Digital Mat Planet, Berkeley, CA 94720 USA

[4] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

[5] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA

[6] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA

[7] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA

[8] Univ Calif Berkeley, Sch Informat, Berkeley, CA 94720 USA

[9] KACST UC Berkeley Ctr Excellence Nanomat Clean Ene, King Abdulaziz City Sci & Technol, Riyadh 11442, Saudi Arabia

Source:

DIGITAL DISCOVERY | 2024 / Volume 3 / Issue 03

Keywords:

METAL-ORGANIC FRAMEWORKS; SURFACE-AREAS; BET METHOD; ADSORPTION; FUTURE; TOOL;

DOI:

10.1039/d3dd00239j

Chinese Library Classification number:

O6 [Chemistry];

Discipline classification code:

0703;

Abstract:

The integration of artificial intelligence into scientific research opens new avenues with the advent of GPT-4V, a large language model equipped with vision capabilities. In this study, we demonstrate that GPT-4V, accessible through the ChatGPT web user interface or an API, offers promising possibilities in navigating and mining complex data for metal-organic frameworks (MOFs) especially from graphical sources (e.g. sorption isotherms, powder X-ray diffraction patterns, thermogravimetric analysis graphs, etc.). Our approach involved an automated process of converting 346 scholarly articles into 6240 images, which represents a benchmark dataset in this task, followed by deploying GPT-4V to categorize and analyze these images using natural language prompts, which can be written by chemists or materials scientists with minimal prior coding knowledge. This methodology enabled GPT-4V to accurately identify and interpret key plots integral to MOF characterization, such as nitrogen isotherms, PXRD patterns, and TGA curves, among others, with accuracy and recall above 93%. The model's proficiency in extracting critical information from these plots not only underscores its capability in data mining but also highlights its potential to aid in the digitalization of experimental data and the creation of datasets for reticular chemistry. In addition, the trends and values of nitrogen isotherm data from the selected literature allowed for a comparison between theoretical and experimental porosity values for over 200 compounds, highlighting certain discrepancies and underscoring the importance of integrating computational and experimental data. This work highlights the potential of AI in accelerating scientific discovery by bridging the gap between computational tools and experimental research.
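
The abstract reports accuracy and recall for identifying each plot type among the page images. As a minimal sketch of how such figures could be computed for one category, assuming each page image carries a ground-truth yes/no label and a model yes/no answer (the function name and the eight example labels are hypothetical and are not data from the paper):

```python
from typing import Sequence


def accuracy_and_recall(
    truth: Sequence[bool], predicted: Sequence[bool]
) -> tuple[float, float]:
    """Return (accuracy, recall) for one plot category.

    truth[i] is True when page image i really contains the plot type;
    predicted[i] is True when the model reported that it does.
    """
    if len(truth) != len(predicted):
        raise ValueError("truth and predicted must have the same length")
    if not truth:
        raise ValueError("at least one labelled image is required")

    pairs = list(zip(truth, predicted))
    tp = sum(t and p for t, p in pairs)              # true positives
    tn = sum((not t) and (not p) for t, p in pairs)  # true negatives
    fn = sum(t and (not p) for t, p in pairs)        # false negatives

    accuracy = (tp + tn) / len(truth)
    # Recall is undefined without any positives; report 0.0 in that case.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, recall


# Hypothetical labels for eight page images (illustrative only).
truth_isotherm = [True, True, False, False, True, False, False, True]
predicted_isotherm = [True, False, False, False, True, False, True, True]

acc, rec = accuracy_and_recall(truth_isotherm, predicted_isotherm)
print(f"accuracy={acc:.3f} recall={rec:.3f}")
```

Repeating this per category (for example nitrogen isotherms, PXRD patterns, and TGA curves) would give one accuracy and recall pair per plot type; how the authors actually scored the model is not stated in this record.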

Pages: 491-501

Page count: 11
