A fine-tuned large language model based molecular dynamics agent for code generation to obtain material thermodynamic parameters

被引：0

作者：

Zhuofan Shi ^{[1
]}

Chunxiao Xin ^{[2
]}

Tong Huo ^{[3
]}

Yuntao Jiang ^{[1
]}

Bowen Wu ^{[2
]}

Xingyue Chen ^{[3
]}

Wei Qin ^{[2
]}

Xinjian Ma ^{[3
]}

Gang Huang ^{[4
]}

Zhenyu Wang ^{[1
]}

Xiang Jing ^{[2
]}

机构：

[1] Peking University,School of Software and Microelectronics

[2] National Key Laboratory of Data Space Technology and System,Institute of Information Engineering

[3] Advanced Institute of Big Data,undefined

[4] Chinese Academy of Sciences,undefined

来源：

Scientific Reports | / 15卷 / 1期

关键词：

LLM; Agent; Materials science;

D O I：

10.1038/s41598-025-92337-6

中图分类号：

学科分类号：

摘要：

In the field of materials science, addressing the complex relationship between the material structure and properties has increasingly involved leveraging the text generation capabilities of AI-generated content (AIGC) models for tasks that include literature mining and data analysis. However, theoretical calculations and code development remain labor-intensive challenges. This paper proposes a novel approach based on text-to-code generation, utilizing large language models to automate the implementation of simulation programs in materials science. The effectiveness of automated code generation and review is validated with thermodynamics simulations based on the LAMMPS software as a foundation. This study introduces Molecular Dynamics Agent (MDAgent), a framework designed to guide large models in automatically generating, executing, and refining simulation code. In addition, a thermodynamic simulation code dataset for LAMMPS was constructed to fine-tune the language model. Expert evaluation scores demonstrate that MDAgent significantly improves the code generation and review capabilities. The proposed approach reduces the average task time by 42.22%, as compared to traditional models, thus highlighting its potential applications in the field of materials science.

引用

共 38 条

[21] NM-GPT: Advancing Nuclear Medicine Report Processing Through a Specialized Fine-tuned Large Language Model
Lyu, Zhiliang
Zeng, Fang
Guo, Ning
Li, Xiang
Li, Quanzheng
JOURNAL OF NUCLEAR MEDICINE, 2024, 65
[22] Fine-Tuned Large Language Model for Extracting Patients on Pretreatment for Lung Cancer from a Picture Archiving and Communication System Based on Radiological Reports
Yasaka, Koichiro
Kanzawa, Jun
Kanemaru, Noriko
Koshino, Saori
Abe, Osamu
JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025, 38 (01): : 327 - 334
[23] ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge
Li, Yunxiang
Li, Zihan
Zhang, Kai
Dan, Ruilong
Jiang, Steve
Zhang, You
CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (06)
[24] Heterogeneous data-based information retrieval using a fine-tuned pre-trained BERT language model
Shaik, Amjan
Saxena, Surabhi
Gupta, Manisha
Parveen, Nikhat
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (21) : 59537 - 59559
[25] A large language model-based generative natural language processing framework fine-tuned on clinical notes accurately extracts headache frequency from electronic health records
Chiang, Chia-Chun
Luo, Man
Dumkrieger, Gina
Trivedi, Shubham
Chen, Yi-Chieh
Chao, Chieh-Ju
Schwedt, Todd J.
Sarker, Abeed
Banerjee, Imon
HEADACHE, 2024, 64 (04): : 400 - 409
[26] Experience is all you need: a large language model application of fine-tuned GPT-3.5 and RoBERTa for aspect-based sentiment analysis of college football stadium reviews
Qian, Tyreal Yizhou
Li, Weizhe
Gong, Hua
Seifried, Chad
Xu, Chenglong
SPORT MANAGEMENT REVIEW, 2025, 28 (01) : 1 - 25
[27] Large language model-based code generation for the control of construction assembly robots: A hierarchical generation approach
Luo, Hanbin
Wu, Jianxin
Liu, Jiajing
Antwi-Afari, Maxwell Fordjour
DEVELOPMENTS IN THE BUILT ENVIRONMENT, 2024, 19
[28] Automated Pathologic TN Classification Prediction and Rationale Generation From Lung CancerSurgical Pathology Reports Using a Large Language Model Fine-Tuned With Chain-of-Thought: Algorithm Development and Validation Study
Kim, Sanghwan
Jang, Sowon
Kim, Borham
Sunwoo, Leonard
Kim, Seok
Chung, Jin-Haeng
Nam, Sejin
Cho, Hyeongmin
Lee, Donghyoung
Lee, Keehyuck
Yoo, Sooyoung
JMIR MEDICAL INFORMATICS, 2024, 12
[29] A scientific-article key-insight extraction system based on multi-actor of fine-tuned open-source large language models
Song, Zihan
Hwang, Gyo-Yeob
Zhang, Xin
Huang, Shan
Park, Byung-Kwon
SCIENTIFIC REPORTS, 2025, 15 (01):
[30] Protecting Intellectual Property of Large Language Model-Based Code Generation APIs via Watermarks
Li, Zongjie
Wang, Chaozheng
Wang, Shuai
Gao, Cuiyun
PROCEEDINGS OF THE 2023 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, CCS 2023, 2023, : 2336 - 2350

← 1 2 3 4 →