A fine-tuned large language model based molecular dynamics agent for code generation to obtain material thermodynamic parameters

被引:0
|
作者
Zhuofan Shi [1 ]
Chunxiao Xin [2 ]
Tong Huo [3 ]
Yuntao Jiang [1 ]
Bowen Wu [2 ]
Xingyue Chen [3 ]
Wei Qin [2 ]
Xinjian Ma [3 ]
Gang Huang [4 ]
Zhenyu Wang [1 ]
Xiang Jing [2 ]
机构
[1] Peking University,School of Software and Microelectronics
[2] National Key Laboratory of Data Space Technology and System,Institute of Information Engineering
[3] Advanced Institute of Big Data,undefined
[4] Chinese Academy of Sciences,undefined
关键词
LLM; Agent; Materials science;
D O I
10.1038/s41598-025-92337-6
中图分类号
学科分类号
摘要
In the field of materials science, addressing the complex relationship between the material structure and properties has increasingly involved leveraging the text generation capabilities of AI-generated content (AIGC) models for tasks that include literature mining and data analysis. However, theoretical calculations and code development remain labor-intensive challenges. This paper proposes a novel approach based on text-to-code generation, utilizing large language models to automate the implementation of simulation programs in materials science. The effectiveness of automated code generation and review is validated with thermodynamics simulations based on the LAMMPS software as a foundation. This study introduces Molecular Dynamics Agent (MDAgent), a framework designed to guide large models in automatically generating, executing, and refining simulation code. In addition, a thermodynamic simulation code dataset for LAMMPS was constructed to fine-tune the language model. Expert evaluation scores demonstrate that MDAgent significantly improves the code generation and review capabilities. The proposed approach reduces the average task time by 42.22%, as compared to traditional models, thus highlighting its potential applications in the field of materials science.
引用
收藏
相关论文
共 38 条
  • [21] NM-GPT: Advancing Nuclear Medicine Report Processing Through a Specialized Fine-tuned Large Language Model
    Lyu, Zhiliang
    Zeng, Fang
    Guo, Ning
    Li, Xiang
    Li, Quanzheng
    JOURNAL OF NUCLEAR MEDICINE, 2024, 65
  • [22] Fine-Tuned Large Language Model for Extracting Patients on Pretreatment for Lung Cancer from a Picture Archiving and Communication System Based on Radiological Reports
    Yasaka, Koichiro
    Kanzawa, Jun
    Kanemaru, Noriko
    Koshino, Saori
    Abe, Osamu
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025, 38 (01): : 327 - 334
  • [23] ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge
    Li, Yunxiang
    Li, Zihan
    Zhang, Kai
    Dan, Ruilong
    Jiang, Steve
    Zhang, You
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (06)
  • [24] Heterogeneous data-based information retrieval using a fine-tuned pre-trained BERT language model
    Shaik, Amjan
    Saxena, Surabhi
    Gupta, Manisha
    Parveen, Nikhat
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (21) : 59537 - 59559
  • [25] A large language model-based generative natural language processing framework fine-tuned on clinical notes accurately extracts headache frequency from electronic health records
    Chiang, Chia-Chun
    Luo, Man
    Dumkrieger, Gina
    Trivedi, Shubham
    Chen, Yi-Chieh
    Chao, Chieh-Ju
    Schwedt, Todd J.
    Sarker, Abeed
    Banerjee, Imon
    HEADACHE, 2024, 64 (04): : 400 - 409
  • [26] Experience is all you need: a large language model application of fine-tuned GPT-3.5 and RoBERTa for aspect-based sentiment analysis of college football stadium reviews
    Qian, Tyreal Yizhou
    Li, Weizhe
    Gong, Hua
    Seifried, Chad
    Xu, Chenglong
    SPORT MANAGEMENT REVIEW, 2025, 28 (01) : 1 - 25
  • [27] Large language model-based code generation for the control of construction assembly robots: A hierarchical generation approach
    Luo, Hanbin
    Wu, Jianxin
    Liu, Jiajing
    Antwi-Afari, Maxwell Fordjour
    DEVELOPMENTS IN THE BUILT ENVIRONMENT, 2024, 19
  • [28] Automated Pathologic TN Classification Prediction and Rationale Generation From Lung CancerSurgical Pathology Reports Using a Large Language Model Fine-Tuned With Chain-of-Thought: Algorithm Development and Validation Study
    Kim, Sanghwan
    Jang, Sowon
    Kim, Borham
    Sunwoo, Leonard
    Kim, Seok
    Chung, Jin-Haeng
    Nam, Sejin
    Cho, Hyeongmin
    Lee, Donghyoung
    Lee, Keehyuck
    Yoo, Sooyoung
    JMIR MEDICAL INFORMATICS, 2024, 12
  • [29] A scientific-article key-insight extraction system based on multi-actor of fine-tuned open-source large language models
    Song, Zihan
    Hwang, Gyo-Yeob
    Zhang, Xin
    Huang, Shan
    Park, Byung-Kwon
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [30] Protecting Intellectual Property of Large Language Model-Based Code Generation APIs via Watermarks
    Li, Zongjie
    Wang, Chaozheng
    Wang, Shuai
    Gao, Cuiyun
    PROCEEDINGS OF THE 2023 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, CCS 2023, 2023, : 2336 - 2350