TCMChat: A generative large language model for traditional Chinese medicine

Authors
Dai, Yizheng [1 ,2 ]
Shao, Xin [1 ,2 ,3 ]
Zhang, Jinlu [1 ,2 ]
Chen, Yulong [2 ]
Chen, Qian [1 ,2 ,3 ]
Liao, Jie [1 ,2 ]
Chi, Fei [2 ]
Zhang, Junhua [4 ]
Fan, Xiaohui [1 ,2 ,3 ,5 ]
Affiliations
[1] Zhejiang Univ, Pharmaceut Informat Inst, Coll Pharmaceut Sci, Hangzhou 310058, Peoples R China
[2] Zhejiang Univ, Innovat Ctr Yangtze River Delta, State Key Lab Chinese Med Modernizat, Jiaxing 314103, Peoples R China
[3] Ningbo Municipal Hosp TCM, Joint Lab Clin Multiomics Res Zhejiang Univ & Ning, Ningbo 315000, Peoples R China
[4] Tianjin Univ Tradit Chinese Med, State Key Lab Chinese Med Modernizat, Tianjin 301617, Peoples R China
[5] Zhejiang Univ, Womens Hosp, Sch Med, Zhejiang Key Lab Precis Diag & Therapy Major Gynec, Hangzhou 310006, Peoples R China
Keywords
Traditional Chinese medicine; Large language model; Dialogue system; Pre-training; Supervised fine-tuning
DOI
10.1016/j.phrs.2024.107530
Chinese Library Classification number
R9 [Pharmacy]
Discipline classification code
1007
Abstract
The use of large language models (LLMs) coupled with dialogue systems has become increasingly prevalent in the medical domain. Nevertheless, the expertise of LLMs in Traditional Chinese Medicine (TCM) remains limited, despite several TCM LLMs having been proposed recently. Here, we introduce TCMChat (https://xomics.com.cn/tcmchat), a generative LLM built with pre-training (PT) and supervised fine-tuning (SFT) on large-scale curated TCM text knowledge and Chinese question-answering (QA) datasets. Specifically, we first compiled, through text mining and manual verification, a customized training set covering six Chinese medicine scenarios: TCM knowledge base, choice questions, reading comprehension, entity extraction, medical case diagnosis, and herb or formula recommendation. Next, we subjected the model to PT and SFT, using Baichuan2-7B-Chat as the foundation model. Benchmarking datasets and case studies further demonstrate the superior performance of TCMChat in comparison with existing models. Our code, data, and model are publicly released on GitHub (https://github.com/ZJUFanLab/TCMChat) and HuggingFace (https://huggingface.co/ZJUFanLab), providing a high-quality knowledge base for research on TCM modernization together with a user-friendly dialogue web tool.
Pages: 15