DRAK: Unlocking Molecular Insights with Domain-Specific Retrieval-Augmented Knowledge in LLMs

被引:0
|
作者
Liu, Jinzhe [1 ,2 ]
Huang, Xiangsheng [3 ]
Chen, Zhuo [4 ]
Fang, Yin [4 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Chinese Acad Sci, Xiongan Inst Innovat, Hebei Key Lab Cognit Intelligence, Baoding, Peoples R China
[4] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
关键词
Retrieval-augmented knowledge; Knowledge injection; Biomolecular domain; LANGUAGE;
D O I
10.1007/978-981-97-9434-8_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large Language Models (LLMs) typically manifest knowledge gap in specialized applications due to pre-training on generalized textual corpora. Although fine-tuning and modality alignment aim to bridge this gap, their inability to provide comprehensive knowledge coverage leads to LLMs delivering imprecise responses. To address these challenges, we introduce a scalable and adaptable non-parametric knowledge injection framework, Domain-specific Retrieval-Augmented Knowledge (DRAK), aimed at bolstering LLMs' knowledge reasoning ability through context examples. DRAK integrates retrieval enhancement and structured knowledge graph recall of high-quality instances, utilizing retrieved examples to unlock LLMs' context-relevant molecular learning capabilities, offering a universal solution for specific domains. Our validation of DRAK's effectiveness and generalizability in the biomolecular domain, achieving superior performance across twelve tasks involving both molecule-oriented and bioinformatics texts within the Mol-Instructions dataset. This demonstration of DRAK's ability to unearth molecular insights establishes a standardized approach for LLMs in navigating the complexities of knowledge-intensive challenges.
引用
收藏
页码:255 / 267
页数:13
相关论文
共 50 条
  • [41] Application of retrieval-augmented generation for interactive industrial knowledge management via a large language model
    Chen, Lun-Chi
    Pardeshi, Mayuresh Sunil
    Liao, Yi-Xiang
    Pai, Kai-Chih
    COMPUTER STANDARDS & INTERFACES, 2025, 94
  • [42] LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion
    Zhao, Pancheng
    Xu, Peng
    Qin, Pengda
    Fan, Deng-Ping
    Zhang, Zhicheng
    Jia, Guoli
    Zhou, Bowen
    Yang, Jufeng
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 4092 - 4101
  • [43] Can LLMs revolutionize text mining in chemistry? A comparative study with domain-specific tools
    Kumari, Madhavi
    Chauhan, Rohit
    Garg, Prabha
    COMPUTER STANDARDS & INTERFACES, 2025, 94
  • [44] Improving Industrial Question Answering Chatbots with Domain-Specific LLMs Fine-Tuning
    Rosati, Riccardo
    Antonini, Filippo
    Muralikrishna, Nikhil
    Tonetto, Flavin
    Mancini, Adriano
    2024 20TH IEEE/ASME INTERNATIONAL CONFERENCE ON MECHATRONIC AND EMBEDDED SYSTEMS AND APPLICATIONS, MESA 2024, 2024,
  • [45] Advancing General Sensor Data Synthesis by Integrating LLMs and Domain-Specific Generative Models
    Zhou, Xiaomao
    Jia, Qingmin
    Hu, Yujiao
    IEEE SENSORS LETTERS, 2024, 8 (11)
  • [46] Transferrable Framework Based on Knowledge Graphs for Generating Explainable Results in Domain-Specific, Intelligent Information Retrieval
    Abu-Rasheed, Hasan
    Weber, Christian
    Zenkert, Johannes
    Dornhofer, Mareike
    Fathi, Madjid
    INFORMATICS-BASEL, 2022, 9 (01):
  • [47] MDL, a domain-specific language for molecular dynamics
    Cickovski, Trevor
    Sweet, Chris
    Izaguirre, Jesus A.
    40TH ANNUAL SIMULATION SYMPOSIUM, PROCEEDINGS, 2007, : 256 - +
  • [48] Domain-specific cross-language relevant question retrieval
    Bowen Xu
    Zhenchang Xing
    Xin Xia
    David Lo
    Shanping Li
    Empirical Software Engineering, 2018, 23 : 1084 - 1122
  • [49] Toward a Semantic Granularity Model for Domain-Specific Information Retrieval
    Yan, Xin
    Lau, Raymond Y. K.
    Song, Dawei
    Li, Xue
    Ma, Jian
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2011, 29 (03)
  • [50] Enhanced Information Retrieval Using Domain-Specific Recommender Models
    Li, Wei
    Ganguly, Debasis
    Jones, Gareth J. F.
    ADVANCES IN INFORMATION RETRIEVAL THEORY, 2011, 6931 : 201 - 212