Retrieval-Augmented Generation Approach: Document Question Answering using Large Language Model

被引:0
|
作者
Muludi, Kurnia [1 ]
Fitria, Kaira Milani [1 ]
Triloka, Joko [1 ]
Sutedi [1 ]
机构
[1] Darmajaya Informat & Business Inst, Informat Engn Grad Program, Bandar Lampung, Indonesia
关键词
Natural Language Processing; Large Language Model; Retrieval Augmented Generation; Question Answering; GPT;
D O I
10.14569/IJACSA.2024.0150379
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This study introduces the Retrieval Augmented Generation (RAG) method to improve Question-Answering (QA) systems by addressing document processing in Natural Language Processing problems. It represents the latest breakthrough in applying RAG to document question and answer applications, overcoming previous QA system obstacles. RAG combines search techniques in vector store and text generation mechanism developed by Large Language Models, offering a time-efficient alternative to manual reading limitations. The research evaluates RAG's that use Generative Pre-trained Transformer 3.5 or GPT-3.5-turbo from the ChatGPT model and its impact on document data processing, comparing it with other applications. This research also provides datasets to test the capabilities of the QA document system. The proposed dataset and Stanford Question Answering Dataset (SQuAD) are used for performance testing. The study contributes theoretically by advancing methodologies and knowledge representation, supporting benchmarking in research communities. Results highlight RAG's superiority: achieving a precision of 0.74 in Recall-Oriented Understudy for Gisting Evaluation (ROUGE) testing, outperforming others at 0.5; obtaining an F1 score of 0.88 in BERTScore, surpassing other QA apps at 0.81; attaining a precision of 0.28 in Bilingual Evaluation Understudy (BLEU) testing, surpassing others with a precision of 0.09; and scoring 0.33 in Jaccard Similarity, outshining others at 0.04. These findings underscore RAG's efficiency and competitiveness, promising a positive impact on various industrial sectors through advanced Artificial Intelligence (AI) technology.
引用
收藏
页码:776 / 785
页数:10
相关论文
共 50 条
  • [41] Integrating Retrieval-Augmented Generation with Large Language Models in Nephrology: Advancing Practical Applications
    Miao, Jing
    Thongprayoon, Charat
    Suppadungsuk, Supawadee
    Valencia, Oscar A. Garcia
    Cheungpasitporn, Wisit
    MEDICINA-LITHUANIA, 2024, 60 (03):
  • [42] Fine-grained knowledge fusion for retrieval-augmented medical visual question answering
    Liang, Xiao
    Wang, Di
    Jing, Bin
    Jiao, Zhicheng
    Li, Ronghan
    Liu, Ruyi
    Miao, Qiguang
    Wang, Quan
    INFORMATION FUSION, 2025, 120
  • [43] Zero-Shot ECG Diagnosis with Large Language Models and Retrieval-Augmented Generation
    Yu, Han
    Guo, Peikun
    Sano, Akane
    MACHINE LEARNING FOR HEALTH, ML4H, VOL 225, 2023, 225 : 650 - 663
  • [44] M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions
    Wang, Zheng
    Teo, Shu Xian
    Ouyang, Jieer
    Xu, Yongjun
    Shi, Wei
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 1966 - 1978
  • [45] Hallucination Mitigation for Retrieval-Augmented Large Language Models: A Review
    Zhang, Wan
    Zhang, Jing
    MATHEMATICS, 2025, 13 (05)
  • [46] Leveraging Retrieval-Augmented Generation for Swahili Language Conversation Systems
    Ndimbo, Edmund V.
    Luo, Qin
    Fernando, Gimo C.
    Yang, Xu
    Wang, Bang
    APPLIED SCIENCES-BASEL, 2025, 15 (02):
  • [47] Emergency Patient Triage Improvement through a Retrieval-Augmented Generation Enhanced Large-Scale Language Model
    Yazaki, Megumi
    Maki, Satoshi
    Furuya, Takeo
    Inoue, Ken
    Nagai, Ko
    Nagashima, Yuki
    Maruyama, Juntaro
    Toki, Yasunori
    Kitagawa, Kyota
    Iwata, Shuhei
    Kitamura, Takaki
    Gushiken, Sho
    Noguchi, Yuji
    Inoue, Masahiro
    Shiga, Yasuhiro
    Inage, Kazuhide
    Orita, Sumihisa
    Nakada, Takaaki
    Ohtori, Seiji
    PREHOSPITAL EMERGENCY CARE, 2024,
  • [48] Evaluating Retrieval Quality in Retrieval-Augmented Generation
    Salemi, Alireza
    Zamani, Hamed
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2395 - 2400
  • [49] Resolving Unseen Rumors with Retrieval-Augmented Large Language Models
    Chen, Lei
    Wei, Zhongyu
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT IV, NLPCC 2024, 2025, 15362 : 319 - 332
  • [50] RA-CFGPT: Chinese financial assistant with retrieval-augmented large language model
    Li, Jiangtong
    Lei, Yang
    Bian, Yuxuan
    Cheng, Dawei
    Ding, Zhijun
    Jiang, Changjun
    FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (05)