Optimized biomedical entity relation extraction method with data augmentation and classification using GPT-4 and Gemini

Cited by: 0
Authors
Phan, Cong-Phuoc [1]
Phan, Ben [1]
Chiang, Jung-Hsien [1]
Affiliations
[1] National Cheng Kung University, Department of Computer Science and Information Engineering, 1 University Road, Tainan 701, Taiwan
DOI
10.1093/database/baae104
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Teams participating in BioCreative VIII Track 01 have applied a range of techniques to achieve high accuracy on biomedical relation extraction tasks, yet overall performance in this area still has substantial room for improvement. Large language models offer a new opportunity to improve the performance of existing techniques in natural language processing tasks. This paper presents our improved method for relation extraction, which integrates two well-known large language models: Gemini and GPT-4. Our approach uses GPT-4 to generate augmented training data, then applies an ensemble learning technique that combines the outputs of diverse models into a more precise prediction. We then use Gemini responses as input to fine-tune the BioNLP-PubMed-Bert classification model, which improves precision, recall, and F1 scores on the same test dataset used in the challenge evaluation.
Database URL: https://biocreative.bioinformatics.udel.edu/tasks/biocreative-viii/track-1/
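The abstract does not say how the model outputs are combined or how the scores are computed. As a minimal sketch only, assuming simple majority voting over per-model relation labels and micro-averaged scoring that ignores a "None" (no relation) class, the ensemble and evaluation steps might look like the following. The function names, label names, and example predictions are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def majority_vote(predictions):
    """Combine per-model label lists into one list by majority vote.

    Assumption: ties go to the label proposed by the earliest model.
    """
    combined = []
    for labels in zip(*predictions):
        counts = Counter(labels)
        top = max(counts.values())
        # Pick the first label, in model order, that reaches the top count.
        combined.append(next(lab for lab in labels if counts[lab] == top))
    return combined


def precision_recall_f1(gold, pred, negative="None"):
    """Micro-averaged precision, recall, and F1 over non-negative labels."""
    tp = sum(1 for g, p in zip(gold, pred) if p != negative and g == p)
    pred_pos = sum(1 for p in pred if p != negative)
    gold_pos = sum(1 for g in gold if g != negative)
    precision = tp / pred_pos if pred_pos else 0.0
    recall = tp / gold_pos if gold_pos else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical predictions from three models for four entity pairs.
model_a = ["Association", "None", "Positive_Correlation", "None"]
model_b = ["Association", "Association", "None", "None"]
model_c = ["None", "Association", "Positive_Correlation", "None"]
gold = ["Association", "None", "Positive_Correlation", "None"]

combined = majority_vote([model_a, model_b, model_c])
precision, recall, f1 = precision_recall_f1(gold, combined)
print(combined)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

Voting on hard labels is only one option; averaging class probabilities or weighting models by validation score are common alternatives, and the paper may use a different scheme.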
Pages: 8