Biomedical Text NER Tagging Tool with Web Interface for Generating BERT-Based Fine-Tuning Dataset

被引:0
|
作者
Park, Yeon-Ji [1 ]
Lee, Min-a [1 ]
Yang, Geun-Je [1 ]
Park, Soo Jun [2 ]
Sohn, Chae-Bong [1 ]
机构
[1] Kwangwoon Univ, Dept Elect & Commun Engn, Seoul 01897, South Korea
[2] Elect & Telecommun Res Inst, Welf & Med ICT Res Dept, Daejeon 34129, South Korea
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 23期
基金
新加坡国家研究基金会;
关键词
dataset generation; BERT; tagging tool; web service; natural language process; text mining; named-entity recognition; fine-tuning model; NAMED ENTITY RECOGNITION; INFORMATION; MARKY;
D O I
10.3390/app122312012
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this paper, a tagging tool is developed to streamline the process of locating tags for each term and manually selecting the target term. It directly extracts the terms to be tagged from sentences and displays it to the user. It also increases tagging efficiency by allowing users to reflect candidate categories in untagged terms. It is based on annotations automatically generated using machine learning. Subsequently, this architecture is fine-tuned using Bidirectional Encoder Representations from Transformers (BERT) to enable the tagging of terms that cannot be captured using Named-Entity Recognition (NER). The tagged text data extracted using the proposed tagging tool can be used as an additional training dataset. The tagging tool, which receives and saves new NE annotation input online, is added to the NER and RE web interfaces using BERT. Annotation information downloaded by the user includes the category (e.g., diseases, genes/proteins) and the list of words associated to the named entity selected by the user. The results reveal that the RE and NER results are improved using the proposed web service by collecting more NE annotation data and fine-tuning the model using generated datasets. Our application programming interfaces and demonstrations are available to the public at via the website link provided in this paper.
引用
收藏
页数:13
相关论文
共 3 条
  • [1] Web Interface of NER and RE with BERT for Biomedical Text Mining
    Park, Yeon-Ji
    Lee, Min-a
    Yang, Geun-Je
    Park, Soo Jun
    Sohn, Chae-Bong
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [2] Fine-Tuning BERT-Based Pre-Trained Models for Arabic Dependency Parsing
    Al-Ghamdi, Sharefah
    Al-Khalifa, Hend
    Al-Salman, Abdulmalik
    APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [3] Investigation of BERT Model on Biomedical Relation Extraction Based on Revised Fine-tuning Mechanism
    Su, Peng
    Vijay-Shanker, K.
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2522 - 2529