Named Entity Recognition for Cancer Immunology Research Using Distant Supervision

被引:0
|
作者
Hai-Long Trieu [1 ,3 ]
Miwa, Makoto [1 ,2 ]
Ananiadou, Sophia [3 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, Artificial Intelligence Res Ctr AIRC, Tsukuba, Ibaraki, Japan
[2] Toyota Technol Inst, Toyota, Japan
[3] Univ Manchester, Natl Ctr Text Min, Manchester, Lancs, England
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer immunology research involves several important cell and protein factors. Extracting the information of such cells and proteins and the interactions between them from text are crucial in text mining for cancer immunology research. However, there are few available datasets for these entities, and the amount of annotated documents is not sufficient compared with other major named entity types. In this work, we introduce our automatically annotated dataset of key named entities, i.e., T-cells, cytokines, and transcription factors, which engages the recent cancer immunotherapy. The entities are annotated based on the UniProtKB knowledge base using dictionary matching. We build a neural named entity recognition (NER) model to be trained on this dataset and evaluate it on a manually-annotated data. Experimental results show that we can achieve a promising NER performance even though our data is automatically annotated. Our dataset also enhances the NER performance when combined with existing data, especially gaining improvement in yet investigated named entities such as cytokines and transcription factors.
引用
收藏
页码:171 / 177
页数:7
相关论文
共 50 条
  • [1] Research on the Named Entity Recognition for Rail Fault Text Based on Distant Supervision
    Cai, Yi
    Su, Shuai
    Li, Zheng
    Han, Qinglong
    Zhang, Jianxia
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 939 - 944
  • [2] Adaptive Named Entity Recognition Using Distant Supervision for Contemporary Written Texts
    Kim, Juae
    Kim, Yejin
    Kang, Sangwoo
    Seo, Jungyun
    IEEE ACCESS, 2021, 9 : 80405 - 80414
  • [3] Pattern-enhanced Named Entity Recognition with Distant Supervision
    Wang, Xuan
    Guan, Yingjun
    Zhang, Yu
    Li, Qi
    Han, Jiawei
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 818 - 827
  • [4] Named Entity Recognition for Open Domain Data Based on Distant Supervision
    Wu, Junshuang
    Zhang, Richong
    Deng, Ting
    Huai, Jinpeng
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING, 2019, 1134 : 185 - 197
  • [5] A template augmented distant supervision framework for Chinese named entity recognition
    Qi, Chengwen
    Laili, Yuanjun
    Ren, Lei
    Zhang, Lin
    Li, Bowen
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2024, 15 (01)
  • [6] Bagging-Based Active Learning Model for Named Entity Recognition with Distant Supervision
    Lee, Sunghee
    Song, Yeongkil
    Choi, Maengsik
    Kim, Harksoo
    2016 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2016, : 321 - 324
  • [7] Biomedical Named Entity Recognition with Less Supervision
    Ghiasvand, Omid
    Kate, Rohit J.
    2015 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2015), 2015, : 495 - 495
  • [8] Fine-Grained Named Entity Recognition with Distant Supervision in COVID-19 Literature
    Wang, Xuan
    Song, Xiangchen
    Li, Bangzheng
    Zhou, Kang
    Li, Qi
    Han, Jiawei
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 491 - 494
  • [9] BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
    Liang, Chen
    Yu, Yue
    Jiang, Haoming
    Er, Siawpeng
    Wang, Ruijia
    Zhao, Tuo
    Zhang, Chao
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1054 - 1064
  • [10] Noise Detection for Distant Supervised Named Entity Recognition
    Wang J.
    Wang K.
    Wang H.
    Du W.
    He Z.
    Ruan T.
    Liu J.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (04): : 916 - 928