Named Entity Recognition for Cancer Immunology Research Using Distant Supervision

被引:0
|
作者
Hai-Long Trieu [1 ,3 ]
Miwa, Makoto [1 ,2 ]
Ananiadou, Sophia [3 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, Artificial Intelligence Res Ctr AIRC, Tsukuba, Ibaraki, Japan
[2] Toyota Technol Inst, Toyota, Japan
[3] Univ Manchester, Natl Ctr Text Min, Manchester, Lancs, England
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer immunology research involves several important cell and protein factors. Extracting the information of such cells and proteins and the interactions between them from text are crucial in text mining for cancer immunology research. However, there are few available datasets for these entities, and the amount of annotated documents is not sufficient compared with other major named entity types. In this work, we introduce our automatically annotated dataset of key named entities, i.e., T-cells, cytokines, and transcription factors, which engages the recent cancer immunotherapy. The entities are annotated based on the UniProtKB knowledge base using dictionary matching. We build a neural named entity recognition (NER) model to be trained on this dataset and evaluate it on a manually-annotated data. Experimental results show that we can achieve a promising NER performance even though our data is automatically annotated. Our dataset also enhances the NER performance when combined with existing data, especially gaining improvement in yet investigated named entities such as cytokines and transcription factors.
引用
收藏
页码:171 / 177
页数:7
相关论文
共 50 条
  • [31] Research on Chinese named entity recognition using combined boundary-PoS feature
    Qiang, Bao-Hua
    Huang, Jun
    Wang, Yu-Feng
    Wang, Sai
    Wang, Yong
    DESIGN, MANUFACTURING AND MECHATRONICS (ICDMM 2015), 2016, : 839 - 848
  • [32] Named Entity Recognition for Amharic Using Deep Learning
    Gamback, Bjorn
    Sikdar, Utpal Kumar
    2017 IST-AFRICA WEEK CONFERENCE (IST-AFRICA), 2017,
  • [33] Using WordNet Predicates for Multilingual Named Entity Recognition
    Negri, Matteo
    Magnini, Bernardo
    GWC 2004: SECOND INTERNATIONAL WORDNET CONFERENCE, PROCEEDINGS, 2003, : 169 - 174
  • [34] Less than One-shot: Named Entity Recognition via ExtremelyWeak Supervision
    Peng, Letian
    Wang, Zihan
    Shang, Jingbo
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 13603 - 13616
  • [35] Weak Supervision and Clustering-Based Sample Selection for Clinical Named Entity Recognition
    Sun, Wei
    Ji, Shaoxiong
    Denti, Tuulia
    Moen, Hans
    Kerro, Oleg
    Rannikko, Antti
    Marttinen, Pekka
    Koskinen, Miika
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 444 - 459
  • [36] Research on Named Entity Recognition Method of Network Threat Intelligence
    Zhang, Keke
    Chen, Xu
    Jing, Yongjun
    Wang, Shuyang
    Tang, Lijun
    CYBER SECURITY, CNCERT 2022, 2022, 1699 : 213 - 224
  • [37] Named entity recognition for Arabic using syntactic grammars
    Mesfar, Slim
    Natural Language Processing and Information Systems, Proceedings, 2007, 4592 : 305 - 316
  • [38] Domain Adaptation for Named Entity Recognition Using CRFs
    Tian, Tian
    Dinarelli, Marco
    Tellier, Isabelle
    Cardoso, Pedro Dias
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 561 - 565
  • [39] Named Entity Recognition for Vietnamese
    Dat Ba Nguyen
    Son Huu Hoang
    Son Bao Pham
    Thai Phuong Nguyen
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, PROCEEDINGS, 2010, 5991 : 205 - 214
  • [40] Named entity recognition in Bengali using system combination
    Ekbal, Asif
    Bandyopadhyay, Sivaji
    LINGUISTICAE INVESTIGATIONES, 2014, 37 (01): : 1 - 22