CycleNER: An Unsupervised Training Approach for Named Entity Recognition

被引:19
|
作者
Iovine, Andrea [1 ]
Fang, Anjie [2 ]
Fetahu, Besnik [2 ]
Rokhlenko, Oleg [2 ]
Malmasi, Shervin [2 ]
机构
[1] Univ Bari Aldo Moro, Bari, Italy
[2] Amazoncom Inc, Bellevue, WA USA
关键词
natural language processing; named entity recognition; cycleconsistency; training; unsupervised training;
D O I
10.1145/3485447.3512012
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Named Entity Recognition (NER) is a crucial natural language understanding task for many down-stream tasks such as question answering and retrieval. Despite significant progress in developing NER models for multiple languages and domains, scaling to emerging and/or low-resource domains still remains challenging, due to the costly nature of acquiring training data. We propose CycleNER, an unsupervised approach based on cycle-consistency training that uses two functions: (i) sentence-to-entity - S2E and (ii) entity-to-sentence - E2S, to carry out the NER task. CycleNER does not require annotations but a set of sentences with no entity labels and another independent set of entity examples. Through cycle-consistency training, the output from one function is used as input for the other (e.g. S2E. E2S) to align the representation spaces of both functions and therefore enable unsupervised training. Evaluation on several domains comparing CycleNER against supervised and unsupervised competitors shows that CycleNER achieves highly competitive performance with only a few thousand input sentences. We demonstrate competitive performance against supervised models, achieving 73% of supervised performance without any annotations on CoNLL03, while significantly outperforming unsupervised approaches.
引用
收藏
页码:2916 / 2924
页数:9
相关论文
共 50 条
  • [21] Unsupervised Named Entity Recognition and Disambiguation: An Application to Old French Journals
    Mosallam, Yusra
    Abi-Haidar, Alaa
    Ganascia, Jean-Gabriel
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, 2014, 8557 : 12 - 23
  • [22] Unsupervised named entity recognition using syntactic and semantic contextual evidence
    Cucchiarelli, A
    Velardi, P
    COMPUTATIONAL LINGUISTICS, 2001, 27 (01) : 123 - 131
  • [23] Unsupervised named-entity recognition: Generating gazetteers and resolving ambiguity
    Nadeau, David
    Turney, Peter D.
    Matwin, Stan
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4013 : 266 - 277
  • [24] OWNER - Toward Unsupervised Open-World Named Entity Recognition
    Genest, Pierre-Yves
    Portier, Pierre-Edouard
    Egyed-Zsigmond, Elod
    Lovisetto, Martino
    IEEE ACCESS, 2025, 13 : 50077 - 50105
  • [25] Unsupervised biomedical named entity recognition: Experiments with clinical and biological texts
    Zhang, Shaodian
    Elhadad, Noemie
    JOURNAL OF BIOMEDICAL INFORMATICS, 2013, 46 (06) : 1088 - 1098
  • [26] Hungarian named entity recognition with a maximum entropy approach
    Varga, Daniel
    Simon, Eszter
    ACTA CYBERNETICA, 2007, 18 (02): : 293 - 301
  • [27] An Adaptive Approach for Web Scale Named Entity Recognition
    Zhu, Jianhan
    2009 1ST IEEE SYMPOSIUM ON WEB SOCIETY, PROCEEDINGS, 2009, : 41 - 46
  • [28] A Novel Hybrid Approach to Arabic Named Entity Recognition
    Meselhi, Mohamed A.
    Bakr, Hitham M. Abo
    Ziedan, Ibrahim
    Shaalan, Khaled
    MACHINE TRANSLATION, CWMT 2014, 2014, 493 : 93 - 103
  • [29] Deep Learning Approach for Arabic Named Entity Recognition
    Gridach, Mourad
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 : 439 - 451
  • [30] Semantic Crawling: an Approach based on Named Entity Recognition
    Di Pietro, Giulia
    Aliprandi, Carlo
    De Luca, Antonio E.
    Raffaelli, Matteo
    Soru, Tiziana
    2014 PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2014), 2014, : 695 - 699