Ensemble Learning of Named Entity Recognition Algorithms using Multilayer Perceptron for the Multilingual Web of Data

被引:3
|
作者
Speck, Rene [1 ]
Ngomo, Axel-Cyrille Ngonga [2 ]
机构
[1] Univ Leipzig, Data Sci Grp, Augustuspl 10, D-04109 Leipzig, Germany
[2] Univ Paderborn, Data Sci Grp, Pohlweg 51, D-33098 Paderborn, Germany
基金
欧盟地平线“2020”;
关键词
Named Entity Recognition; Ensemble Learning; Multilingual; Semantic Web;
D O I
10.1145/3148011.3154471
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Implementing the multilingual Semantic Web vision requires transforming unstructured data in multiple languages from the Document Web into structured data for the multilingual Web of Data. We present the multilingual version of FOX, a knowledge extraction suite which supports this migration by providing named entity recognition based on ensemble learning for five languages. Our evaluation results show that our approach goes beyond the performance of existing named entity recognition systems on all five languages. In our best run, we outperform the state of the art by a gain of 32.38% F1-Score points on a Dutch dataset. More information and a demo can be found at http://fox.aksw.org as well as an extended version of the paper(1) descriping the evaluation in detail.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] Named entity recognition in crime using machine learning approach
    Shabat, Hafedh (h2005_ali@yahoo.com), 1600, Springer Verlag (8870):
  • [42] Medical Named Entity Recognition Using Weakly Supervised Learning
    Ma, Long-Long
    Yang, Jie
    An, Bo
    Liu, Shuaikang
    Huang, Gaijuan
    COGNITIVE COMPUTATION, 2022, 14 (03) : 1068 - 1079
  • [43] Named Entity Extraction from Semi-structured Data Using Machine Learning Algorithms
    Mansurova, Madina
    Barakhnin, Vladimir
    Khibatkhanuly, Yerzhan
    Pastushkov, Ilya
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT II, 2019, 11684 : 58 - 69
  • [44] ALDANER: Active Learning based Data Augmentation for Named Entity Recognition
    Moscato, Vincenzo
    Postiglione, Marco
    Sperli, Giancarlo
    Vignali, Andrea
    KNOWLEDGE-BASED SYSTEMS, 2024, 305
  • [45] A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers
    Hamdi, Ahmed
    Pontes, Elvys Linhares
    Boros, Emanuela
    Thi Tuyet Hai Nguyen
    Hackl, Guenter
    Moreno, Jose G.
    Doucet, Antoine
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2328 - 2334
  • [46] Majority or Minority: Data Imbalance Learning Method for Named Entity Recognition
    Nemoto, Sota
    Kitada, Shunsuke
    Iyatomi, Hitoshi
    IEEE ACCESS, 2025, 13 : 9902 - 9909
  • [47] Auxiliary Learning for Named Entity Recognition with Multiple Auxiliary Training Data
    Watanabe, Taiki
    Ichikawa, Tomoya
    Tamura, Akihiro
    Iwakura, Tomoya
    Ma, Chunpeng
    Kato, Tsuneo
    PROCEEDINGS OF THE 21ST WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2022), 2022, : 130 - 139
  • [48] Semi-Supervised Learning for Named Entity Recognition Using Weakly Labeled Training Data
    Zafarian, Atefeh
    Rokni, Ali
    Khadivi, Shahram
    Ghiasifard, Sonia
    2015 INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2015, : 129 - 135
  • [49] TLR at BSNLP2019: A Multilingual Named Entity Recognition System
    Moreno, Jose G.
    Pontes, Elvys Linhares
    Coustaty, Mickael
    Doucet, Antoine
    7TH WORKSHOP ON BALTO-SLAVIC NATURAL LANGUAGE PROCESSING (BSNLP'2019), 2019, : 83 - 88
  • [50] Dataset Enhancement and Multilingual Transfer for Named Entity Recognition in the Indonesian Language
    Khairunnisa, Siti Oryza
    Chen, Zhousi
    Komachi, Mamoru
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (06)