Multilingual open information extraction: Challenges and opportunities

Authors
Claro D.B. [1]
Souza M. [1]
Xavier C.C. [2]
Oliveira L. [1]
Affiliations
[1] FORMAS Research Group, Computer Science Department, Federal University of Bahia, Salvador - BA
[2] FORMAS Research Group, Federal Institute of Rio Grande do Sul, Porto Alegre - RS
Source
Information (Switzerland) | 2019 / Vol. 10 / No. 07
Keywords
Multilingual; Open information extraction; Parallel corpus
DOI
10.3390/INFO10070228
Abstract
The number of documents published on the Web in languages other than English grows every year. As a consequence, the need to extract useful information from different languages increases, highlighting the importance of research into Open Information Extraction (OIE) techniques. Different OIE methods have dealt with features from a single language; however, few approaches tackle multilingual aspects. In those approaches, multilingualism is restricted to processing text in different languages, rather than exploring cross-linguistic resources, which results in low precision due to the use of general rules. Multilingual methods have been applied to numerous problems in Natural Language Processing, achieving satisfactory results and demonstrating that knowledge acquired for one language can be transferred to other languages to improve the quality of the facts extracted. We argue that a multilingual approach can enhance OIE methods as it is ideal to evaluate and compare OIE systems, and therefore can be applied to the collected facts. In this work, we discuss how the transfer of knowledge between languages can increase acquisition from multilingual approaches. We provide a roadmap of the Multilingual Open IE area concerning state-of-the-art studies. Additionally, we evaluate the transfer of knowledge to improve the quality of the facts extracted in each language. Moreover, we discuss the importance of a parallel corpus to evaluate and compare multilingual systems. © 2019 by the authors.
Related papers
50 in total
  • [1] Multilingual Open Information Extraction: Challenges and Opportunities
    Claro, Daniela Barreiro
    Souza, Marlo
    Xavier, Clarissa Castella
    Oliveira, Leandro
    INFORMATION, 2019, 10 (07)
  • [2] Multilingual Open Information Extraction
    Gamallo, Pablo
    Garcia, Marcos
    PROGRESS IN ARTIFICIAL INTELLIGENCE-BK, 2015, 9273 : 711 - 722
  • [3] The Opportunities and Challenges of Information Extraction
    Zhu, Qlan
    Cheng, Xianyi
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION WORKSHOP: IITA 2008 WORKSHOPS, PROCEEDINGS, 2008, : 597 - 600
  • [4] MILIE: Modular & Iterative Multilingual Open Information Extraction
    Kotnis, Bhushan
    Gashteovski, Kiril
    Onoro-Rubio, Daniel
    Shaker, Ammar
    Rodriguez-Tembras, Vanesa
    Takamoto, Makoto
    Niepert, Mathias
    Lawrence, Carolin
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6939 - 6950
  • [5] DetIE: Multilingual Open Information Extraction Inspired by Object Detection
    Vasilkovsky, Michael
    Alekseev, Anton
    Malykh, Valentin
    Shenbin, Ilya
    Tutubalina, Elena
    Salikhov, Dmitriy
    Stepnov, Mikhail
    Chertok, Andrey
    Nikolenko, Sergey
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11412 - 11420
  • [6] IndIE: A Multilingual Open Information Extraction Tool For Indic Languages
    Mishra, Ritwik
    Singh, Simranjeet
    Shah, Rajiv Ratn
    Kumaraguru, Ponnurangam
    Bhattacharyya, Pushpak
    13TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING AND THE 3RD CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, IJCNLP-AACL 2023, 2023, : 312 - 326
  • [7] Logical-linguistic model for multilingual Open Information Extraction
    Khairova, Nina
    Mamyrbayev, Orken
    Mukhsina, Kuralay
    Kolesnyk, Anastasiia
    COGENT ENGINEERING, 2020, 7 (01):
  • [8] Alignment-Augmented Consistent Translation for Multilingual Open Information Extraction
    Kolluru, Keshav
    Muqeeth, Mohammed
    Mittal, Shubham
    Chakrabarti, Soumen
    Mausam
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 2502 - 2517
  • [9] Challenges of an Annotation Task for Open Information Extraction in Portuguese
    Glauber, Rafael
    de Oliveira, Leandro Souza
    Lima Sena, Cleiton Fernando
    Claro, Daniela Barreiro
    Souza, Marlo
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 66 - 76
  • [10] The multilingual turn in languages education: opportunities and challenges
    Horii, Sachiko Yokoi
    LANGUAGE AND EDUCATION, 2015, 29 (04) : 369 - 371