Semi-Automatic Building and Learning of a Multilingual Ontology

被引:0
|
作者
Ben Mesmia, Fatma [1 ]
Mouhoub, Malek [1 ]
机构
[1] Univ Regina, Dept Comp Sci, 3737 Wascana Pkwy, Regina, SK S4S 0A2, Canada
关键词
Ontology building and learning; finite sate transducer; transducer cascade; API; Arabic NLP;
D O I
10.1145/3615864
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most online platforms, applications, and Websites use a massive amount of heterogeneous evolving data. These data must be structured and normalized before integration to improve the search and increase the relevance of results. An ontology can address this critical task by efficiently managing data and providing structured formats through techniques such as the Web Ontology Language (OWL). However, building an ontology can be costly, primarily if conducted manually. In this context, we propose a new methodology for automatically building and learning a multilingual ontology using Arabic as the base language via a corpus collected from Wikipedia. Our proposed methodology relies on Finite-state transducers (FSTs). FSTs are regrouped into a cascade to reduce errors and minimize ambiguity. The produced ontology is extended to English and French and independent language images via a translator we developed using APIs. The rationale for starting with the Arabic corpus to extract terms is that entity linking is more convenient from Arabic to other languages. In addition, many Wikipedia articles in English and French (for instance) do not have associated Arabic articles, but the opposite is true. In addition, dealing with Arabic terms permits us to enrich the Arabic module of the free linguistic platform we use in dictionaries and graphs. To assess the efficiency of our proposed methodology, we conducted performance metrics. The reported results are encouraging and promising.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Semi-automatic ontology alignment for geospatial data integration
    Cruz, IF
    Sunna, W
    Chaudhry, A
    GEOGRAPHIC INFORMATION SCIENCE, PROCEEDINGS, 2004, 3234 : 51 - 66
  • [22] An Information Extraction Process for Semi-automatic Ontology Population
    Faria, Carla
    Girardi, Rosario
    SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS, 6TH INTERNATIONAL CONFERENCE SOCO 2011, 2011, 87 : 319 - 328
  • [23] Semi-automatic construction of domain ontology for agent reasoning
    Choi, Ikkyu
    Rho, Seungmin
    Kim, Minkoo
    PERSONAL AND UBIQUITOUS COMPUTING, 2013, 17 (08) : 1721 - 1729
  • [24] Semi-automatic construction of domain ontology for agent reasoning
    Ikkyu Choi
    Seungmin Rho
    Minkoo Kim
    Personal and Ubiquitous Computing, 2013, 17 : 1721 - 1729
  • [25] A Holy Quran Ontology Construction with Semi-Automatic Population
    Elkhammash, Eman
    Ben Abdessalem, Wahiba
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2019, 12 (01): : 229 - 234
  • [26] A Preliminary Study on Semi-automatic Construction of Vietnamese Ontology
    Bao An Nguyen
    Yang, Don-Lin
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 3403 - 3408
  • [27] Semi-Automatic Approach for Ontology Enrichment using UMLS
    Rajput, Abdul Mateen
    Gurulingappa, Harsha
    4TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS-BIOLOGY AND BIOINFORMATICS (CSBIO2013), 2013, 23 : 78 - 83
  • [28] A semi-automatic ontology acquisition method for the Semantic Web
    Li, M
    Du, XY
    Wang, S
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 209 - 220
  • [29] Semi-automatic generation of multilingual datasets for stance detection in Twitter
    Zotova, Elena
    Agerri, Rodrigo
    Rigau, German
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 170
  • [30] MnM: Ontology driven semi-automatic and automatic support for semantic markup
    Vargas-Vera, M
    Motta, E
    Domingue, J
    Lanzoni, M
    Stutt, A
    Ciravegna, F
    KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, PROCEEDINGS: ONTOLOGIES AND THE SEMANTIC WEB, 2002, 2473 : 379 - 391