Semi-Automatic Building and Learning of a Multilingual Ontology

被引:0
|
作者
Ben Mesmia, Fatma [1 ]
Mouhoub, Malek [1 ]
机构
[1] Univ Regina, Dept Comp Sci, 3737 Wascana Pkwy, Regina, SK S4S 0A2, Canada
关键词
Ontology building and learning; finite sate transducer; transducer cascade; API; Arabic NLP;
D O I
10.1145/3615864
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most online platforms, applications, and Websites use a massive amount of heterogeneous evolving data. These data must be structured and normalized before integration to improve the search and increase the relevance of results. An ontology can address this critical task by efficiently managing data and providing structured formats through techniques such as the Web Ontology Language (OWL). However, building an ontology can be costly, primarily if conducted manually. In this context, we propose a new methodology for automatically building and learning a multilingual ontology using Arabic as the base language via a corpus collected from Wikipedia. Our proposed methodology relies on Finite-state transducers (FSTs). FSTs are regrouped into a cascade to reduce errors and minimize ambiguity. The produced ontology is extended to English and French and independent language images via a translator we developed using APIs. The rationale for starting with the Arabic corpus to extract terms is that entity linking is more convenient from Arabic to other languages. In addition, many Wikipedia articles in English and French (for instance) do not have associated Arabic articles, but the opposite is true. In addition, dealing with Arabic terms permits us to enrich the Arabic module of the free linguistic platform we use in dictionaries and graphs. To assess the efficiency of our proposed methodology, we conducted performance metrics. The reported results are encouraging and promising.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] SEMI-Automatic approach to domain ontology building
    Gorskis, Henrihs
    Zmanovska, Tatjana
    Chizhov, Jurijs
    AICT 2013: APPLIED INFORMATION AND COMMUNICATION TECHNOLOGIES, 2013, : 55 - 61
  • [2] Semi-automatic Tool for Ontology Learning Tasks
    Sebek, Ondrej
    Jirkovsky, Vaclav
    Rychtyckyj, Nestor
    Kadera, Petr
    INDUSTRIAL APPLICATIONS OF HOLONIC AND MULTI-AGENT SYSTEMS (HOLOMAS 2019), 2019, 11710 : 119 - 129
  • [3] MANUAL AND SEMI-AUTOMATIC APPROACHES TO BUILDING A MULTILINGUAL PHONEME SET
    Egorova, Ekaterina
    Vesely, Karel
    Karafiat, Martin
    Janda, Milos
    Cernocky, Jan
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7324 - 7328
  • [4] Semi-automatic ontology construction based on text learning
    Wang, Ying
    Zuo, Wanli
    Peng, Tao
    Sun, Yifei
    Journal of Information and Computational Science, 2010, 7 (02): : 495 - 501
  • [5] ONTOLOGY DEVELOPMENT FOR GREEN BUILDING BY USING A SEMI-AUTOMATIC METHOD
    Yan, Hang
    Shi, Yiming
    Lu, Xuteng
    JOURNAL OF GREEN BUILDING, 2023, 18 (04): : 129 - 147
  • [6] Semi-automatic ontology bridging
    Silva, N
    Rocha, J
    IKE '05: Proceedings of the 2005 International Conference on Information and Knowledge Engineering, 2005, : 192 - 198
  • [7] A New Approach for Semi-Automatic Building and Extending a Multilingual Terminology Thesaurus
    Horak, Ales
    Baisa, Vit
    Rambousek, Adam
    Suchomel, Vit
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2019, 28 (02)
  • [8] Semi-automatic terminology ontology learning based on topic modeling
    Rani, Monika
    Dhar, Amit Kumar
    Vyas, O. P.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 63 : 108 - 125
  • [9] Towards A Semi-Automatic Method For Building Chinese Tax Domain Ontology
    Qiu, Yu
    Cheng, Li
    Alghazzawi, Daniyal
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017, : 2530 - 2539
  • [10] SASOBUS: Semi-automatic Sentiment Domain Ontology Building Using Synsets
    Dera, Ewelina
    Frasincar, Flavius
    Schouten, Kim
    Zhuang, Lisa
    SEMANTIC WEB (ESWC 2020), 2020, 12123 : 105 - 120