Extracting Related Concepts from Wikipedia by Using a Graph Database System

被引:0
|
作者
Omasa, Asuka [1 ]
Inoue, Ushio [1 ]
机构
[1] Tokyo Denki Univ, Grad Sch Engn, Tokyo, Japan
来源
2019 20TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD) | 2019年
关键词
semantic relatedness; graph databases; Wikipedia; web structure mining;
D O I
10.1109/snpd.2019.8935874
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Wikipedia has been used as a corpus for research on natural language processing due to a huge number of articles and a high comprehensiveness of its vocabulary. The purpose of this research is to measure the semantic relatedness between two concepts covered by Wikipedia articles. The hyperlink structure among articles is focused because a link from an article to another article indicates an explicit relationship between them. However, the cost of analyzing links may be a problem if the article links to or linked from a lot of articles. A graph database system enables quick and efficient access to related data without handling unrelated data, because it stores data with predefined relationships among data. This paper proposes a method for extracting related concepts from Wikipedia by using a graph database system.
引用
收藏
页码:268 / 273
页数:6
相关论文
共 50 条
  • [21] Extracting Weighted Language Lexicons from Wikipedia
    Grefenstette, Gregory
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 1365 - 1368
  • [22] Extracting Named Entities and Synonyms from Wikipedia
    Bohn, Christian
    Norvag, Kjetil
    2010 24TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2010, : 1300 - 1307
  • [23] An approach for extracting bilingual terminology from wikipedia
    Erdmann, Maike
    Nakayama, Kotaro
    Hara, Takahiro
    Nishio, Shojiro
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2008, 4947 : 380 - 392
  • [24] Issues and Concepts of Graph Database and a Comparative Analysis on list of Graph Database tools
    Das, Anupam
    Mitra, Anirban
    Bhagat, Surendra Nath
    Paul, Subrata
    2020 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI - 2020), 2020, : 353 - 358
  • [25] A BERT-Based Approach for Extracting Prerequisite Relations among Wikipedia Concepts
    Bai, Youheng
    Zhang, Yan
    Xiao, Kui
    Lou, Yuanyuan
    Sun, Kai
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [26] Extracting Key Terms From Micro-Blogs Messages Using Wikipedia
    Al-Zubi, Ahmad Ali
    WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, WCECS 2013, VOL I, 2013, I : 46 - 51
  • [27] Extracting and mapping industry 4.0 technologies using wikipedia
    Chiarello, Filippo
    Trivelli, Leonello
    Bonaccorsi, Andrea
    Fantoni, Gualtiero
    COMPUTERS IN INDUSTRY, 2018, 100 : 244 - 257
  • [28] Experimental data for computing semantic similarity between concepts using multiple inheritances in Wikipedia category graph
    Hussain, Muhammad Jawad
    Wasti, Shahbaz Hassan
    Huang, Guangjian
    Jiang, Yuncheng
    DATA IN BRIEF, 2020, 30
  • [29] Attribute Extracting from Wikipedia Pages in Domain Automatically
    Su, Fenglong
    Rong, Chuanzhen
    Huang, Qingquan
    Qiu, Jiyuan
    Shao, Xinhong
    Yue, Zhenjun
    Xie, Qinghua
    INFORMATION TECHNOLOGY AND INTELLIGENT TRANSPORTATION SYSTEMS, VOL 2, 2017, 455 : 433 - 440
  • [30] Extracting Ontologies from Arabic Wikipedia: A Linguistic Approach
    Al-Rajebah, Nora I.
    Al-Khalifa, Hend S.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2014, 39 (04) : 2749 - 2771