Simple classification into large topic ontology of Web documents

Cited by: 0
Authors
Grobelnik, M [1 ]
Mladenic, D [1 ]
Affiliation
[1] Jozef Stefan Inst, Ljubljana 1000, Slovenia
Keywords
classification of documents; topic ontology of Web documents; Web document context; link structure of the Web;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The paper presents an approach to classifying Web documents into a large topic ontology. The main emphasis is on keeping the approach simple enough to handle a large ontology while providing it with enriched data: additional information on the Web page context obtained from the link structure of the Web. The context is generated from the incoming and outgoing links of the Web document to be classified (the target document). That is, a document is represented not only by its own text but also by the text of the documents pointing to it and the text of the documents it points to. The idea is that the enriched data compensates for the simplicity of the approach while keeping it efficient and capable of handling a large topic ontology.
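The enriched representation described in the abstract can be illustrated with a minimal sketch. The abstract does not name the classifier, the term weighting, or the data format, so everything below is an assumption made for illustration: the function names (`tokenize`, `enriched_bag`, `cosine`, `classify`), the raw term counts, the cosine-similarity nearest-topic rule, and the toy topics and texts are all hypothetical and are not taken from the paper.

```python
from collections import Counter
import math
import re


def tokenize(text):
    """Lowercase the text and keep runs of ASCII letters (deliberately simple)."""
    return re.findall(r"[a-z]+", text.lower())


def enriched_bag(target_text, in_link_texts, out_link_texts):
    """Bag of words for the target document plus its link context.

    in_link_texts: texts of documents that point to the target document.
    out_link_texts: texts of documents the target document points to.
    """
    bag = Counter(tokenize(target_text))
    for text in list(in_link_texts) + list(out_link_texts):
        bag.update(tokenize(text))
    return bag


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def classify(bag, topic_bags):
    """Return the topic whose term counts are most similar to the bag.

    Assumption: one term-count vector per topic; the paper may use a
    different model.
    """
    return max(topic_bags, key=lambda topic: cosine(bag, topic_bags[topic]))


# Hypothetical toy ontology with two topics.
topic_bags = {
    "Sports": Counter(tokenize("football match league goal team")),
    "Computers": Counter(tokenize("software programming code compiler")),
}

# The target page itself carries almost no topical text; its neighbours do.
bag = enriched_bag(
    "Home page",
    in_link_texts=["great tutorial on programming and compiler design"],
    out_link_texts=["download the software source code"],
)
predicted = classify(bag, topic_bags)
```

In this toy case the target text alone shares no terms with either topic, so the words contributed by the linked documents are what make a decision possible. This mirrors the trade-off stated in the abstract: a simple model fed with richer data.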
Pages: 201-206
Page count: 6