Extracting Threshold Conceptual Structures from Web Documents

Cited by: 3
Authors
Ciobanu, Gabriel [1]
Horne, Ross [1]
Vaideanu, Cristian [2]
Affiliations
[1] Romanian Acad, Inst Comp Sci, Iasi, Romania
[2] AI Cuza Univ Ia, Fac Math, Iasi, Romania
DOI
10.1007/978-3-319-08389-6_12
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we describe an iterative approach based on formal concept analysis to refine the information retrieval process. Based on weights for ranking documents, we define a weighted formal context. We use a Galois connection to introduce a new type of formal concept that allows us to work with specific thresholds when searching for words in Web documents. By increasing the threshold, we obtain smaller lattices with more relevant concepts, thus improving the retrieval of more specific items. We use techniques for processing large data sets in parallel to generate sequences of Galois lattices, which avoids the time cost of building a single lattice for an entire large context.
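The thresholding idea in the abstract can be illustrated with a minimal sketch. This record does not give the paper's exact definitions, so the code assumes one plausible reading: a word counts as present in a document when its weight is at least the threshold `t`, and the two derivation maps are then the usual ones from formal concept analysis. The data, and the names `WEIGHTS`, `extent`, `intent`, and `concepts`, are hypothetical and chosen only for illustration.

```python
from itertools import combinations

# Hypothetical toy data: WEIGHTS[doc][word] is a ranking weight in [0, 1].
WEIGHTS = {
    "d1": {"lattice": 0.9, "galois": 0.7, "web": 0.2},
    "d2": {"lattice": 0.6, "galois": 0.1, "web": 0.8},
    "d3": {"lattice": 0.3, "galois": 0.8, "web": 0.9},
}
WORDS = sorted({w for row in WEIGHTS.values() for w in row})


def extent(words, t):
    """Documents in which every given word has weight >= t (assumed rule)."""
    return frozenset(
        d for d, row in WEIGHTS.items()
        if all(row.get(w, 0.0) >= t for w in words)
    )


def intent(docs, t):
    """Words whose weight is >= t in every given document."""
    return frozenset(
        w for w in WORDS
        if all(WEIGHTS[d].get(w, 0.0) >= t for d in docs)
    )


def concepts(t):
    """Enumerate all (extent, intent) pairs at threshold t.

    Brute force over word subsets: fine for a toy context, exponential
    in general, and not the parallel method the abstract refers to.
    """
    found = set()
    for k in range(len(WORDS) + 1):
        for subset in combinations(WORDS, k):
            docs = extent(subset, t)
            found.add((docs, intent(docs, t)))
    return found


if __name__ == "__main__":
    for t in (0.5, 0.8):
        print(t, len(concepts(t)))
```

On this toy data, raising the threshold from 0.5 to 0.8 reduces the number of concepts, which matches the behaviour the abstract describes. Other weighted contexts may behave differently under this simplified rule.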
Pages: 130-144
Page count: 15
Related papers
50 in total
  • [1] Extracting conceptual relationships from specialized documents
    Hui, B
    Yu, E
    DATA & KNOWLEDGE ENGINEERING, 2005, 54 (01) : 29 - 55
  • [2] Extracting conceptual relationships from specialized documents
    Hui, B
    Yu, E
    CONCEPTUAL MODELING - ER 2002, 2002, 2503 : 232 - 246
  • [3] Extracting Conceptual Feature Structures from Text
    Andreasen, Troels
    Bulskov, Henrik
    Jensen, Per Anker
    Lassen, Tine
    FOUNDATIONS OF INTELLIGENT SYSTEMS, 2011, 6804 : 396 - 406
  • [4] SNExtractor: A Prototype for Extracting Semantic Networks from Web Documents
    Zhang, Chi
    Wang, Yanhua
    Wang, Chengyu
    Cheng, Wenliang
    He, Xiaofeng
    WEB-AGE INFORMATION MANAGEMENT, PT II, 2016, 9659 : 527 - 530
  • [5] Extracting Visually Presented Element Relationships from Web Documents
    Burget, Radek
    Smrz, Pavel
    INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2013, 7 (02) : 13 - 29
  • [6] Extracting instances of relations from Web documents using redundancy
    de Boer, Viktor
    van Someren, Maarten
    Wielinga, Bob J.
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, 2006, 4011 : 245 - 258
  • [7] Extracting structures of HTML documents
    Lim, SJ
    Ng, YK
    TWELFTH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN-12), PROCEEDINGS, 1998, : 420 - 426
  • [8] Extracting the Latent Hierarchical Structure of Web Documents
    El-Shayeb, Michael A.
    El-Beltagy, Samhaa R.
    Rafea, Ahmed
    ADVANCED INTERNET BASED SYSTEMS AND APPLICATIONS, 2009, 4879 : 305 - +
  • [9] Extracting Relations from Chinese Web Documents Using Kernel Methods
    Qiu, Jing
    Liao, Lejian
    PROCEEDINGS OF THE 8TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, 2009, : 352 - 356
  • [10] Discovering conceptual web-knowledge in web documents
    Yoo, SY
    Hoffmann, A
    ENGINEERING KNOWLEDGE IN THE AGE OF THE SEMANTIC WEB, PROCEEDINGS, 2004, 3257 : 504 - 505