A Web Page Clustering Method Based on Formal Concept Analysis

被引:5
|
作者
Zhang, Zuping [1 ]
Zhao, Jing [1 ]
Yan, Xiping [1 ]
机构
[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410000, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
formal concept analysis; feature weight; cross linked list; concept lattice; Web page clustering;
D O I
10.3390/info9090228
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web page clustering is an important technology for sorting network resources. By extraction and clustering based on the similarity of the Web page, a large amount of information on a Web page can be organized effectively. In this paper, after describing the extraction of Web feature words, calculation methods for the weighting of feature words are studied deeply. Taking Web pages as objects and Web feature words as attributes, a formal context is constructed for using formal concept analysis. An algorithm for constructing a concept lattice based on cross data links was proposed and was successfully applied. This method can be used to cluster the Web pages using the concept lattice hierarchy. Experimental results indicate that the proposed algorithm is better than previous competitors with regard to time consumption and the clustering effect.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Semantic Web search based on rough sets and Fuzzy Formal Concept Analysis
    Formica, Anna
    KNOWLEDGE-BASED SYSTEMS, 2012, 26 : 40 - 47
  • [32] Strategy for mining association rules for web pages based on formal concept analysis
    Du, YaJun
    Li, HaiMing
    APPLIED SOFT COMPUTING, 2010, 10 (03) : 772 - 783
  • [33] Web search using dynamic keyword suggestion based on formal concept analysis
    Kim, BS
    Park, Y
    INFORMATION REUSE AND INTEGRATION, 2001, : 108 - 114
  • [34] Web Page Trojan Detection Method Based on Dynamic Behavior Analysis
    Zhang, Wei-Feng
    Liu, Rui-Cheng
    Xu, Lei
    Ruan Jian Xue Bao/Journal of Software, 2018, 29 (05): : 1410 - 1421
  • [35] Term-based clustering and summarization of Web page collections
    Zhang, YZ
    Zincir-Heywood, N
    Milios, E
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2004, 3060 : 60 - 74
  • [36] A Chinese Web Page Clustering Algorithm Based on the Suffix Tree
    YANG Jian-wu National Key Laboratory for Text Processing
    Wuhan University Journal of Natural Sciences, 2004, (05) : 817 - 822
  • [37] A matrix approach for hierarchical web page clustering based on hyperlinks
    Hou, JY
    Zhang, YC
    WISE 2002: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING (WORKSHOPS), 2002, : 207 - 216
  • [38] Web Page Recommendation Algorithm based on Weighted MFP Clustering
    Xiong Haijun
    Huang Zhiqiang
    ICCSE 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2008, : 1251 - 1253
  • [39] A Clustering Based Scalable Hybrid Approach for Web Page Recommendation
    Sharif, Mohammad Amir
    Raghavan, Vijay V.
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014,
  • [40] A UNIFIED EXTENDING METHOD FOR CONTENT-IGNORANT WEB PAGE CLUSTERING
    Shi Lin Chen Chen(School of Aerospace Science and Engineering
    JournalofElectronics(China), 2010, 27 (01) : 105 - 112