A Review on Web Pages Clustering Techniques

被引:0
|
作者
Patel, Dipak [1 ]
Zaveri, Mukesh [1 ]
机构
[1] Sardar Vallabhbhai Natl Inst Technol, Dept Comp Engn, Surat 395007, Gujarat, India
来源
TRENDS IN NETWORKS AND COMMUNICATIONS | 2011年 / 197卷
关键词
Web page Clustering; Vector Space model; Feature Extractions; Cluster quality;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
World Wide Web (WWW) has become largest source of information. This abundance of information with dynamic and heterogeneous nature of the web makes information retrieval a difficult process for the average user. A technique is required that can help the users to organize, summarize and browse the available information from web with the goal of satisfying their information need effectively. Clustering process organizes the collection of objects into related groups. Web page clustering is the key concept for getting desired information quickly from the massive storage of web pages on WWW. Many researchers have proposed various web document clustering techniques. In this paper, we present detail survey on existing web document clustering techniques along with document representation techniques. We have also described some evaluation measures to evaluate the cluster qualities.
引用
收藏
页码:700 / 710
页数:11
相关论文
共 50 条
  • [1] Clustering Web Pages into Hierarchical Categories
    Yao, Zhongmei
    Choi, Ben
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2007, 3 (02) : 17 - 35
  • [2] Clustering Web pages based on their structure
    Crescenzi, V
    Merialdo, P
    Missier, P
    DATA & KNOWLEDGE ENGINEERING, 2005, 54 (03) : 279 - 299
  • [3] Clustering Web pages into hierarchial categories
    Louisiana Tech University, Ruston, LA, United States
    Int. J. Intell. Inf. Technologies, 2007, 2 (17-35):
  • [4] Block Clustering for Web Pages Categorization
    Charrad, Malika
    Lechevallier, Yves
    ben Ahmed, Mohamed
    Saporta, Gilbert
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, PROCEEDINGS, 2009, 5788 : 260 - +
  • [5] Web pages reordering and clustering based on web patterns
    Kudelka, Milos
    Snasel, Vaclav
    Lehecka, Ondrej
    El-Qawasmeh, Eyas
    Pokorny, Jaroslav
    SOFSEM 2008: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2008, 4910 : 731 - +
  • [6] Applying Hybrid KEPSO Clustering to Web Pages
    Moh, Teng-Sheng
    Sabnis, Ameya
    PROCEEDINGS OF THE 48TH ANNUAL SOUTHEAST REGIONAL CONFERENCE (ACM SE 10), 2010, : 61 - 66
  • [7] Automatic partitioning of web pages using clustering
    Romero, R
    Berger, A
    MOBILE HUMAN-COMPUTER INTERACTION - MOBILEHCI 2004, PROCEEDINGS, 2004, 3160 : 388 - 393
  • [8] Web Clustering based on the Information of Sibling Pages
    Lu, Caimei
    Zhang, Xiaodan
    Park, Jung-ran
    Hu, Xiaohua
    He, Tingting
    2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 480 - +
  • [9] A Hierarchical Algorithm for Clustering Extremist Web Pages
    Qi, Xingqin
    Christensen, Kyle
    Duval, Robert
    Fuller, Edgar
    Spahiu, Arian
    Wu, Qin
    Zhang, Cun-Quan
    2010 INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2010), 2010, : 458 - 463
  • [10] Web pages: Designing with rhetorical techniques
    Dormann, C
    DESIGN OF COMPUTING SYSTEMS: SOCIAL AND ERGONOMIC CONSIDERATIONS, 1997, 21 : 799 - 802