Hybrid Neural Network Model for Web Document Clustering

被引:1
|
作者
Hemalatha, M. [1 ]
Srinivas, Sathya D. [2 ]
机构
[1] Karpagam Univ, Dept Comp Sci, Coimbatore 641021, Tamil Nadu, India
[2] Karpagam Univ, Dept Comp Appl, Coimbatore 641021, Tamil Nadu, India
关键词
Singular Value Decomposition; Principle component Analysis; Web document Clustering; Multilayer Neural Network;
D O I
10.1109/ICADIWT.2009.5273918
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The popularity of the internet has caused a massive increase in the amount of web pages. The information explosion has led to a growing challenge for information retrieval systems. Document clustering becomes an important process for helping the information retrieval systems organize this vast amount of data. It is believed that grouping similar documents together into clusters will help the users find relevant information quicker, and will allow them to focus their search in the appropriate direction. Feature selection is an important task in data analysis. It is useful to limit redundancy of features, promote comprehensibility, and find clusters (or structures) hidden in high dimensional data. This paper addresses the problems of document mining related with web page clustering and classification, using the Principle component Analysis for Feature Vector Selection. Singular Value Decomposition is used to find the similarity measure and Multi layer neural network used to improve the performance of the clustering algorithm. We illustrate and discuss the system performance by experimental evaluation results.
引用
收藏
页码:531 / +
页数:3
相关论文
共 50 条
  • [41] Multitype features coselection for web document clustering
    Huang, S
    Chen, Z
    Yu, Y
    Ma, WY
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (04) : 448 - 459
  • [42] Incremental document clustering for web page classification
    Wong, WC
    Fu, AWC
    ENABLING SOCIETY WITH INFORMATION TECHNOLOGY, 2002, : 101 - 110
  • [43] A Feature Selection for Korean Web Document Clustering
    Park, Heum
    Kim, Young-Gi
    Kwon, Hyuk-Chul
    IECON 2004: 30TH ANNUAL CONFERENCE OF IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOL 3, 2004, : 2650 - 2654
  • [44] Fuzzy clustering neural network as flood forecasting model
    Chang, FJ
    Chen, YC
    Liang, JM
    NORDIC HYDROLOGY, 2002, 33 (04) : 275 - 290
  • [45] A Hybrid Algorithm for Web Document Clustering Based on Frequent Term Sets and k-Means
    Wang, Le
    Tian, Li
    Jia, Yan
    Han, Weihong
    ADVANCES IN WEB AND NETWORK TECHNOLOGIES, AND INFORMATION MANAGEMENT, PROCEEDINGS, 2007, 4537 : 198 - 203
  • [46] Hybrid model of Air Quality Prediction Using K-Means Clustering and Deep Neural Network
    Ao, Dun
    Cui, Zheng
    Gu, Deyu
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8416 - 8421
  • [47] Efficient phrase-based document indexing for web document clustering
    Hammouda, KM
    Kamel, MS
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (10) : 1279 - 1296
  • [48] Document structure model for survey generation using neural network
    Xu, Huiyan
    Wang, Zhongqing
    Zhang, Yifei
    Weng, Xiaolan
    Wang, Zhijian
    Zhou, Guodong
    FRONTIERS OF COMPUTER SCIENCE, 2021, 15 (04)
  • [49] Document structure model for survey generation using neural network
    Huiyan XU
    Zhongqing WANG
    Yifei ZHANG
    Xiaolan WENG
    Zhijian WANG
    Guodong ZHOU
    Frontiers of Computer Science, 2021, (04) : 68 - 77
  • [50] Document structure model for survey generation using neural network
    Huiyan Xu
    Zhongqing Wang
    Yifei Zhang
    Xiaolan Weng
    Zhijian Wang
    Guodong Zhou
    Frontiers of Computer Science, 2021, 15