A new approach for fuzzy clustering of web documents

被引:0
|
作者
Friedman, M [1 ]
Last, M [1 ]
Zaafrany, O [1 ]
Schneider, M [1 ]
Kandel, A [1 ]
机构
[1] Nucl Res Ctr Negev, Dept Phys, IL-84190 Beer Sheva, Israel
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing methods of document clustering are based on the classical vector-space model, which represents each document by a fixed-size vector of key terms or key phrases. In large and diverse document collections such as the World Wide Web, this approach suffers from a tremendous computational overload, since the constant size of the term vector equals to the total number of key terms in all documents. We propose a new fuzzy-based approach to clustering documents that are represented by vectors of variable size. Each entry in a vector consists of two fields. The first field is the name of a key phrase in the document and the second denotes an importance weight associated with this key phrase within the particular document. We will describe the proposed approach in detail and show how it is implemented in a real world application from the area of web monitoring.
引用
收藏
页码:377 / 381
页数:5
相关论文
共 50 条
  • [1] Fast fuzzy clustering of Web documents
    Wang, Jian-Hui
    Jiang, Long-Bin
    Yang, Shu
    Chang'an Daxue Xuebao (Ziran Kexue Ban)/Journal of Chang'an University (Natural Science Edition), 2007, 27 (02): : 107 - 110
  • [2] Fuzzy co-clustering of web documents
    William-Chandra, T
    Chen, L
    2005 INTERNATIONAL CONFERENCE ON CYBERWORLDS, PROCEEDINGS, 2005, : 545 - 551
  • [3] Discovering Latent Semantics in Web Documents Using Fuzzy Clustering
    Chiang, I-Jen
    Liu, Charles Chih-Ho
    Tsai, Yi-Hsin
    Kumar, Ajit
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2015, 23 (06) : 2122 - 2134
  • [4] A new approach on search for similar documents with multiple categories using fuzzy clustering
    Saracoglu, Ridvan
    Tuetuencue, Kemal
    Allahverdi, Novruz
    EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (04) : 2545 - 2554
  • [5] Classification of web documents using fuzzy logic categorical data clustering
    Tsekouras, George E.
    Anagnostopoulos, Christos
    Gavalas, Damianos
    Dafri, Economou
    ARTIFICIAL INTELLIGENCE AND INNOVATIONS 2007: FROM THEORY TO APPLICATIONS, 2007, : 93 - +
  • [6] A new approach to fuzzy clustering
    Looney, CG
    COMPUTERS AND THEIR APPLICATIONS, 2000, : 268 - 273
  • [7] Fuzzy multisets and fuzzy clustering of documents
    Miyamoto, S
    10TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3: MEETING THE GRAND CHALLENGE: MACHINES THAT SERVE PEOPLE, 2001, : 1539 - 1542
  • [8] A New Approach for Clustering Variable Length Documents
    Kumar, Niraj
    Srinathan, Kannan
    2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 982 - 987
  • [9] Browser with Clustering of Web Documents
    Tetali, Ravitheja
    Bose, Joy
    Arif, Tasleem
    2013 SECOND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND SECURITY (ADCONS 2013), 2013, : 164 - 168
  • [10] A new unsupervised approach for fuzzy clustering
    Nasibov, Efendi N.
    Ulutagay, Goezde
    FUZZY SETS AND SYSTEMS, 2007, 158 (19) : 2118 - 2133