Similar document detection using self-organizing maps

被引:0
|
作者
Lensu, Anssi [1 ]
Koikkalainen, Pasi [1 ]
机构
[1] Univ of Jyvaskyla, Jyvaskyla, Finland
关键词
Algorithms - Data reduction - Data structures - Encoding (symbols);
D O I
暂无
中图分类号
学科分类号
摘要
This paper describes how similar free-form textual documents can be matched using the Self-Organizing Maps (SOMs). The analysis chain is made of three parts: first, similar words are located using an alphabet occurrence coding and SOM; second, three-word contexts are clustered using codes obtained from the word SOM to build a context map; and third, whole documents are clustered using codes from the context SOM. Although this work is inspired by the WEBSOM method, it is quite different since our goal was to build a fast system, which is tolerant to the special features of different languages.
引用
收藏
页码:174 / 177
相关论文
共 50 条
  • [1] Document classification with self-organizing maps
    Merkl, D
    KOHONEN MAPS, 1999, : 183 - 195
  • [2] Novelty detection using Self-Organizing Maps
    Ypma, A
    Duin, RPW
    PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998, : 1322 - 1325
  • [3] Multilingual document mining and navigation using self-organizing maps
    Yang, Hsin-Chang
    Hsiao, Han-Wei
    Lee, Chung-Hong
    INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (05) : 647 - 666
  • [4] Self-organizing maps of massive document collections
    Kohonen, T
    IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL II, 2000, : 3 - 9
  • [5] WEBSOM - Self-organizing maps of document collections
    Kaski, S
    Honkela, T
    Lagus, K
    Kohonen, T
    NEUROCOMPUTING, 1998, 21 (1-3) : 101 - 117
  • [6] Intrusion detection using Emergent Self-Organizing Maps
    Mitrokotsa, Aikaterini
    Douligeris, Christos
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 3955 : 559 - 562
  • [7] Intrusion Detection System using Self-Organizing Maps
    Alsulaiman, Mansour M.
    Alyahya, Aasem N.
    Alkharboush, Raed A.
    Alghafis, Nasser S.
    NSS: 2009 3RD INTERNATIONAL CONFERENCE ON NETWORK AND SYSTEM SECURITY, 2009, : 397 - +
  • [8] XML document mining using contextual self-organizing maps for structures
    Kc, M.
    Hagenbuchner, M.
    Tsoi, A. C.
    Scarselli, F.
    Sperduti, A.
    Gori, M.
    COMPARATIVE EVALUATION OF XML INFORMATION RETRIEVAL SYSTEMS, 2007, 4518 : 510 - 524
  • [9] Self-organizing maps for outlier detection
    Munoz, A
    Muruzabal, J
    NEUROCOMPUTING, 1998, 18 (1-3) : 33 - 60
  • [10] Exploration of large document collections by self-organizing maps
    Kohonen, T
    SIXTH SCANDINAVIAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 1997, 40 : 5 - 7