Similar document detection using self-organizing maps

被引:0
|
作者
Lensu, Anssi [1 ]
Koikkalainen, Pasi [1 ]
机构
[1] Univ of Jyvaskyla, Jyvaskyla, Finland
关键词
Algorithms - Data reduction - Data structures - Encoding (symbols);
D O I
暂无
中图分类号
学科分类号
摘要
This paper describes how similar free-form textual documents can be matched using the Self-Organizing Maps (SOMs). The analysis chain is made of three parts: first, similar words are located using an alphabet occurrence coding and SOM; second, three-word contexts are clustered using codes obtained from the word SOM to build a context map; and third, whole documents are clustered using codes from the context SOM. Although this work is inspired by the WEBSOM method, it is quite different since our goal was to build a fast system, which is tolerant to the special features of different languages.
引用
收藏
页码:174 / 177
相关论文
共 50 条
  • [31] Wireless localization using self-organizing maps
    Giorgetti, Gianni
    Gupta, Sandeep K. S.
    Manes, Gianfranco
    PROCEEDINGS OF THE SIXTH INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING IN SENSOR NETWORKS, 2007, : 293 - 302
  • [32] Shape indexing using self-organizing maps
    Suganthan, PN
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (04): : 835 - 840
  • [33] Color clustering using self-organizing maps
    Zhang, Xiao-Yu
    Chen, Jiu-Sheng
    Dong, Jian-Kang
    2007 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1-4, PROCEEDINGS, 2007, : 986 - +
  • [34] Organizing spectral image database using Self-Organizing Maps
    Kohonen, O
    Jääskeläinen, T
    Hauta-Kasari, M
    Parkkinen, J
    Miyazawa, K
    JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2005, 49 (04) : 431 - 441
  • [35] A Causal Model Using Self-Organizing Maps
    Chung, Younjin
    Takatsuka, Masahiro
    NEURAL INFORMATION PROCESSING, PT II, 2015, 9490 : 591 - 600
  • [36] SOM of SOMs: Self-organizing map which maps a group of self-organizing maps
    Furukawa, T
    ARTIFICIAL NEURAL NETWORKS: BIOLOGICAL INSPIRATIONS - ICANN 2005, PT 1, PROCEEDINGS, 2005, 3696 : 391 - 396
  • [37] Network Anomaly Detection with Bayesian Self-Organizing Maps
    de la Hoz Franco, Emiro
    Ortiz Garcia, Andres
    Ortega Lopera, Julio
    de la Hoz Correa, Eduardo
    Prieto Espinosa, Alberto
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT I, 2013, 7902 : 530 - +
  • [38] OUTLIER DETECTION IN SELF-ORGANIZING MAPS AND THEIR QUALITY ESTIMATION
    Stefanovic, P.
    Kurasova, O.
    NEURAL NETWORK WORLD, 2018, 28 (02) : 105 - 117
  • [39] Improving the Performance of Self-Organizing Maps for Intrusion Detection
    McElwee, Steven
    Cannady, James
    SOUTHEASTCON 2016, 2016,
  • [40] Integrating contextual information into text document clustering with Self-Organizing Maps
    Pullwitt, D
    Der, R
    ADVANCES IN SELF-ORGANISING MAPS, 2001, : 54 - 60