Similar document detection using self-organizing maps

被引:0
|
作者
Lensu, Anssi [1 ]
Koikkalainen, Pasi [1 ]
机构
[1] Univ of Jyvaskyla, Jyvaskyla, Finland
关键词
Algorithms - Data reduction - Data structures - Encoding (symbols);
D O I
暂无
中图分类号
学科分类号
摘要
This paper describes how similar free-form textual documents can be matched using the Self-Organizing Maps (SOMs). The analysis chain is made of three parts: first, similar words are located using an alphabet occurrence coding and SOM; second, three-word contexts are clustered using codes obtained from the word SOM to build a context map; and third, whole documents are clustered using codes from the context SOM. Although this work is inspired by the WEBSOM method, it is quite different since our goal was to build a fast system, which is tolerant to the special features of different languages.
引用
收藏
页码:174 / 177
相关论文
共 50 条
  • [21] Visualizing Syscalls using Self-organizing Maps for System Intrusion Detection
    Landauer, Max
    Skopik, Florian
    Wurzenberger, Markus
    Hotwagner, Wolfgang
    Rauber, Andreas
    ICISSP: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY, 2020, : 349 - 360
  • [22] Detection of Fake Followers using Feature Ratio in Self-Organizing Maps
    Simon, Nitin T.
    Elias, Susan
    2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
  • [23] Using Self-Organizing Maps with Learning Classifier System for Intrusion Detection
    Tamee, Kreangsak
    Rojanavasu, Pornthep
    Udomthanapong, Sonchai
    Pinngern, Ouen
    PRICAI 2008: TRENDS IN ARTIFICIAL INTELLIGENCE, 2008, 5351 : 1071 - +
  • [24] Host-based intrusion detection using self-organizing maps
    Lichodzijewski, P
    Zincir-Heywood, AN
    Heywood, MI
    PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 1714 - 1719
  • [25] Attack characterization and intrusion detection using an ensemble of self-organizing maps
    DeLooze, Lori L.
    2006 IEEE Information Assurance Workshop, 2006, : 108 - 115
  • [26] Video segmentation and shot boundary detection using self-organizing maps
    Muurinen, Hannes
    Laaksonen, Jorma
    IMAGE ANALYSIS, PROCEEDINGS, 2007, 4522 : 770 - +
  • [27] Defining similar regions of snow in the Colorado River Basin using self-organizing maps
    Fassnacht, S. R.
    Derry, J. E.
    WATER RESOURCES RESEARCH, 2010, 46
  • [28] Exploring Diseases based Biomedical Document Clustering and Visualization using Self-Organizing Maps
    Shah, Setu
    Luo, Xiao
    2017 IEEE 19TH INTERNATIONAL CONFERENCE ON E-HEALTH NETWORKING, APPLICATIONS AND SERVICES (HEALTHCOM), 2017,
  • [29] Regional analysis using self-organizing maps
    Chudy, L
    Farkas, I
    POLITICKA EKONOMIE, 2000, 48 (05) : 685 - 697
  • [30] Project Management Using Self-Organizing Maps
    Parvizian, Jamshid
    Tarkesh, Named
    Atighehchian, Arezoo
    Farid, Sara
    INDUSTRIAL ENGINEERING AND MANAGEMENT SYSTEMS, 2005, 5 (01): : 23 - 31