Large datasets visualization with neural network using clustered training data

被引:0
|
作者
Ivanikovas, Sergejus [1 ]
Dzemyda, Gintautas [1 ]
Medvedev, Viktor [1 ]
机构
[1] Inst Math & Informat, LT-08663 Vilnius, Lithuania
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the visualization of large datasets with SAMANN algorithm using clustering methods for initial dataset reduction for the network training. The visualization of multidimensional data is highly important in data mining because recent applications produce large amount of data that need specific means for the knowledge discovery. One of the ways to visualize multidimensional dataset is to project it onto a plane. This paper analyzes the visualization of multidimensional data using feed-forward neural network. We investigate an unsupervised backpropagation algorithm to train a multilayer feed-forward neural network (SAMANN) to perform the Sammon's nonlinear projection. The SAMANN network offers the generalization ability of projecting new data. Previous investigations showed that it is possible to train SAMANN using only a part of analyzed dataset without the loss of accuracy. It is very important to select proper vector subset for the neural network training. One of the ways to construct relevant training subset is to use clustering. This allows to speed up the visualization of large datasets.
引用
收藏
页码:143 / 152
页数:10
相关论文
共 50 条
  • [1] Neural network-based visualization using clustered data
    Ivanikovas, Sergejus
    Dzemyda, Gintautas
    Medvedev, Viktor
    20TH INTERNATIONAL CONFERENCE, EURO MINI CONFERENCE CONTINUOUS OPTIMIZATION AND KNOWLEDGE-BASED TECHNOLOGIES, EUROPT'2008, 2008, : 335 - 341
  • [2] INFERENCE WITH LARGE CLUSTERED DATASETS
    Mackinnon, James G.
    ACTUALITE ECONOMIQUE, 2016, 92 (04): : 649 - 665
  • [3] Online data visualization using the neural gas network
    Estevez, Pablo A.
    Figueroa, Cristian J.
    NEURAL NETWORKS, 2006, 19 (6-7) : 923 - 934
  • [4] Visualization in Deep Neural Network Training
    Kollias, Stefanos
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2022, 31 (03)
  • [5] Data mining for selective visualization of large spatial datasets
    Shekhar, S
    Lu, CT
    Zhang, PS
    Liu, RL
    14TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, : 41 - 48
  • [6] Synthesis and Visualization of Image Datasets of Parametric 3D Model for Neural Network Training and Testing in Data-Poor Conditions
    Kretinin O.V.
    Popov E.V.
    Tsapaev A.P.
    Fedosova L.O.
    Tyurikov M.I.
    Scientific Visualization, 2021, 13 (05): : 65 - 77
  • [7] Neural-Network-Optimized Vehicle Classification Using Clustered Image and Fiber-Sensor Datasets
    Kamencay, Patrik
    Markovic, Miroslav
    Dubovan, Jozef
    Dado, Milan
    Benedikovic, Daniel
    IEEE ACCESS, 2023, 11 : 41315 - 41324
  • [8] Neural network training with highly incomplete medical datasets
    Chang, Yu-Wei
    Natali, Laura
    Jamialahmadi, Oveis
    Romeo, Stefano
    Pereira, Joana B.
    Volpe, Giovanni
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (03):
  • [9] Content search within large environmental datasets using a convolution neural network
    Freeman, J.
    COMPUTERS & GEOSCIENCES, 2020, 139
  • [10] Retraining the neural network for data visualization
    Medvedev, Viktor
    Dzemyda, Gintautas
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2006, 204 : 27 - +