Large datasets visualization with neural network using clustered training data

被引:0
|
作者
Ivanikovas, Sergejus [1 ]
Dzemyda, Gintautas [1 ]
Medvedev, Viktor [1 ]
机构
[1] Inst Math & Informat, LT-08663 Vilnius, Lithuania
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the visualization of large datasets with SAMANN algorithm using clustering methods for initial dataset reduction for the network training. The visualization of multidimensional data is highly important in data mining because recent applications produce large amount of data that need specific means for the knowledge discovery. One of the ways to visualize multidimensional dataset is to project it onto a plane. This paper analyzes the visualization of multidimensional data using feed-forward neural network. We investigate an unsupervised backpropagation algorithm to train a multilayer feed-forward neural network (SAMANN) to perform the Sammon's nonlinear projection. The SAMANN network offers the generalization ability of projecting new data. Previous investigations showed that it is possible to train SAMANN using only a part of analyzed dataset without the loss of accuracy. It is very important to select proper vector subset for the neural network training. One of the ways to construct relevant training subset is to use clustering. This allows to speed up the visualization of large datasets.
引用
收藏
页码:143 / 152
页数:10
相关论文
共 50 条
  • [21] Using data tools and data visualization to interpret multifactorial flavour datasets
    Taylor, Andrew
    Mottram, Donald
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 254
  • [22] Dynamic PET Image Denoising Using Deep Convolutional Neural Network Without Training Datasets
    Hashimoto, Fumio
    Ote, Kibo
    Tsukada, Hideo
    JOURNAL OF NUCLEAR MEDICINE, 2019, 60
  • [23] Data Visualization Classification Using Simple Convolutional Neural Network Model Original
    Bajic, Filip
    Job, Josip
    Nenadic, Kresimir
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2020, 11 (01) : 43 - 51
  • [24] Methods for the visualization of clustered climate data
    Thomas Nocke
    Heidrun Schumann
    Uwe Böhm
    Computational Statistics, 2004, 19 : 75 - 94
  • [25] Learning Semantic Features for Classifying Very Large Image Datasets Using Convolution Neural Network
    Rao A.S.
    Mahantesh K.
    SN Computer Science, 2021, 2 (3)
  • [26] Visualization of large astrophysical simulations datasets
    Pomarède, Daniel
    Audit, Edouard
    Teyssier, Romain
    Thooris, Bruno
    COMPUTER PHYSICS COMMUNICATIONS, 2007, 177 (1-2) : 263 - 263
  • [27] The importance of locality in the visualization of large datasets
    Brooke, J. M.
    Marsh, J.
    Pettifer, S.
    Sastry, L. S.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2007, 19 (02): : 195 - 205
  • [28] Alternative visualization of large geospatial datasets
    Koua, EL
    Kraak, MJ
    CARTOGRAPHIC JOURNAL, 2004, 41 (03): : 217 - 228
  • [29] Methods for the visualization of clustered climate data
    Nocke, T
    Schumann, H
    Böhm, U
    COMPUTATIONAL STATISTICS, 2004, 19 (01) : 75 - 94
  • [30] Discovering trends in large datasets using neural networks
    Kaikhah, K
    Doddameti, S
    APPLIED INTELLIGENCE, 2006, 24 (01) : 51 - 60