Large datasets visualization with neural network using clustered training data

被引:0
|
作者
Ivanikovas, Sergejus [1 ]
Dzemyda, Gintautas [1 ]
Medvedev, Viktor [1 ]
机构
[1] Inst Math & Informat, LT-08663 Vilnius, Lithuania
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the visualization of large datasets with SAMANN algorithm using clustering methods for initial dataset reduction for the network training. The visualization of multidimensional data is highly important in data mining because recent applications produce large amount of data that need specific means for the knowledge discovery. One of the ways to visualize multidimensional dataset is to project it onto a plane. This paper analyzes the visualization of multidimensional data using feed-forward neural network. We investigate an unsupervised backpropagation algorithm to train a multilayer feed-forward neural network (SAMANN) to perform the Sammon's nonlinear projection. The SAMANN network offers the generalization ability of projecting new data. Previous investigations showed that it is possible to train SAMANN using only a part of analyzed dataset without the loss of accuracy. It is very important to select proper vector subset for the neural network training. One of the ways to construct relevant training subset is to use clustering. This allows to speed up the visualization of large datasets.
引用
收藏
页码:143 / 152
页数:10
相关论文
共 50 条
  • [31] Discovering Trends in Large Datasets Using Neural Networks
    Khosrow Kaikhah
    Sandesh Doddameti
    Applied Intelligence, 2006, 24 : 51 - 60
  • [32] Deep Neural Network Training and Testing Datasets for License Plate Recognition
    Khan, Ishtiaq Rasool
    Alshomrani, Saleh M.
    Khan, Muhammad Murtaza
    Rahardja, Susanto
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (12) : 371 - 379
  • [33] Deep Neural Network Image Fusion Without Using Training Data
    Zhu, L.
    Baturin, P.
    MEDICAL PHYSICS, 2019, 46 (06) : E382 - E382
  • [34] Increasing the accuracy of neural network classification using refined training data
    Kavzoglu, Taskin
    ENVIRONMENTAL MODELLING & SOFTWARE, 2009, 24 (07) : 850 - 858
  • [35] Visualization of Feature Evolution During Convolutional Neural Network Training
    Punjabi, Arjun
    Katsaggelos, Aggelos K.
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 311 - 315
  • [36] Visualization of training sample creation process for artificial neural network
    Mikhailov, A.S.
    Staroverov, B.A.
    Scientific Visualization, 2016, 8 (02): : 85 - 97
  • [38] A space efficient clustered visualization of large graphs
    Huang, Mao Lin
    Nguyen, Quang Vinh
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON IMAGE AND GRAPHICS, 2007, : 920 - +
  • [39] Visualization of Time Series Data Using Clustered Heatmaps and Line Graphs
    Endo, Reika
    Hosobe, Hiroshi
    17TH INTERNATIONAL SYMPOSIUM ON VISUAL INFORMATION COMMUNICATION AND INTERACTION, VINCI 2024, 2024,
  • [40] Using R-Trees for Interactive Visualization of Large Multidimensional Datasets
    Gimenez, Alfredo
    Rosenbaum, Rene
    Hlawitschka, Mario
    Hamann, Bernd
    ADVANCES IN VISUAL COMPUTING, PT II, 2010, 6454 : 554 - 563