Visualization techniques for mining large databases: A comparison

Cited by: 158
Authors
Keim, DA
Kriegel, HP
Affiliation
[1] Institute for Computer Science, University of Munich, D-80538 München
Keywords
data mining; explorative data analysis; visualizing large databases; visualizing multidimensional and multivariate data
DOI
10.1109/69.553159
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Visual data mining techniques have proven to be of high value in exploratory data analysis, and they also have a high potential for mining large databases. In this article, we describe and evaluate a new visualization-based approach to mining large databases. The basic idea of our visual data mining techniques is to represent as many data items as possible on the screen at the same time by mapping each data value to a pixel of the screen and arranging the pixels adequately. The major goal of this article is to evaluate our visual data mining techniques and to compare them to other well-known visualization techniques for multidimensional data: the parallel coordinate and stick figure visualization techniques. For the evaluation of visual data mining techniques, the perception of properties of the data counts in the first place, and the CPU time and the number of secondary storage accesses are important only in the second place. In addition to testing the visualization techniques using real data, we developed a testing environment for database visualizations similar to the benchmark approach used for comparing the performance of database systems. The testing environment allows the generation of test data sets with predefined data characteristics, which are important for comparing the perceptual abilities of visual data mining techniques.
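The basic idea stated in the abstract, one pixel per data value, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `values_to_pixel_grid` and `pixel_windows`, the linear grey-level scaling, and the simple row-by-row arrangement are all assumptions made for illustration, and the abstract does not specify which pixel arrangements the article actually uses.

```python
def values_to_pixel_grid(values, width):
    """Map each value of one dimension to one pixel (grey level 0-255).

    Hypothetical helper: pixels are filled row by row, which is only one
    possible arrangement. Assumes `values` is a non-empty list of numbers.
    """
    lo, hi = min(values), max(values)
    span = hi - lo
    # Linear scaling to 0..255; a constant column maps to 0 everywhere.
    levels = [int(round(255 * (v - lo) / span)) if span > 0 else 0
              for v in values]
    # Pad the last row with 0 so the grid is rectangular. Padding pixels
    # are indistinguishable from the minimum value in this simple sketch.
    levels += [0] * (-len(levels) % width)
    return [levels[i:i + width] for i in range(0, len(levels), width)]


def pixel_windows(rows, width):
    """Build one pixel grid per dimension (column) of a table of rows."""
    return [values_to_pixel_grid(list(col), width) for col in zip(*rows)]


# Three data items with two dimensions each.
rows = [[0, 10], [5, 20], [10, 30]]
windows = pixel_windows(rows, width=2)
```

Each entry of `windows` is a small grid in which every cell corresponds to exactly one data value, so the number of items shown is bounded by the number of available pixels rather than by the size of a glyph or line.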
Pages: 923 - 938
Page count: 16
Related papers
50 in total
  • [21] Mining and visualizing the chemical content of large databases
    Villar, Hugo O.
    Hansen, Mark R.
    CURRENT OPINION IN DRUG DISCOVERY & DEVELOPMENT, 2009, 12 (03) : 367 - 375
  • [22] Mining categorical concept hierarchies in large databases
    Chien, BC
    Liao, SY
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING, 2003, : 244 - 249
  • [23] Mining and visualizing the chemical content of large databases
    Villar, Hugo O.
    Hansen, Mark R.
    Hodges, Jason
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2007, 233
  • [24] Incremental mining of sequential patterns in large databases
    Masseglia, F
    Poncelet, P
    Teisseire, M
    DATA & KNOWLEDGE ENGINEERING, 2003, 46 (01) : 97 - 121
  • [25] Visual data mining modeling techniques for the visualization of mining outcomes
    Kopanakis, I
    Theodoulidis, B
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2003, 14 (06) : 543 - 589
  • [26] A new visualization tool for data mining techniques
    Martinez-Martinez, Jose M.
    Escandell-Montero, Pablo
    Soria-Olivas, Emilio
    Martin-Guerrero, Jose D.
    Serrano-Lopez, Antonio J.
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2016, 5 (02) : 137 - 154
  • [27] Mining Protein Databases using Machine Learning Techniques
    Camargo, Renata da Silva
    Niranjan, Mahesan
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2008, 5 (02):
  • [28] WHOLISTIC VISUALIZATION - COMPARISON OF TECHNIQUES
    BEECH, JR
    BULLETIN OF THE BRITISH PSYCHOLOGICAL SOCIETY, 1977, 30 (MAY) : 148 - 149
  • [29] Visualization Techniques for Schedule Comparison
    Huang, Dandan
    Tory, Melanie
    Staub-French, Sheryl
    Pottinger, Rachel
    COMPUTER GRAPHICS FORUM, 2009, 28 (03) : 951 - 958
  • [30] Visualization-Aware Sampling for Very Large Databases
    Park, Yongjoo
    Cafarella, Michael
    Mozafari, Barzan
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 755 - 766