A Geometric View of Similarity Measures in Data Mining

被引:5
|
作者
Darvishi, A. [1 ]
Hassanpour, H. [1 ]
机构
[1] Univ Shahrood, Fac Comp Engn, Shahrood, Iran
来源
INTERNATIONAL JOURNAL OF ENGINEERING | 2015年 / 28卷 / 12期
关键词
Data Mining; Feature Extraction; Similarity Measures; Geometric View;
D O I
10.5829/idosi.ije.2015.28.12c.05
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The main objective of data mining is to acquire information from a set of data for prospect applications using a measure. The concerning issue is that one often has to deal with large scale data. Several dimensionality reduction techniques like various feature extraction methods have been developed to resolve the issue. However, the geometric view of the applied measure, as an additional consideration, is generally neglected. Since each measure has its own perspective to the data, different interpretations may achieved on data depending on the used measure. While efforts are often focused on adjusting the feature extraction techniques for mining the data, choosing a suitable measure regarding to the nature or general characteristics of the data or application is more appropriate. Given a couple of sequences, a specific measure may consider them as similar while another one may quantify them as dissimilar. The goal of this research is twofold: evincing the role of feature extraction in data mining and revealing the significance of similarity measures geometric attributes in detecting the relationships between data. Differrent similarity measures are also applied to three synthetic datasets and a real set of ECG time series to examine their performance.
引用
收藏
页码:1728 / 1737
页数:10
相关论文
共 50 条
  • [41] Analysis of spontaneous magnetoencephalography data by similarity measures
    Tretyakov, Alex
    Chen, Zhihua
    Takayasu, Hideki
    Nakasato, Nobukazu
    Physica A: Statistical Mechanics and its Applications, 1999, 270 (03): : 543 - 551
  • [42] Data clustering using efficient similarity measures
    Bisandu, Desmond Bala
    Prasad, Rajesh
    Liman, Musa Muhammad
    JOURNAL OF STATISTICS AND MANAGEMENT SYSTEMS, 2019, 22 (05) : 901 - 922
  • [43] Similarity measures between SAR and optic data
    Shabou, Aymen
    Tupin, Florence
    Chaabane, Ferdaous
    IGARSS: 2007 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-12: SENSING AND UNDERSTANDING OUR PLANET, 2007, : 4858 - 4861
  • [44] Analysis of spontaneous magnetoencephalography data by similarity measures
    Tretyakov, A
    Chen, ZH
    Takayasu, H
    Nakasato, N
    PHYSICA A, 1999, 270 (3-4): : 543 - 551
  • [45] A Survey of Distance / Similarity Measures for Categorical Data
    Alamuri, Madhavi
    Surampudi, Bapi Raju
    Negi, Atul
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 1907 - 1914
  • [46] The geometric framework for exact and similarity querying XML data
    Krátky, M
    Pokorny, J
    Skopal, T
    Snásel, V
    EURASIA-ICT 2002: INFORMATION AND COMMUNICATION TECHNOLOGY, PROCEEDINGS, 2002, 2510 : 35 - 46
  • [47] Geometric data perturbation for privacy preserving outsourced data mining
    Chen, Keke
    Liu, Ling
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 29 (03) : 657 - 695
  • [48] Geometric data perturbation for privacy preserving outsourced data mining
    Keke Chen
    Ling Liu
    Knowledge and Information Systems, 2011, 29 : 657 - 695
  • [49] Evaluation of Similarity Measures for Gene Expression Data and Their Correspondent Combined Measures
    Li, Gang-Guo
    Wang, Zheng-Zhi
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2009, 1 (01) : 72 - 80
  • [50] Evaluation of similarity measures for gene expression data and their correspondent combined measures
    Gang-Guo Li
    Zheng-Zhi Wang
    Interdisciplinary Sciences: Computational Life Sciences, 2009, 1 : 72 - 80