Hierarchical conceptual clustering based on quantile method for identifying microscopic details in distributional data

被引:4
|
作者
Umbleja, Kadri [1 ]
Ichino, Manabu [1 ]
Yaguchi, Hiroyuki [1 ]
机构
[1] Tokyo Denki Univ, Saitama, Japan
基金
日本学术振兴会;
关键词
Conceptual clustering; Quantile method; Symbolic data;
D O I
10.1007/s11634-020-00411-w
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Symbolic data is aggregated from bigger traditional datasets in order to hide entry specific details and to enable analysing large amounts of data, like big data, which would otherwise not be possible. Symbolic data may appear in many different but complex forms like intervals and histograms. Identifying patterns and finding similarities between objects is one of the most fundamental tasks of data mining. In order to accurately cluster these sophisticated data types, usual methods are not enough. Throughout the years different approaches have been proposed but they mainly concentrate on the "macroscopic" similarities between objects. Distributional data, for example symbolic data, has been aggregated from sets of large data and thus even the smallest microscopic differences and similarities become extremely important. In this paper a method is proposed for clustering distributional data based on these microscopic similarities by using quantile values. Having multiple points for comparison enables to identify similarities in small sections of distribution while producing more adequate hierarchical concepts. Proposed algorithm, called microscopic hierarchical conceptual clustering, has a monotone property and has been found to produce more adequate conceptual clusters during experimentation. Furthermore, thanks to the usage of quantiles, this algorithm allows us to compare different types of symbolic data easily without any additional complexity.
引用
收藏
页码:407 / 436
页数:30
相关论文
共 50 条
  • [1] Hierarchical conceptual clustering based on quantile method for identifying microscopic details in distributional data
    Kadri Umbleja
    Manabu Ichino
    Hiroyuki Yaguchi
    Advances in Data Analysis and Classification, 2021, 15 : 407 - 436
  • [2] Hierarchical clustering method based on data fields
    Gan, Wen-Yan
    Li, De-Yi
    Wang, Jian-Min
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2006, 34 (02): : 258 - 262
  • [3] Exploratory Analysis of Distributional Data Using the Quantile Method
    Ichino, Manabu
    APPLIEDMATH, 2024, 4 (01): : 261 - 288
  • [4] Graph-based hierarchical conceptual clustering
    Jonyer, I
    Cook, DJ
    Holder, LB
    JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (01) : 19 - 43
  • [5] Hierarchical Distance-Based Conceptual Clustering
    Funes, A.
    Ferri, C.
    Hernandez-Orallo, J.
    Ramirez-Quintana, M. J.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART I, PROCEEDINGS, 2008, 5211 : 349 - +
  • [6] Hierarchical Clustering for Smart Meter Electricity Loads Based on Quantile Autocovariances
    Alonso, Andres M.
    Nogales, Francisco J.
    Ruiz, Carlos
    IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (05) : 4522 - 4530
  • [7] A hierarchical clustering method based on the threshold of semantic feature in big data
    School of Information Science and Engineering, Central South University, Changsha
    410083, China
    不详
    425006, China
    Dianzi Yu Xinxi Xuebao, 12 (2795-2801):
  • [8] Data summarization based fast hierarchical clustering method for large datasets
    Patra, Bidyut Kr.
    Nandi, Sukumar
    Viswanath, P.
    2009 INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT AND ENGINEERING, PROCEEDINGS, 2009, : 278 - +
  • [9] Quantile-regression-based clustering for panel data
    Zhang, Yingying
    Wang, Huixia Judy
    Zhu, Zhongyi
    JOURNAL OF ECONOMETRICS, 2019, 213 (01) : 54 - 67
  • [10] Data clustering and analyzing techniques using hierarchical clustering method
    Hu, Wen
    Pan, Qing He
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (19) : 8495 - 8504