ON EXPONENTIALLY CONSISTENCY OF LINKAGE-BASED HIERARCHICAL CLUSTERING ALGORITHM USING KOLMOGROV-SMIRNOV DISTANCE

Authors
Wang, Tiexing [1]
Liu, Yang [1]
Chen, Biao [1]
Affiliation
[1] Syracuse Univ, Dept EECS, Syracuse, NY 13244 USA
Funding
National Science Foundation (USA)
Keywords
Kolmogorov-Smirnov distance; clustering; exponential consistency; probability of error; hierarchical clustering algorithm; EFFICIENT;
DOI
10.1109/icassp40776.2020.9053708
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
This paper analyzes the performance of linkage-based hierarchical agglomerative clustering algorithms for sequence clustering using the Kolmogorov-Smirnov distance. Data sequences are assumed to be generated from unknown continuous distributions. The goal is to group the data sequences whose underlying generative distributions belong to the same cluster, without a priori knowledge of either the underlying distributions or the number of clusters. Upper bounds on the clustering error probability are derived. These bounds establish that the error probability decays exponentially fast as the sequence length goes to infinity, and the resulting bound on the error exponent has a simple form. Tighter upper bounds on the error probability of the single-linkage and complete-linkage algorithms are derived by taking advantage of the simplified metric updating in these two special cases. Simulation results are provided to validate the analysis.
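The setting described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `ks_distance` and `linkage_cluster`, the threshold-based stopping rule, and the threshold value 0.4 are all assumptions made for illustration, since the abstract specifies only that the Kolmogorov-Smirnov distance and single or complete linkage are used and that the number of clusters is unknown.

```python
import numpy as np


def ks_distance(x, y):
    """Kolmogorov-Smirnov distance between the empirical CDFs of two sequences."""
    x = np.sort(np.asarray(x, dtype=float))
    y = np.sort(np.asarray(y, dtype=float))
    # The supremum of |F_x - F_y| is attained at one of the sample points.
    grid = np.concatenate([x, y])
    fx = np.searchsorted(x, grid, side="right") / x.size
    fy = np.searchsorted(y, grid, side="right") / y.size
    return float(np.max(np.abs(fx - fy)))


def linkage_cluster(seqs, threshold, linkage="single"):
    """Agglomerative clustering of sequences under the KS distance.

    Assumption (not from the source): merging stops once the closest pair
    of clusters is farther apart than `threshold`, so the number of
    clusters does not have to be known in advance.
    """
    n = len(seqs)
    d = np.array([[ks_distance(a, b) for b in seqs] for a in seqs])
    clusters = [[i] for i in range(n)]
    # Single linkage uses the closest pair of members, complete the farthest.
    agg = min if linkage == "single" else max
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                dij = agg(d[p, q] for p in clusters[i] for q in clusters[j])
                if best is None or dij < best[0]:
                    best = (dij, i, j)
        if best[0] > threshold:
            break
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return [sorted(c) for c in clusters]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Three sequences from N(0, 1) and three from N(3, 1), 500 samples each.
    seqs = [rng.normal(0.0, 1.0, 500) for _ in range(3)]
    seqs += [rng.normal(3.0, 1.0, 500) for _ in range(3)]
    print(linkage_cluster(seqs, threshold=0.4, linkage="single"))
```

Longer sequences make the empirical distances within a cluster shrink toward zero while distances between clusters stay bounded away from it, which is the intuition behind an error probability that decays with sequence length.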
Pages: 3997-4001
Number of pages: 5