A PERMUTATION-BASED ALGORITHM FOR BLOCK CLUSTERING

被引:33
|
作者
DUFFY, DE [1 ]
QUIROZ, AJ [1 ]
机构
[1] UNIV SIMON BOLIVAR, CARACAS, VENEZUELA
关键词
BINARY SPLITTING; BLOCK CLUSTERING; MARKOV CHAIN SIMULATION METHOD; PERMUTATION DISTRIBUTION;
D O I
10.1007/BF02616248
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Hartigan (1972) discusses the direct clustering of a matrix of data into homogeneous blocks. He introduces a stepwise divisive method for block clustering within a certain class of block structures which induce clustering trees for both row and column margins. While this class of structures is appealing, the stopping criterion for his method, which is based on asymptotic theory and the assumption that the individual elements of the data matrix are normally distributed, is quite restrictive. In this paper we propose a permutation-based algorithm for block clustering within the same class of block structures. By using permutation arguments to decide where to split and when to stop, our algorithm becomes applicable in a wide variety of cases, including matrices of categorical data and matrices of small-to-moderate size. In addition, our algorithm offers considerable flexibility in how block homogeneity is defined. The algorithm is studied in a series of simulation experiments on matrices of known structure, and illustrated in examples drawn from the fields of taxonomy, political science, and data architecture.
引用
收藏
页码:65 / 91
页数:27
相关论文
共 50 条
  • [31] The Problem Aware Local Search algorithm: an efficient technique for permutation-based problems
    Minetti, Gabriela F.
    Luque, Gabriel
    Alba, Enrique
    SOFT COMPUTING, 2017, 21 (18) : 5193 - 5206
  • [32] The query complexity of a permutation-based variant of Mastermind
    Afshani, Peyman
    Agrawal, Manindra
    Doerr, Benjamin
    Doerr, Carola
    Larsen, Kasper Green
    Mehlhorn, Kurt
    DISCRETE APPLIED MATHEMATICS, 2019, 260 : 28 - 50
  • [33] Farasha: A Provable Permutation-Based Parallelizable PRF
    Aaraj, Najwa
    Bellini, Emanuele
    Jejurikar, Ravindra
    Manzano, Marc
    Rohit, Raghvendra
    Salazar, Eugenio
    SELECTED AREAS IN CRYPTOGRAPHY, SAC 2022, 2024, 13742 : 437 - 458
  • [34] A permutation-based estimator for monotone index models
    Bhattacharya, Debopam
    ECONOMETRIC THEORY, 2008, 24 (03) : 795 - 807
  • [35] Permutation-Based Hashing Beyond the Birthday Bound
    Lefevre, Charlotte
    Mennink, Bart
    IACR TRANSACTIONS ON SYMMETRIC CRYPTOLOGY, 2024, 2024 (01) : 71 - 113
  • [36] Employing GPU architectures for permutation-based indexing
    Krulis, Martin
    Osipyan, Hasmik
    Marchand-Maillet, Stephane
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (09) : 11859 - 11887
  • [37] Runtime Analysis for Permutation-based Evolutionary Algorithms
    Benjamin Doerr
    Yassine Ghannane
    Marouane Ibn Brahim
    Algorithmica, 2024, 86 : 90 - 129
  • [38] A Permutation-Based Kernel Conditional Independence Test
    Doran, Gary
    Muandet, Krikamol
    Zhang, Kun
    Scholkoepf, Bernhard
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2014, : 132 - 141
  • [39] An efficient memetic, permutation-based evolutionary algorithm for real-world train timetabling
    Semet, Y
    Schoenauer, M
    2005 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-3, PROCEEDINGS, 2005, : 2752 - 2759
  • [40] Permutation-based Causal Inference Algorithms with Interventions
    Wang, Yuhao
    Solus, Liam
    Yang, Karren Dai
    Uhler, Caroline
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30