A parameter-less algorithm for tensor co-clustering

被引:0
|
作者
Elena Battaglia
Ruggero G. Pensa
机构
[1] University of Turin,Department of Computer Science
来源
Machine Learning | 2023年 / 112卷
关键词
Clustering; Higher-order data; Unsupervised learning;
D O I
暂无
中图分类号
学科分类号
摘要
The majority of the data produced by human activities and modern cyber-physical systems involve complex relations among their features. Such relations can be often represented by means of tensors, which can be viewed as generalization of matrices and, as such, can be analyzed by using higher-order extensions of existing machine learning methods, such as clustering and co-clustering. Tensor co-clustering, in particular, has been proven useful in many applications, due to its ability of coping with n-modal data and sparsity. However, setting up a co-clustering algorithm properly requires the specification of the desired number of clusters for each mode as input parameters. This choice is already difficult in relatively easy settings, like flat clustering on data matrices, but on tensors it could be even more frustrating. To face this issue, we propose a new tensor co-clustering algorithm that does not require the number of desired co-clusters as input, as it optimizes an objective function based on a measure of association across discrete random variables (called Goodman and Kruskal’s τ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau$$\end{document}) that is not affected by their cardinality. We introduce different optimization schemes and show their theoretical and empirical convergence properties. Additionally, we show the effectiveness of our algorithm on both synthetic and real-world datasets, also in comparison with state-of-the-art co-clustering methods based on tensor factorization and latent block models.
引用
收藏
页码:385 / 427
页数:42
相关论文
共 50 条
  • [1] A parameter-less algorithm for tensor co-clustering
    Battaglia, Elena
    Pensa, Ruggero G.
    MACHINE LEARNING, 2023, 112 (02) : 385 - 427
  • [2] Parameter-Less Tensor Co-clustering
    Battaglia, Elena
    Pensa, Ruggero G.
    DISCOVERY SCIENCE (DS 2019), 2019, 11828 : 205 - 219
  • [3] Parameter-less co-clustering for star-structured heterogeneous data
    Dino Ienco
    Céline Robardet
    Ruggero G. Pensa
    Rosa Meo
    Data Mining and Knowledge Discovery, 2013, 26 : 217 - 254
  • [4] Parameter-less co-clustering for star-structured heterogeneous data
    Ienco, Dino
    Robardet, Celine
    Pensa, Ruggero G.
    Meo, Rosa
    DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 26 (02) : 217 - 254
  • [5] A parameter-less genetic algorithm
    Harik, GR
    Lobo, FG
    GECCO-99: PROCEEDINGS OF THE GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 1999, : 258 - 265
  • [6] An overview of the parameter-less genetic algorithm
    Lobo, FG
    Goldberg, DE
    PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2003, : 20 - 23
  • [7] The parameter-less genetic algorithm in practice
    Lobo, FG
    Goldberg, DE
    INFORMATION SCIENCES, 2004, 167 (1-4) : 217 - 232
  • [8] The Parameter-less Randomized Gravitational Clustering algorithm with online clusters' structure characterization
    Gomez, Jonatan
    Leon, Elizabeth
    Nasraoui, Olfa
    Giraldo, Fabian
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2014, 2 (04) : 217 - 236
  • [9] Parameter-less hierarchical Bayesian optimization algorithm
    Pelikan, Martin
    Hartmann, Alexander
    Lin, Tz-Kai
    PARAMETER SETTING IN EVOLUTIONARY ALGORITHMS, 2007, 54 : 225 - +
  • [10] COMBINATORIAL IMPLEMENTATION OF A PARAMETER-LESS EVOLUTIONARY ALGORITHM
    Papa, Gregor
    ECTA 2011/FCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION THEORY AND APPLICATIONS AND INTERNATIONAL CONFERENCE ON FUZZY COMPUTATION THEORY AND APPLICATIONS, 2011, : 307 - 310