A parameter-less algorithm for tensor co-clustering

Authors
Elena Battaglia
Ruggero G. Pensa
Affiliations
[1] University of Turin,Department of Computer Science
Source
Machine Learning | 2023 / Vol. 112
Keywords
Clustering; Higher-order data; Unsupervised learning
DOI
Not available
Abstract
The majority of the data produced by human activities and modern cyber-physical systems involve complex relations among their features. Such relations can often be represented by means of tensors, which can be viewed as a generalization of matrices and, as such, can be analyzed using higher-order extensions of existing machine learning methods, such as clustering and co-clustering. Tensor co-clustering, in particular, has proven useful in many applications, due to its ability to cope with n-modal data and sparsity. However, setting up a co-clustering algorithm properly requires specifying the desired number of clusters for each mode as input parameters. This choice is already difficult in relatively easy settings, such as flat clustering on data matrices, but on tensors it can be even more frustrating. To address this issue, we propose a new tensor co-clustering algorithm that does not require the number of desired co-clusters as input, as it optimizes an objective function based on a measure of association across discrete random variables (called Goodman and Kruskal's τ) that is not affected by their cardinality. We introduce different optimization schemes and show their theoretical and empirical convergence properties. Additionally, we show the effectiveness of our algorithm on both synthetic and real-world datasets, also in comparison with state-of-the-art co-clustering methods based on tensor factorization and latent block models.
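To make the association measure named in the abstract concrete, the following is a minimal sketch of Goodman and Kruskal's τ for the simple two-variable case, computed from a contingency table of counts. It is not the authors' tensor objective or any of their optimization schemes; the function name `goodman_kruskal_tau` and the list-of-lists table layout are assumptions made for illustration. The measure is the proportional reduction in the error of predicting the column category once the row category is known.

```python
from typing import Sequence


def goodman_kruskal_tau(table: Sequence[Sequence[float]]) -> float:
    """Goodman and Kruskal's tau for predicting the column variable from the row variable.

    Illustrative helper (hypothetical name, not taken from the paper).
    table[i][j] is the number of observations with row category i
    and column category j.
    """
    total = sum(sum(row) for row in table)
    if total <= 0:
        raise ValueError("contingency table must contain at least one observation")

    n_cols = len(table[0])
    # Marginal probabilities of the row and column categories.
    row_p = [sum(row) / total for row in table]
    col_p = [sum(row[j] for row in table) / total for j in range(n_cols)]

    # Expected error when guessing the column category from its marginal alone.
    err_marginal = 1.0 - sum(p * p for p in col_p)
    if err_marginal == 0.0:
        raise ValueError("column variable is constant; tau is undefined")

    # Expected error when guessing it from the conditional distribution
    # given the row category (empty rows contribute nothing).
    err_conditional = 1.0 - sum(
        (cell / total) ** 2 / row_p[i]
        for i, row in enumerate(table)
        if row_p[i] > 0
        for cell in row
    )

    # Proportional reduction in prediction error.
    return (err_marginal - err_conditional) / err_marginal


if __name__ == "__main__":
    print(goodman_kruskal_tau([[10, 0], [0, 10]]))  # perfect association -> 1.0
    print(goodman_kruskal_tau([[5, 5], [5, 5]]))    # independence -> 0.0
```

The value lies between 0 (knowing the row category does not help) and 1 (the row category determines the column category). The measure is asymmetric, so transposing the table generally gives a different value.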
Pages: 385-427
Page count: 42