Multi-assignment clustering for Boolean data

被引:0
|
作者
UC Berkeley, Computer Science Division, 721 Soda Hall, Berkeley, CA 94720, United States [1 ]
不详 [2 ]
不详 [3 ]
机构
来源
J. Mach. Learn. Res. | / 459-489期
关键词
D O I
暂无
中图分类号
学科分类号
摘要
We propose a probabilistic model for clustering Boolean data where an object can be simultaneously assigned to multiple clusters. By explicitly modeling the underlying generative process that combines the individual source emissions, highly structured data are expressed with substantially fewer clusters compared to single-assignment clustering. As a consequence, such a model provides robust parameter estimators even when the number of samples is low. We extend the model with different noise processes and demonstrate that maximum-likelihood estimation with multiple assignments consistently infers source parameters more accurately than single-assignment clustering. Our model is primarily motivated by the task of role mining for role-based access control, where users of a system are assigned one or more roles. In experiments with real-world access-control data, our model exhibits better generalization performance than state-of-the-art approaches. © 2012 Mario Frank, Andreas P. Streich, David Basin and Joachim M. Buhmann.
引用
收藏
相关论文
共 50 条
  • [1] Multi-Assignment Clustering for Boolean Data
    Frank, Mario
    Streich, Andreas P.
    Basin, David
    Buhmann, Joachim M.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 459 - 489
  • [2] Nonparametric multi-assignment clustering
    Liu, Chien-Liang
    Hsaio, Wen-Hoar
    Chang, Tao-Hsing
    Jou, Tzai-Min
    INTELLIGENT DATA ANALYSIS, 2017, 21 (04) : 893 - 911
  • [3] Multi-assignment clustering: Machine learning from a biological perspective
    Ulfenborg, Benjamin
    Karlsson, Alexander
    Riveiro, Maria
    Andersson, Christian X.
    Sartipy, Peter
    Synnergren, Jane
    JOURNAL OF BIOTECHNOLOGY, 2021, 326 : 1 - 10
  • [5] A semi-supervised learning algorithm for multi-label classification and multi-assignment clustering problems based on a Multivariate Data Analysis
    Gull, Carlos Quintero
    Aguilar, Jose
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
  • [6] Multi-assignment interacting multiple model for tracking microbubbles
    Li, Bing
    Tay, Peter
    Acton, Scott T.
    2005 39th Asilomar Conference on Signals, Systems and Computers, Vols 1 and 2, 2005, : 281 - 284
  • [7] A heuristic algorithm based on multi-assignment procedures for nurse scheduling
    Ademir Aparecido Constantino
    Dario Landa-Silva
    Everton Luiz de Melo
    Candido Ferreira Xavier de Mendonça
    Douglas Baroni Rizzato
    Wesley Romão
    Annals of Operations Research, 2014, 218 : 165 - 183
  • [8] A heuristic algorithm based on multi-assignment procedures for nurse scheduling
    Constantino, Ademir Aparecido
    Landa-Silva, Dario
    de Melo, Everton Luiz
    Xavier de Mendonca, Candido Ferreira
    Rizzato, Douglas Baroni
    Romao, Wesley
    ANNALS OF OPERATIONS RESEARCH, 2014, 218 (01) : 165 - 183
  • [9] Clustering algorithm for Boolean and categorical data
    Liu, H.
    Deng, H.
    Lu, S.
    Huazhong Ligong Daxue Xuebao/Journal Huazhong (Central China) University of Science and Technology, 2001, 29 (03): : 30 - 32
  • [10] Multi-Assignment Single Joins for Parallel Cross-Match of Astronomic Catalogs on Heterogeneous Clusters
    Jia, Xiaoying
    Luo, Qiong
    28TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM) 2016), 2016,