Pattern set mining with schema-based constraint

Cited by: 4
Authors
Cagliero, Luca [1]
Chiusano, Silvia [1]
Garza, Paolo [1]
Bruno, Giulia [2]
Affiliations
[1] Politecnico di Torino, Dipartimento di Automatica e Informatica, I-10129 Turin, Italy
[2] Dipartimento di Ingegneria Gestionale e della Produzione, I-10129 Turin, Italy
Keywords
Pattern set mining; Itemset mining; Data mining; Frequent conjunctive queries
DOI
10.1016/j.knosys.2015.04.023
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Pattern set mining entails discovering groups of frequent itemsets that represent potentially relevant knowledge. Global constraints are commonly enforced to focus the analysis on the most interesting pattern sets. However, these constraints evaluate and select each pattern set individually based on the characteristics of its itemsets. This paper extends traditional global constraints by proposing a novel constraint, called the schema-based constraint, tailored to relational data. In relational data, itemsets consist of sets of items belonging to distinct data attributes, which constitute the itemset schema. The schema-based constraint allows us to effectively combine all the itemsets that are semantically correlated with each other into a single pattern set, while filtering out those pattern sets that cover a mixture of different data facets or give only a partial view of a single facet. Specifically, it selects all the pattern sets that are (i) composed only of frequent itemsets with the same schema and (ii) characterized by maximal size among those corresponding to that schema. Since existing approaches are unable to select one representative pattern set per schema in a single extraction, we propose a new Apriori-based algorithm to efficiently mine pattern sets satisfying the schema-based constraint. The experimental results achieved on both real and synthetic datasets demonstrate the efficiency and effectiveness of our approach. © 2015 Elsevier B.V. All rights reserved.
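The two conditions of the schema-based constraint can be illustrated with a minimal Python sketch. It is not the Apriori-based algorithm proposed in the paper: frequent itemsets are found here by brute-force enumeration, which is exponential in the number of attributes and only suitable for toy data. The function names, the toy records, and the use of an absolute support threshold are all assumptions made for illustration. Grouping every frequent itemset by its schema yields, for each schema, the pattern set of maximal size whose members all share that schema.

```python
from collections import Counter, defaultdict
from itertools import combinations


def frequent_itemsets(records, min_support):
    """Brute-force frequent itemset counting (illustrative, not Apriori).

    Each record is a dict {attribute: value}; an item is an
    (attribute, value) pair, so an itemset taken from one record never
    repeats an attribute. Returns {itemset: absolute support}.
    """
    counts = Counter()
    for record in records:
        items = sorted(record.items())
        for size in range(1, len(items) + 1):
            for combo in combinations(items, size):
                counts[frozenset(combo)] += 1
    return {s: c for s, c in counts.items() if c >= min_support}


def schema_of(itemset):
    """The schema of an itemset is the set of attributes it covers."""
    return frozenset(attribute for attribute, _ in itemset)


def schema_based_pattern_sets(records, min_support):
    """Group frequent itemsets by schema.

    Each group (i) contains only frequent itemsets with one schema and
    (ii) is maximal, because it holds every frequent itemset with it.
    """
    groups = defaultdict(set)
    for itemset in frequent_itemsets(records, min_support):
        groups[schema_of(itemset)].add(itemset)
    return dict(groups)


# Hypothetical toy relation with two attributes.
records = [
    {"city": "Turin", "day": "Mon"},
    {"city": "Turin", "day": "Mon"},
    {"city": "Turin", "day": "Tue"},
    {"city": "Milan", "day": "Tue"},
    {"city": "Milan", "day": "Tue"},
]
pattern_sets = schema_based_pattern_sets(records, min_support=2)
for schema, itemsets in pattern_sets.items():
    print(sorted(schema), len(itemsets))
```

On this toy relation the sketch returns one pattern set per schema ({city}, {day}, and {city, day}); the itemset {city=Turin, day=Tue} is excluded because it occurs only once.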
Pages: 224-238
Number of pages: 15