Pattern set mining with schema-based constraint

Cited by: 4
Authors
Cagliero, Luca [1]
Chiusano, Silvia [1]
Garza, Paolo [1]
Bruno, Giulia [2]
Affiliations
[1] Politecn Torino, Dipartimento Automat & Informat, I-10129 Turin, Italy
[2] Dipartimento Ingn Gest & Prod, I-10129 Turin, Italy
Keywords
Pattern set mining; Itemset mining; Data mining; Frequent conjunctive queries
DOI
10.1016/j.knosys.2015.04.023
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Pattern set mining entails discovering groups of frequent itemsets that represent potentially relevant knowledge. Global constraints are commonly enforced to focus the analysis on the most interesting pattern sets. However, these constraints evaluate and select each pattern set individually based on its itemset characteristics. This paper extends traditional global constraints by proposing a novel constraint, called the schema-based constraint, tailored to relational data. In relational data, itemsets consist of sets of items belonging to distinct data attributes, which constitute the itemset schema. The schema-based constraint effectively combines all the itemsets that are semantically correlated with each other into a unique pattern set, while filtering out those pattern sets that cover a mixture of different data facets or give only a partial view of a single facet. Specifically, it selects all the pattern sets that are (i) composed only of frequent itemsets with the same schema and (ii) characterized by maximal size among those corresponding to that schema. Since existing approaches are unable to select one representative pattern set per schema in a single extraction, we propose a new Apriori-based algorithm to efficiently mine pattern sets satisfying the schema-based constraint. The experimental results achieved on both real and synthetic datasets demonstrate the efficiency and effectiveness of our approach. © 2015 Elsevier B.V. All rights reserved.
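As a rough illustration of conditions (i) and (ii), the sketch below groups frequent itemsets by schema using naive enumeration. It is not the Apriori-based algorithm proposed in the paper. The function name `mine_schema_pattern_sets`, the absolute support threshold, the toy records, and the reading of "maximal size" as "every frequent itemset sharing the schema" are all assumptions made for this example.

```python
from collections import defaultdict
from itertools import combinations


def mine_schema_pattern_sets(records, min_support):
    """Group frequent itemsets by schema (hypothetical helper, brute force).

    records: list of dicts mapping attribute name -> value (one relational row each).
    min_support: minimum number of rows that must contain an itemset (absolute count).
    Returns a dict: schema (frozenset of attributes) -> set of frequent itemsets.
    """
    # Count every combination of (attribute, value) items found in each row.
    counts = defaultdict(int)
    for record in records:
        items = sorted(record.items())  # fixed order gives stable dictionary keys
        for size in range(1, len(items) + 1):
            for itemset in combinations(items, size):
                counts[itemset] += 1

    # Condition (i): only frequent itemsets, grouped under their common schema.
    # Condition (ii), as assumed here: each group keeps all such itemsets,
    # so it is the largest pattern set available for that schema.
    pattern_sets = defaultdict(set)
    for itemset, count in counts.items():
        if count >= min_support:
            schema = frozenset(attribute for attribute, _ in itemset)
            pattern_sets[schema].add(frozenset(itemset))
    return dict(pattern_sets)


# Toy relational data, invented for illustration only.
records = [
    {"city": "Turin", "job": "engineer"},
    {"city": "Turin", "job": "teacher"},
    {"city": "Milan", "job": "engineer"},
    {"city": "Turin", "job": "engineer"},
    {"city": "Milan", "job": "teacher"},
]
result = mine_schema_pattern_sets(records, min_support=2)

for schema, itemsets in sorted(result.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(schema), len(itemsets))
```

Each key of the returned dictionary is one schema, and its value is the single pattern set retained for that schema. The enumeration is exponential in the number of attributes, which is the cost a level-wise Apriori-style search is meant to avoid.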
Pages: 224-238
Number of pages: 15