Measurement, selection, and visualization of association rules: A compositional data perspective
A Compositional Data perspective on Association Rules

Cited by: 2
Authors
Vives-Mestres, Marina [1, 2]
Kenett, Ron S. [3, 4]
Thio-Henestrosa, Santiago [1]
Martin-Fernandez, Josep Antoni [1]
Affiliations
[1] Univ Girona, Dept Comp Sci Appl Math & Stat, POLITECNICA 4, Campus Montilivi, Girona 17003, Spain
[2] Curelator Inc, Clin Stat, 210 Broadway 201, Cambridge, MA 02139 USA
[3] KPA Grp, Raanana, Israel
[4] Samuel Neaman Inst, Raanana, Israel
Keywords
Aitchison geometry; association rule; independence test; measure of interestingness; odds ratio test; simplex representation
DOI
10.1002/qre.2910
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
Association rule mining is a powerful data analytic technique used for extracting information from transaction databases with a collection of itemsets. The aim is to indicate what item goes with what item (i.e., an association rule) in a set of collected transactions. It is extensively used in the analysis of text records and social media. Here we use Compositional Data analysis (CoDa) techniques to generate new visualizations and insights from association rule mining. These CoDa methods show the relationship between itemsets, its strength, and the direction of dependency. Moreover, after expressing each association rule as a contingency table, we discuss two statistical tests that guide the identification of relevant rules by analyzing the relative importance of the elements of the table. As an example, we use these visualizations and statistical tests to investigate the association of negative mood emotions with various types of headache/migraine events. Data for these analyses come from N1-Headache™, a digital platform where individual users record attacks and symptoms as well as their daily exposure to a list of potential factors.
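As a rough illustration of the idea of expressing an association rule as a contingency table, the following minimal Python sketch counts the 2x2 table for a rule A -> B, closes it to relative frequencies (a four-part composition that sums to 1), and computes a log odds ratio with a Wald standard error. It is a sketch under stated assumptions, not the authors' method: the function names `rule_table` and `rule_summary`, the toy transactions, and the choice of a Haldane-Anscombe 0.5 correction are all hypothetical and do not come from the paper, whose specific tests and visualizations are not reproduced here.

```python
import math

def rule_table(transactions, antecedent, consequent):
    """Count the 2x2 contingency table (n11, n10, n01, n00) for A -> B.

    n11: A and B present, n10: A only, n01: B only, n00: neither.
    """
    a, b = set(antecedent), set(consequent)
    n11 = n10 = n01 = n00 = 0
    for t in transactions:
        items = set(t)
        has_a, has_b = a <= items, b <= items
        if has_a and has_b:
            n11 += 1
        elif has_a:
            n10 += 1
        elif has_b:
            n01 += 1
        else:
            n00 += 1
    return n11, n10, n01, n00

def rule_summary(table):
    """Summarize a 2x2 rule table.

    Assumes the antecedent and the consequent each occur at least once,
    otherwise confidence or lift would divide by zero.
    """
    n11, n10, n01, n00 = table
    n = n11 + n10 + n01 + n00
    # Closure: the table as a 4-part composition (parts sum to 1).
    composition = tuple(c / n for c in table)
    support = n11 / n
    confidence = n11 / (n11 + n10)
    lift = confidence / ((n11 + n01) / n)
    # Assumption: add 0.5 to every cell so the log odds ratio stays
    # finite when a cell is zero (Haldane-Anscombe correction).
    c11, c10, c01, c00 = (c + 0.5 for c in table)
    log_odds_ratio = math.log((c11 * c00) / (c10 * c01))
    std_error = math.sqrt(1 / c11 + 1 / c10 + 1 / c01 + 1 / c00)
    return {
        "composition": composition,
        "support": support,
        "confidence": confidence,
        "lift": lift,
        "log_odds_ratio": log_odds_ratio,
        "z": log_odds_ratio / std_error,  # Wald statistic
    }

# Hypothetical toy data, for illustration only.
transactions = [
    {"stress", "migraine"},
    {"stress", "migraine"},
    {"stress"},
    {"migraine"},
    {"sleep"},
    {"sleep"},
    {"stress", "migraine", "sleep"},
    {"sleep"},
]

table = rule_table(transactions, {"stress"}, {"migraine"})
summary = rule_summary(table)
print(table, summary)
```

A positive log odds ratio suggests that antecedent and consequent co-occur more often than independence would predict, and a negative one suggests the opposite. The composition is the kind of object that could be placed on a simplex, although the visualizations themselves are outside the scope of this sketch.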
Pages: 1327-1339
Page count: 13
Related papers
50 in total
  • [31] Using association rules for completing missing data
    Wu, CH
    Wun, CH
    Chou, HJ
    HIS'04: FOURTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, : 236 - 241
  • [32] Finding association rules on heterogeneous genome data
    Satou, K
    Shibayama, G
    Ono, T
    Yamamura, Y
    Furuichi, E
    Kuhara, S
    Takagi, T
    PACIFIC SYMPOSIUM ON BIOCOMPUTING '97, 1996, : 397 - 408
  • [33] Role of sampling in data mining for association rules
    Jeragh, M
    Mehrotra, KG
    IC-AI'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS I-III, 2001, : 483 - 489
  • [34] Fuzzy Association Rules on Data with Undefined Values
    Murinova, Petra
    Pavliska, Viktor
    Burda, Michal
    INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: APPLICATIONS, IPMU 2018, PT III, 2018, 855 : 165 - 174
  • [35] Mining Multilevel Association Rules on RFID data
    Kim, Younghee
    Kim, Ungmo
    2009 FIRST ASIAN CONFERENCE ON INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2009, : 46 - 50
  • [36] Association rules for web data mining in WHOWEDA
    Madria, SK
    Raymond, C
    Bhowmick, S
    Mohania, M
    2000 KYOTO INTERNATIONAL CONFERENCE ON DIGITAL LIBRARIES: RESEARCH AND PRACTICE, PROCEEDINGS, 2000, : 227 - 233
  • [37] Mining association rules in big data with NGEP
    Chen, Yunliang
    Li, Fangyuan
    Fan, Junqing
    CLUSTER COMPUTING, 2015, 18 : 577 - 585
  • [38] Quantitative association rules over incomplete data
    Ng, V
    Lee, J
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 2821 - 2826
  • [39] Constraining and summarizing association rules in medical data
    Ordonez, C
    Ezquerra, N
    Santana, CA
    KNOWLEDGE AND INFORMATION SYSTEMS, 2006, 9 (03) : 259 - 283
  • [40] Summarizing XML data by means of association rules
    Baralis, E
    Garza, P
    Quintarelli, E
    Tanca, L
    CURRENT TRENDS IN DATABASE TECHNOLOGY - EDBT 2004 WORKSHOPS, PROCEEDINGS, 2004, 3268 : 260 - 269