Frequent itemset discovery with SQL using universal quantification

被引:0
|
作者
Rantzau, R [1 ]
机构
[1] Univ Stuttgart, Dept Comp Sci Elect Engn & Informat Technol, D-70569 Stuttgart, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Algorithms for finding frequent itemsets fall into two broad categories: algorithms that are based on non-trivial SQL statements to query and update a database, and algorithms that employ sophisticated in-memory data structures, where the data is stored in flat files. Most performance experiments have shown that SQL-based approaches are inferior to main-memory algorithms. However, the current trend of database vendors to integrate analysis functionalities into their query execution and optimization components, i.e., "closer to the data," suggests to revisit these results and to search for new, potentially better solutions. We investigate approaches based on SQL-92 and present a new approach called Quiver that employs universal and existential quantifications. In the table schema for itemsets of our approach, a group of tuples represents a single itemset. Such a "vertical" layout is similar to the popular layout used for the transaction table, which is the input of frequent itemset discovery. We show that current DBMS do not provide efficient query processing strategies for dealing with quantified queries, mostly due to the lack of an adequate SQL syntax for set containment tests. Performance tests using a query processor prototype and a novel query operator, called set containment division, promise an improved performance for quantified queries like those used for Quiver.
引用
收藏
页码:194 / 213
页数:20
相关论文
共 50 条
  • [31] Mining Frequent Itemset Using Quine-McCluskey Algorithm
    Bajpayee, Kanishka
    Kant, Surya
    Pant, Bhaskar
    Chaudhary, Ankur
    Sharma, Shashi Kumar
    PROCEEDINGS OF FIFTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2015), VOL 2, 2016, 437 : 763 - 769
  • [32] Frequent Itemset Mining using Improved Apriori Algorithm with MapReduce
    Tribhuvan, Seema A.
    Gavai, Nitin R.
    Vasgi, Bharti P.
    2017 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2017,
  • [33] Grafting for combinatorial binary model using frequent itemset mining
    Taito Lee
    Shin Matsushima
    Kenji Yamanishi
    Data Mining and Knowledge Discovery, 2020, 34 : 101 - 123
  • [34] Acceleration of Frequent Itemset Mining on FPGA Using SDAccel and Vivado HLS
    Vinh Dang
    Skadron, Kevin
    2017 IEEE 28TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2017, : 195 - 200
  • [35] A Survey Paper on Frequent Itemset Mining
    Sastry, J. S. V. R. S.
    Suresh, V
    INTERNATIONAL CONFERENCE ON COMPUTER VISION AND MACHINE LEARNING, 2019, 1228
  • [36] Frequent Itemset Mining in Multirelational Databases
    Jimenez, Aida
    Berzal, Fernando
    Cubero, Juan-Carlos
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2009, 5722 : 15 - 24
  • [37] Verified Programs for Frequent Itemset Mining
    Loulergue, Frederic
    Whitney, Christopher D.
    2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 1516 - 1523
  • [38] MAFIA: A maximal frequent itemset algorithm
    Burdick, D
    Calimlim, M
    Flannick, J
    Gehrke, J
    Yiu, TM
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (11) : 1490 - 1504
  • [39] Oracle and Vertica for Frequent Itemset Mining
    Kyurkchiev, Hristo
    Kaloyanova, Kalinka
    DATA MINING AND BIG DATA, DMBD 2016, 2016, 9714 : 77 - 85
  • [40] A hybrid approach to frequent itemset hiding
    Gkoulalas-Divanis, Aris
    Verykios, Vassilios S.
    19TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL I, PROCEEDINGS, 2007, : 297 - 304