Frequent itemset discovery with SQL using universal quantification

Cited by: 0

Authors:

Rantzau, R [1]

Affiliation:

[1] Univ Stuttgart, Dept Comp Sci Elect Engn & Informat Technol, D-70569 Stuttgart, Germany

Source:

DATABASE SUPPORT FOR DATA MINING APPLICATIONS: DISCOVERING KNOWLEDGE WITH INDUCTIVE QUERIES | 2004, Vol. 2682

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Algorithms for finding frequent itemsets fall into two broad categories: algorithms that are based on non-trivial SQL statements to query and update a database, and algorithms that employ sophisticated in-memory data structures, where the data is stored in flat files. Most performance experiments have shown that SQL-based approaches are inferior to main-memory algorithms. However, the current trend of database vendors to integrate analysis functionalities into their query execution and optimization components, i.e., "closer to the data," suggests revisiting these results and searching for new, potentially better solutions. We investigate approaches based on SQL-92 and present a new approach called Quiver that employs universal and existential quantification. In the table schema for itemsets of our approach, a group of tuples represents a single itemset. Such a "vertical" layout is similar to the popular layout used for the transaction table, which is the input of frequent itemset discovery. We show that current DBMSs do not provide efficient query processing strategies for dealing with quantified queries, mostly due to the lack of an adequate SQL syntax for set containment tests. Performance tests using a query processor prototype and a novel query operator, called set containment division, promise improved performance for quantified queries like those used for Quiver.
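To make the idea of a quantified set containment test concrete, the following is a minimal sketch, not the Quiver queries from the paper. It assumes two hypothetical vertical tables, `transactions(tid, item)` and `itemsets(itemset, item)`, and counts for each candidate itemset the transactions that contain all of its items. Because SQL-92 has no direct "for all" syntax, the universal quantification is written as a nested `NOT EXISTS`. The function name `itemset_support` and all table and column names are assumptions made for this illustration; SQLite is used only so the query can be run.

```python
import sqlite3


def itemset_support(transactions, itemsets):
    """Count, per candidate itemset, the transactions containing all its items.

    transactions: iterable of (tid, item) rows (vertical layout, assumed schema).
    itemsets:     iterable of (itemset_id, item) rows (vertical layout, assumed schema).
    Itemsets contained in no transaction do not appear in the result.
    """
    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE transactions (tid INTEGER, item TEXT)")
    con.execute("CREATE TABLE itemsets (itemset INTEGER, item TEXT)")
    con.executemany("INSERT INTO transactions VALUES (?, ?)", transactions)
    con.executemany("INSERT INTO itemsets VALUES (?, ?)", itemsets)

    # "Transaction t contains itemset c" is expressed as:
    # there is NO item of c for which there is NO matching row in t.
    query = """
        SELECT c.itemset, COUNT(*) AS support
        FROM (SELECT DISTINCT itemset FROM itemsets) AS c,
             (SELECT DISTINCT tid FROM transactions) AS t
        WHERE NOT EXISTS (
            SELECT 1 FROM itemsets AS i
            WHERE i.itemset = c.itemset
              AND NOT EXISTS (
                  SELECT 1 FROM transactions AS x
                  WHERE x.tid = t.tid AND x.item = i.item))
        GROUP BY c.itemset
    """
    result = dict(con.execute(query).fetchall())
    con.close()
    return result


if __name__ == "__main__":
    tx = [(1, "a"), (1, "b"), (1, "c"), (2, "a"), (2, "c"), (3, "b")]
    cands = [(10, "a"), (10, "c"), (20, "b"),
             (30, "a"), (30, "b"), (30, "c"), (40, "d")]
    print(itemset_support(tx, cands))
```

For the sample data in the `__main__` block, itemset 10 ({a, c}) is contained in transactions 1 and 2, itemset 20 ({b}) in transactions 1 and 3, itemset 30 ({a, b, c}) only in transaction 1, and itemset 40 ({d}) in none, so the result is `{10: 2, 20: 2, 30: 1}`. The query pairs every candidate with every transaction before filtering, which hints at why a naive evaluation of such quantified queries can be expensive.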


Pages: 194-213

Page count: 20

