Synthesizing High-Utility Patterns from Different Data Sources

被引:0
|
作者
Muley, Abhinav [1 ]
Gudadhe, Manish [1 ]
机构
[1] St Vincent Pallotti Coll Engn & Technol, Dept Comp Engn, Nagpur 441108, Maharashtra, India
关键词
data integration; data mining; high-utility patterns; knowledge discovery; weighted model; multi-database mining; distributed data mining;
D O I
10.3390/data3030032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In large organizations, it is often required to collect data from the different geographic branches spread over different locations. Extensive amounts of data may be gathered at the centralized location in order to generate interesting patterns via mono-mining the amassed database. However, it is feasible to mine the useful patterns at the data source itself and forward only these patterns to the centralized company, rather than the entire original database. These patterns also exist in huge numbers, and different sources calculate different utility values for each pattern. This paper proposes a weighted model for aggregating the high-utility patterns from different data sources. The procedure of pattern selection was also proposed to efficiently extract high-utility patterns in our weighted model by discarding low-utility patterns. Meanwhile, the synthesizing model yielded high-utility patterns, unlike association rule mining, in which frequent itemsets are generated by considering each item with equal utility, which is not true in real life applications such as sales transactions. Extensive experiments performed on the datasets with varied characteristics show that the proposed algorithm will be effective for mining very sparse and sparse databases with a huge number of transactions. Our proposed model also outperforms various state-of-the-art distributed models of mining in terms of running time.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Mining of High-Utility Patterns in Big IoT-based Databases
    Jimmy Ming-Tai Wu
    Gautam Srivastava
    Jerry Chun-Wei Lin
    Youcef Djenouri
    Min Wei
    Reza M. Parizi
    Mohammad S. Khan
    Mobile Networks and Applications, 2021, 26 : 216 - 233
  • [22] Mining of high-utility itemsets with negative utility
    Singh, Kuldeep
    Shakya, Harish Kumar
    Singh, Abhimanyu
    Biswas, Bhaskar
    EXPERT SYSTEMS, 2018, 35 (06)
  • [23] Synthesizing heavy association rules from different real data sources
    Adhikari, Animesh
    Rao, P. R.
    PATTERN RECOGNITION LETTERS, 2008, 29 (01) : 59 - 71
  • [24] An Efficient Algorithm for Mining Stable Periodic High-Utility Sequential Patterns
    Xie, Shiyong
    Zhao, Long
    SYMMETRY-BASEL, 2022, 14 (10):
  • [25] Mining Recent High-Utility Patterns from Temporal Databases with Time-Sensitive Constraint
    Gan, Wensheng
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Chao, Han-Chieh
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2016, 2016, 9829 : 3 - 18
  • [26] A Novel Approach for Mining High-Utility Sequential Patterns in Sequence Databases
    Ahmed, Chowdhury Farhan
    Tanbeer, Syed Khairuzzaman
    Jeong, Byeong-Soo
    ETRI JOURNAL, 2010, 32 (05) : 676 - 686
  • [27] Clustering-Based Aggregation of High-Utility Patterns from Unknown Multi-database
    Muley, Abhinav
    Gudadhe, Manish
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2019, 11820 LNCS : 29 - 43
  • [28] Efficient algorithms for mining up-to-date high-utility patterns
    Lin, Jerry Chun-Wei
    Gan, Wensheng
    Hong, Tzung-Pei
    Tseng, Vincent S.
    ADVANCED ENGINEERING INFORMATICS, 2015, 29 (03) : 648 - 661
  • [29] Efficiently Discover Multi-level Maximal High-Utility Patterns from Hierarchical Databases
    Nguyen, Trinh D. D.
    Tung, N. T.
    Nguyen, Loan T. T.
    Bay Vo
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT I, ICCCI 2024, 2024, 14810 : 382 - 393
  • [30] Efficient approach for mining high-utility patterns on incremental databases with dynamic profits
    Kim, Sinyoung
    Kim, Hanju
    Cho, Myungha
    Kim, Hyeonmo
    Vo, Bay
    Lin, Jerry Chun-Wei
    Yun, Unil
    KNOWLEDGE-BASED SYSTEMS, 2023, 282