ASSOCIATION-RULES-BASED DATA IMPUTATION WITH SPARK

被引:0
|
作者
Qu, Zhaowei [1 ]
Yan, Jianru [1 ]
Yin, Sixing [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
关键词
Association rules; Data preprocessing; Spark; Distributed algorithm;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to technical bottlenecks and errors caused by artificial operation, the problem of incomplete data always exists in big data research. Traditional data imputation algorithms incur high complexity and the accuracy cannot reach the desired level. At the same time, analysis and computation involved in mass data makes limitation of traditional algorithms and computing platform more noticeable. In this paper, we propose a data imputation method based on Apriori algorithm, and implement the corresponding algorithm on the distributed computing system built with Spark, The experimental results show that the proposed algorithm outperforms a traditional data imputation algorithm in terms of efficiency and accuracy.
引用
收藏
页码:145 / 149
页数:5
相关论文
共 50 条
  • [1] Association-Rules-Based Recommender System for Personalization in Adaptive Web-Based Applications
    Mican, Daniel
    Tomai, Nicolae
    CURRENT TRENDS IN WEB ENGINEERING, 2010, 6385s : 85 - 90
  • [2] Spark solutions for discovering fuzzy association rules in Big Data
    Fernandez-Basso, Carlos
    Dolores Ruiz, M.
    Martin-Bautista, Maria J.
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2021, 137 : 94 - 112
  • [3] Research on Optimization of Association Rules Algorithm Based on Spark
    Li, Chengang
    Liu, Yu
    Li, Zeng
    2019 2ND INTERNATIONAL CONFERENCE ON MECHANICAL, ELECTRONIC AND ENGINEERING TECHNOLOGY (MEET 2019), 2019, : 460 - 465
  • [4] A Multidimensional Time-series Association Rules Algorithm based on Spark
    Liu, DongYue
    Wu, Bin
    Gu, Chao
    Ma, Yan
    Wang, Bai
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017,
  • [5] Fuzzy Association Rules Mining Using Spark
    Fernandez-Bassso, Carlos
    Dolores Ruiz, M.
    Martin-Bautista, Maria J.
    INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: THEORY AND FOUNDATIONS, PT II, 2018, 854 : 15 - 25
  • [6] A data cleaning method based on association rules
    Wei, Weijie
    Zhang, Mingwei
    Zhang, Bin
    Tang, Xiaochun
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE 2007), 2007,
  • [7] APPLICATION OF ASSOCIATION RULES IN MISSING VALUES IMPUTATION IN CATEGORICAL DATASETS
    Kaiser, Jiri
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MODELLING AND SIMULATION 2010 IN PRAGUE (MS'10 PRAGUE), 2010, : 203 - 206
  • [8] Spark based Parallel Frequent Pattern Rules for Social Media Data Analytics
    Chaturvedi, Shubhangi
    Saritha, Sri Khetwat
    Chaturvedi, Animesh
    2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING WORKSHOPS, CCGRIDW, 2023, : 168 - 175
  • [9] Approach of organization data based on mining of association rules
    Kong, Lingfu
    Wang, Han
    Lian, Qiusheng
    Jisuanji Gongcheng/Computer Engineering, 2006, 32 (21): : 12 - 14
  • [10] Data mining technology based on association rules algorithm
    Zhang, Guihong
    Liu, Caiming
    Tao, Men
    International Journal of Mechatronics and Applied Mechanics, 2019, 2019 (05): : 106 - 112