Hashing Supported Iterative MapReduce Based Scalable SBE Reduct Computation

Cited by: 4
Authors
Divya, U. Venkata [1]
Prasad, P. S. V. S. Sai [2]
Affiliations
[1] Quadrat Insights Pvt Ltd, Hyderabad, India
[2] Univ Hyderabad, Sch Comp & Informat Sci, Hyderabad, India
Keywords
Rough Sets; Reduct; Iterative MapReduce; Apache Spark; Scalable feature selection
DOI
10.1007/978-3-319-72344-0_13
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Feature selection plays a major role in the preprocessing stage of data mining and helps in model construction by identifying relevant features. Rough Sets has emerged in recent years as an important paradigm for feature selection, i.e., finding a Reduct of the conditional attributes in a given data set. The two control strategies for Reduct computation are Sequential Forward Selection (SFS) and Sequential Backward Elimination (SBE). With the objective of scalable feature selection, several MapReduce-based approaches have been proposed in the literature. All of these approaches are SFS based and result in a superset of a Reduct, i.e., one with redundant attributes. Even though SBE approaches result in an exact Reduct, they require a lot of data movement in the shuffle and sort phase of MapReduce. To overcome this problem and to optimize network bandwidth utilization, a novel hashing-supported SBE Reduct algorithm (MRSBER Hash) is proposed in this work and implemented using the iterative MapReduce framework of Apache Spark. Experiments conducted on large benchmark decision systems have empirically established the relevance of the proposed approach for decision systems with a large cardinality of conditional attributes.
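The SBE strategy named in the abstract can be illustrated with a minimal single-machine sketch. It is not the authors' MRSBER Hash algorithm and uses neither MapReduce nor Spark. The function names, the toy decision table, and the use of Python's built-in `hash` as the grouping key are all assumptions made for illustration. The hashed key only hints at the idea of replacing a long attribute-value vector with a compact value, and hash collisions are ignored here.

```python
def positive_region_size(rows, attrs):
    """Count objects whose group (equal values on `attrs`) has a single decision.

    Each row is (cond_0, ..., cond_{n-1}, decision). The group key is a hash of
    the selected attribute values (illustrative assumption; collisions ignored).
    """
    groups = {}  # hashed key -> (set of decisions seen, number of objects)
    for row in rows:
        *cond, decision = row
        key = hash(tuple(cond[a] for a in attrs))
        decisions, count = groups.get(key, (set(), 0))
        decisions.add(decision)
        groups[key] = (decisions, count + 1)
    return sum(count for decisions, count in groups.values() if len(decisions) == 1)


def sbe_reduct(rows, n_attrs):
    """Sequential Backward Elimination: start from all attributes and drop
    each one whose removal leaves the positive region size unchanged."""
    reduct = list(range(n_attrs))
    target = positive_region_size(rows, reduct)
    for a in range(n_attrs):
        candidate = [b for b in reduct if b != a]
        if positive_region_size(rows, candidate) == target:
            reduct = candidate  # attribute `a` is redundant given the rest
    return reduct


# Hypothetical toy decision table: attribute 2 duplicates attribute 0.
rows = [
    (0, 0, 0, "n"),
    (0, 1, 0, "y"),
    (1, 0, 1, "y"),
    (1, 1, 1, "y"),
]
print(sbe_reduct(rows, 3))
```

Because every remaining attribute is re-tested against the full positive region, no redundant attribute survives the loop. This is the property the abstract contrasts with SFS-based approaches.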
Pages: 163-170
Page count: 8
Related papers
50 in total
  • [31] A Scalable Optimization Mechanism for Pairwise Based Discrete Hashing
    Shi, Xiaoshuang
    Xing, Fuyong
    Zhang, Zizhao
    Sapkota, Manish
    Guo, Zhenhua
    Yang, Lin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 1130 - 1142
  • [32] Locality Sensitive Hashing Based Scalable Collaborative Filtering
    Aytekin, Ahmet Maruf
    Aytekin, Tevfik
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 1030 - 1033
  • [33] Research on MapReduce Based Incremental Iterative Model and Framework
    Song, Jie
    Guo, Chaopeng
    Zhang, Yichuan
    Zhu, Zhiliang
    Yu, Ge
    IETE JOURNAL OF RESEARCH, 2015, 61 (01) : 32 - 40
  • [34] Research And Implementation of Iterative MapReduce Based On BP Algorithm
    Yang, Yu
    Zhang, Longjun
    PROCEEDINGS OF THE 2016 3RD INTERNATIONAL CONFERENCE ON MATERIALS ENGINEERING, MANUFACTURING TECHNOLOGY AND CONTROL, 2016, 67 : 507 - 510
  • [35] An Iterative MapReduce Based Frequent Subgraph Mining Algorithm
    Bhuiyan, Mansurul A.
    Al Hasan, Mohammad
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (03) : 608 - 620
  • [36] Design and implementation of algorithm Apriori based on iterative MapReduce
    Ji, G. (glji@njnu.edu.cn), 1600, Huazhong University of Science and Technology (40):
  • [37] Iterative computation of rate-distortion bounds for scalable source coding
    Tuncel, E
    Rose, K
    2000 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, PROCEEDINGS, 2000, : 234 - 234
  • [38] MapReduce based computation of the diffusion method in recommender systems
    Peng F.
    You J.
    Zeng X.
    Deng H.
    Peng, Fei (pengf@dsp.ac.cn), 1600, Inst. of Scientific and Technical Information of China (22): : 288 - 296
  • [39] Knowledge Extraction from Big Data using MapReduce-based Parallel-Reduct Algorithm
    Chowdhury, Tapan
    Chakraborty, Susanta
    Setua, S. K.
    PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2016, : 240 - 246
  • [40] MapReduce based computation of the diffusion method in recommender systems
    彭飞
    You Jiali
    Zeng Xuewen
    Deng Haojiang
    High Technology Letters, 2016, 22 (03) : 288 - 296