Rapid and optimized parallel attribute reduction based on neighborhood rough sets and MapReduce

Cited by: 4
Authors
Hanuman, V. K. [1 ]
Chebrolu, Srilatha [1 ]
Affiliations
[1] Natl Inst Technol Andhra Pradesh, Dept Comp Sci & Engn, Tadepalligudem 534101, Andhra Pradesh, India
Keywords
Attribute reduction; Neighborhood rough sets; MapReduce; Neighborhood information; Data preprocessing; Computational complexity; High-dimensional data; ALGORITHM; EFFICIENT;
DOI
10.1016/j.eswa.2024.125323
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Attribute reduction is a crucial step in data pre-processing and feature engineering. It selects a subset of relevant data attributes to reduce the computational complexity of machine learning models and improve their performance. Neighborhood rough set (NRS) theory provides a valuable framework for attribute reduction. It leverages neighborhood information to identify non-redundant and informative attributes for data analysis and machine learning tasks. Attribute subsets selected with NRS theory are of high quality and yield good prediction accuracy in Euclidean space. However, existing NRS-based solutions are resource-intensive because of the large search space required for finding neighborhoods and because of redundant computations. To overcome these limitations, we propose the rapid and optimized attribute reduction (ROAR) algorithm, which optimizes the current state-of-the-art attribute-reduction method in NRS theory. The strength of ROAR lies in its ability to accelerate computations by rapidly determining the neighborhood consistency of data samples and consequently expediting the identification of both positive and boundary regions. This efficiency significantly reduces the overall processing time of data analysis tasks. Experimental results on 12 standard datasets demonstrate that the ROAR algorithm is highly efficient, obtaining accurate reduction results with rapid response times. To ensure that the ROAR algorithm is suitable for high-dimensional datasets, we provide a parallel implementation, namely, the P-ROAR algorithm. The P-ROAR algorithm is the first parallel attribute-reduction algorithm in the classical NRS theory. Computational speed and scalability metrics show that P-ROAR is much faster and more scalable on datasets with an enormous attribute space. These algorithms provide a tool for handling feature reduction in data engineering without compromising accuracy and performance.
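The abstract refers to neighborhood consistency and the positive region without defining them. The following is a minimal sketch of the textbook NRS notions, not the ROAR or P-ROAR algorithm itself, whose details are not given in this record. It assumes Euclidean distance, a fixed neighborhood radius `delta`, and a simple greedy forward selection; the function names `positive_region`, `dependency`, and `greedy_reduct` are hypothetical and chosen only for illustration.

```python
import numpy as np

def positive_region(X, y, attrs, delta):
    """Indices of samples whose delta-neighborhood (over attrs) has one decision class."""
    sub = X[:, attrs]
    # Pairwise Euclidean distances restricted to the chosen attributes.
    diff = sub[:, None, :] - sub[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=2))
    neighbors = dist <= delta
    same_class = y[:, None] == y[None, :]
    # A sample is consistent if every neighbor shares its decision class.
    consistent = np.all(~neighbors | same_class, axis=1)
    return np.flatnonzero(consistent)

def dependency(X, y, attrs, delta):
    """Fraction of samples that fall in the positive region."""
    return len(positive_region(X, y, attrs, delta)) / len(y)

def greedy_reduct(X, y, delta):
    """Greedy forward selection (illustrative assumption, not the paper's method)."""
    remaining = list(range(X.shape[1]))
    selected, best = [], 0.0
    target = dependency(X, y, remaining, delta)
    while remaining and best < target:
        scored = [(dependency(X, y, selected + [a], delta), a) for a in remaining]
        # Highest dependency wins; ties go to the lowest attribute index.
        score, attr = max(scored, key=lambda t: (t[0], -t[1]))
        if score <= best:
            break  # no single attribute improves the dependency
        selected.append(attr)
        remaining.remove(attr)
        best = score
    return selected

# Toy data: attribute 0 separates the classes, attribute 1 is constant.
X = np.array([[0.0, 0.5], [0.1, 0.5], [0.9, 0.5], [1.0, 0.5]])
y = np.array([0, 0, 1, 1])
print(greedy_reduct(X, y, delta=0.2))  # prints [0]
```

The pairwise distance matrix makes this sketch quadratic in the number of samples, which is the kind of neighborhood-search cost the abstract describes as resource-intensive.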
Pages: 18
Related papers
50 in total
  • [31] Attribute Reduction for Massive Data Based on Rough Set Theory and MapReduce
    Yang, Yong
    Chen, Zhengrong
    Liang, Zhu
    Wang, Guoyin
    ROUGH SET AND KNOWLEDGE TECHNOLOGY (RSKT), 2010, 6401 : 672 - 678
  • [32] MapReduce based parallel attribute reduction in Incomplete Decision Systems
    Sowkuntla, Pandu
    Dunna, Sravya
    Prasad, P. S. V. S. Sai
    KNOWLEDGE-BASED SYSTEMS, 2021, 213
  • [33] Reduction of Neighborhood-Based Generalized Rough Sets
    Wang, Zhaohao
    Shu, Lan
    Ding, Xiuyong
    JOURNAL OF APPLIED MATHEMATICS, 2011
  • [34] Evidence-theory-based numerical algorithms of attribute reduction with neighborhood-covering rough sets
    Chen, Degang
    Li, Wanlu
    Zhang, Xiao
    Kwong, Sam
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2014, 55 (03) : 908 - 923
  • [35] Attribute reduction based on adaptive neighborhood rough sets and three-way pied kingfisher optimizer
    Qiu, Wenjing
    Liu, Caihui
    Lin, Bowen
    Chen, Xiying
    Miao, Duoqian
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 271
  • [36] Class-specific attribute reducts based on neighborhood rough sets
    Zhang, Xianyong
    Fan, Yunrui
    Yao, Yuesong
    Yang, Jilin
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (06) : 7891 - 7910
  • [38] Numerical attribute reduction based on neighborhood granulation and rough approximation
    College of Energy Science and Engineering, Harbin Institute of Technology, Harbin 150001, China
    Ruan Jian Xue Bao, 2008, 3 (640-649):
  • [39] Parallel attribute reduction algorithms using MapReduce
    Qian, Jin
    Miao, Duoqian
    Zhang, Zehua
    Yue, Xiaodong
    INFORMATION SCIENCES, 2014, 279 : 671 - 690
  • [40] Stable Attribute Reduction for Neighborhood Rough Set
    Liang, Shaochen
    Yang, Xibei
    Chen, Xiangjian
    Li, Jingzheng
    FILOMAT, 2018, 32 (05) : 1809 - 1815