Rapid and optimized parallel attribute reduction based on neighborhood rough sets and MapReduce

被引:4
|
作者
Hanuman, V. K. [1 ]
Chebrolu, Srilatha [1 ]
机构
[1] Natl Inst Technol Andhra Pradesh, Dept Comp Sci & Engn, Tadepalligudem 534101, Andhra Pradesh, India
关键词
Attribute reduction; Neighborhood rough sets; MapReduce; Neighborhood information; Data preprocessing; Computational complexity; High-dimensional data; ALGORITHM; EFFICIENT;
D O I
10.1016/j.eswa.2024.125323
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Attribute reduction is a crucial step in data pre-processing and feature engineering. It is the selection of a subset of relevant data attributes to reduce the computational complexity of machine learning models and improve their performance. Neighborhood rough set (NRS) theory provides a valuable framework for attribute reduction. It leverages neighborhood information to identify non-redundant and informative attributes for data analysis and machine learning tasks. Attribute subsets based on NRS theory are highly qualitative, producing effective prediction accuracies in Euclidean space. However, existing NRS-based solutions are resource-intensive because of the large search space required for finding neighborhoods and redundant computations. To overcome these limitations, we propose the rapid and optimized attribute reduction (ROAR) algorithm that optimizes the current state-of-the-art attribute-reduction method in NRS theory. The strength of ROAR lies in its ability to accelerate computations by rapidly determining the neighborhood consistency of data samples and consequently expediting the identification of both positive and boundary regions. This efficiency significantly enhances the overall processing time for the data analysis tasks. Experimental results on 12 standard datasets demonstrate that the ROAR algorithm exhibits high efficiency by obtaining accurate reduction results with rapid response times. To ensure that the ROAR algorithm is suitable for high-dimensional datasets, we provide a parallel implementation, namely, the P-ROAR algorithm. The P-ROAR algorithm is the first parallel attribute-reduction algorithm in the classical NRS theory. Computational speeds and scalability metrics establish that P-ROAR is much faster and more scalable for datasets with an enormous attribute space. These algorithms provide a tool for handling feature reduction in data engineering without compromising accuracy and performance.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Incremental reduction methods based on granular ball neighborhood rough sets and attribute grouping
    Li, Yan
    Wu, Xiaoxue
    Wang, Xizhao
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 160
  • [22] Attribute reduction based on fuzzy rough sets
    Chen, Degang
    Wang, Xizhao
    Zhao, Suyun
    ROUGH SETS AND INTELLIGENT SYSTEMS PARADIGMS, PROCEEDINGS, 2007, 4585 : 381 - +
  • [23] WalkNAR: A neighborhood rough sets-based attribute reduction approach using random walk
    Li, Haibo
    Xiong, Wuyang
    Li, Yanbin
    Xie, Xiaojun
    APPLIED INTELLIGENCE, 2024, : 7099 - 7117
  • [24] A Neighborhood Rough Sets-Based Attribute Reduction Method Using Lebesgue and Entropy Measures
    Sun, Lin
    Wang, Lanying
    Xu, Jiucheng
    Zhang, Shiguang
    ENTROPY, 2019, 21 (02)
  • [25] Multi-Label Attribute Reduction Based on Neighborhood Multi-Target Rough Sets
    Zheng, Wenbin
    Li, Jinjin
    Liao, Shujiao
    Lin, Yidong
    SYMMETRY-BASEL, 2022, 14 (08):
  • [26] Attribute reduction based on weighted neighborhood constrained fuzzy rough sets induced by grouping functions ☆
    He, Shan
    Qiao, Junsheng
    Jian, Chengxi
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2025, 178
  • [27] MapReduce accelerated attribute reduction based on neighborhood entropy with Apache Spark
    Luo, Chuan
    Cao, Qian
    Li, Tianrui
    Chen, Hongmei
    Wang, Sizhao
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 211
  • [28] Entropy Based Attribute Reduction Algorithms for Rough Sets
    Yan, Hua
    MATERIALS, MECHANICAL ENGINEERING AND MANUFACTURE, PTS 1-3, 2013, 268-270 : 1859 - 1862
  • [29] An attribute reduction algorithm in rough sets based on GA
    Xie, KM
    Cao, JQ
    Xu, XY
    ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, : 1096 - 1099
  • [30] Attribute Reduction Algorithm Based on Rough Vague Sets
    Hu Yaxi
    Chentiejun
    2018 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA), 2018, : 199 - 205