Granular Ball Fuzzy Neighborhood Rough Sets-Based Feature Selection via Multiobjective Mayfly Optimization
Cited by: 2
Authors:
Sun, Lin [1]
Liang, Hanbo [2]
Ding, Weiping [3]
Xu, Jiucheng [2]
Affiliations:
[1] Tianjin Univ Sci & Technol, Coll Artificial Intelligence, Tianjin 300457, Peoples R China
[2] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang 453007, Peoples R China
[3] Nantong Univ, Sch Artificial Intelligence & Comp Sci, Nantong 226019, Peoples R China
Funding:
National Natural Science Foundation of China;
Keywords:
Feature extraction;
Optimization;
Rough sets;
Entropy;
Noise measurement;
Noise;
Uncertainty;
Feature selection;
fuzzy neighborhood;
granular ball;
high-dimensional data classification;
mayfly optimization;
GENETIC ALGORITHM;
DOI:
10.1109/TFUZZ.2024.3440575
Chinese Library Classification:
TP18 [Artificial intelligence theory];
Discipline classification codes:
081104;
0812;
0835;
1405;
Abstract:
Most feature selection models via swarm intelligence optimization have difficulty achieving an optimal global subset of features and are not ideal for classifying high-dimensional data. We study a granular ball fuzzy neighborhood rough sets-based feature selection approach via multiobjective mayfly optimization on high-dimensional datasets. First, to enhance the ability to search for samples in granular balls, the granular ball radius is defined by the standard deviation coefficient. To measure sparse samples with noise in the granular ball, a new fuzzy neighborhood is constructed inside the granular ball, and upper and lower approximations are presented to develop the granular ball fuzzy neighborhood sets model. Second, to estimate the uncertainty of features in granular balls, fuzzy neighborhood entropy is provided. In the process of searching for features in fuzzy neighborhood decision systems, a feature-partitioning strategy based on the average fuzzy neighborhood entropy is studied. A subset of the preselected features is subsequently formed in the first stage. Third, to enhance the diversity in nondominated solutions, the feature vector is decoded into the mayfly, which is optimized through the mesh model. The mayfly ranking strategy updates the mayfly velocity and position to avoid local optima. Thus, in the second stage, the improved multiobjective mayfly optimization strategy can be utilized in selecting the optimal subset of features. Finally, a feature selection scheme is proposed for high-dimensional data with noise. Experimental findings prove that the developed methodology is viable and has excellent classification efficiency on 12 high-dimensional datasets.
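The abstract refers to nondominated solutions in a multiobjective search over feature subsets. As a rough illustration of that one concept only, the sketch below filters candidate subsets down to a Pareto front. It is not the paper's mayfly algorithm: the two objectives (number of selected features and classification error, both minimized), the function names `dominates` and `nondominated`, and the candidate values are all assumptions made for the example.

```python
from typing import List, Sequence, Tuple


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if a is no worse than b on every objective and
    strictly better on at least one (all objectives are minimized)."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def nondominated(points: List[Tuple[float, ...]]) -> List[Tuple[float, ...]]:
    """Keep only the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical candidates: (number of selected features, classification error).
candidates = [(5, 0.10), (3, 0.12), (8, 0.10), (3, 0.20), (10, 0.05)]
front = nondominated(candidates)
print(front)
```

Here (8, 0.10) is dropped because (5, 0.10) matches its error with fewer features, and (3, 0.20) is dropped because (3, 0.12) has lower error at the same size. The remaining points are trade-offs that a multiobjective optimizer would keep as candidate solutions.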
Pages: 6112-6124
Number of pages: 13