Research on Feature Selection Based on Hybrid Evolutionary Algorithm

Authors
Gao H.-M. [1]
Wang Y.-H. [2]
Bian C. [1]
Li X.-T. [1]
Affiliations
[1] School of Artificial Intelligence, Jilin University, Changchun, Jilin
[2] School of Artificial Intelligence, Hebei University of Technology, Tianjin
Source
Tien Tzu Hsueh Pao/Acta Electronica Sinica | 2023, Vol. 51, No. 06
Funding
National Natural Science Foundation of China
Keywords
classification; feature selection; gene expression data; local search; new Wrapper hybrid feature selection algorithm; teaching and learning-based optimization algorithm
DOI
10.12263/DZXB.20210399
Abstract
Feature selection (FS) is an effective data pre-processing method that mitigates the curse of dimensionality caused by data redundancy by selecting a subset of features with high relevance and low redundancy from high-dimensional data. Many computational methods have been applied to the FS problem; among them, feature selection models based on the teaching-learning-based optimization (TLBO) algorithm have received increasing attention because of their efficient global search capability. However, as data sizes grow, the limitations of these algorithms, such as model instability, low accuracy, and poor local search ability, have increasingly hindered further research. To address these problems, this paper proposes a hybrid evolutionary Wrapper algorithm model (Teaching and Learning-Based Optimization-Local Search algorithm, TLBOLS) that integrates the teaching-learning optimization algorithm with local search methods. First, the algorithm converts real-valued coding to binary coding in the initialization phase, then introduces a worst-individual restart mechanism in the teaching phase, and uses different values of the teaching factor (TF) for the two roles of learner and teacher during the evolutionary process, yielding a binary teaching-learning feature selection algorithm (Binary Teaching and Learning-Based Optimization-Local Search algorithm, BTLBOLS). Next, a local search method combining multiple operations with variable neighborhood search is proposed to gradually increase the perturbation strength and improve the quality of individuals across the whole population. To optimize the feature selection results, BTLBOLS uses a comprehensive evaluation metric as the objective function to guide the overall evolutionary process.
The algorithm is tested on forty-five high-dimensional cancer gene expression datasets and compared with ten feature selection algorithms. The experimental results show that, compared with the other algorithms, BTLBOLS has advantages in classification accuracy and in the number of selected features, and that it effectively improves classification performance. © 2023 Chinese Institute of Electronics. All rights reserved.
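The abstract does not state the update rule, the real-to-binary conversion, or the form of the comprehensive evaluation metric. As a rough illustration only, the sketch below assumes the standard TLBO teacher-phase update, a sigmoid transfer function for binarization, and a weighted sum of classification error and selected-feature ratio as the objective. All function names, the weight `alpha = 0.9`, and the sample vectors are hypothetical and are not taken from the paper; the worst-individual restart and the local search are not shown.

```python
import math
import random


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def binarize(position, rng):
    """Assumed conversion: select feature i with probability sigmoid(position[i])."""
    return [1 if rng.random() < sigmoid(x) else 0 for x in position]


def teacher_phase_step(learner, teacher, mean, teaching_factor, rng):
    """Standard TLBO teacher-phase move: x + r * (teacher - TF * mean), r in [0, 1)."""
    return [
        x + rng.random() * (t - teaching_factor * m)
        for x, t, m in zip(learner, teacher, mean)
    ]


def fitness(mask, error_rate, alpha=0.9):
    """Assumed objective (lower is better): weighted error plus selected-feature ratio."""
    selected_ratio = sum(mask) / len(mask)
    return alpha * error_rate + (1.0 - alpha) * selected_ratio


rng = random.Random(0)
learner = [0.2, -1.5, 3.0, 0.0]   # hypothetical real-valued individual
teacher = [1.0, -2.0, 2.5, 0.5]   # hypothetical best individual
mean = [0.5, -1.0, 1.0, 0.1]      # hypothetical population mean

moved = teacher_phase_step(learner, teacher, mean, teaching_factor=2, rng=rng)
mask = binarize(moved, rng)
# error_rate would come from a wrapped classifier; 0.1 is a placeholder value.
score = fitness(mask, error_rate=0.1)
```

In a wrapper setting, `error_rate` would be obtained by training and validating a classifier on the features selected by `mask`, so each fitness evaluation costs one model fit. Passing `teaching_factor` as an argument leaves room for the role-dependent TF values that the abstract mentions.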
Pages: 1619-1636
Page count: 17