A Novel Outlier Detection with Feature Selection Enabled Streaming Data Classification

Cited by: 1
Authors
Rajakumar, R. [1 ]
Devi, S. Sathiya [2 ]
Affiliations
[1] Anna Univ, Chennai 600025, India
[2] Univ Coll Engn, Anna Univ, BIT Campus, Trichirappali 620024, India
Source
Keywords
Streaming data classification; outlier removal; feature selection; machine learning; metaheuristics; big data
DOI
10.32604/iasc.2023.028889
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Owing to advances in information technology, massive quantities of data are being produced by social media, smartphones, and sensor devices. The analysis of data streams with machine learning (ML) approaches to address regression, prediction, and classification problems has received considerable interest. At the same time, the detection of anomalies or outliers and the feature selection (FS) process have become important. This study develops an outlier detection with feature selection technique for streaming data classification, named the ODFST-SDC technique. Initially, the streaming data is pre-processed in two ways, namely categorical encoding and null value removal. Next, the Local Correlation Integral (LOCI) is used to detect and remove outliers. A red deer algorithm (RDA) based FS approach is then employed to derive an optimal subset of features. Finally, a kernel extreme learning machine (KELM) classifier is used for streaming data classification. The design of LOCI-based outlier detection and RDA-based FS constitutes the novelty of the work. To assess the classification outcomes of the ODFST-SDC technique, a series of simulations was performed on three benchmark datasets. The experimental results indicate promising performance of the ODFST-SDC technique compared with recent approaches.
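The abstract names two pre-processing steps, null value removal and categorical encoding, without specifying how they are carried out. The following is a minimal sketch of that stage only, under stated assumptions: records are plain Python lists, missing values are represented as `None`, and categories are mapped to integer codes in order of first appearance (label encoding). The function names `drop_null_rows` and `label_encode` and the sample records are hypothetical and are not taken from the paper; the LOCI, RDA, and KELM stages are not shown.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Row = Sequence[Optional[object]]


def drop_null_rows(rows: List[Row]) -> List[Row]:
    """Keep only records in which no field is missing (None)."""
    return [row for row in rows if all(value is not None for value in row)]


def label_encode(
    rows: List[Row], categorical_columns: Sequence[int]
) -> Tuple[List[list], Dict[int, Dict[object, int]]]:
    """Replace categorical values with integer codes.

    Codes are assigned per column in order of first appearance.
    This is an assumed encoding scheme, not the paper's stated one.
    """
    codebooks: Dict[int, Dict[object, int]] = {col: {} for col in categorical_columns}
    encoded: List[list] = []
    for row in rows:
        new_row = list(row)
        for col in categorical_columns:
            book = codebooks[col]
            # setdefault returns the existing code, or stores the next free one
            new_row[col] = book.setdefault(row[col], len(book))
        encoded.append(new_row)
    return encoded, codebooks


# Hypothetical records: (protocol, duration, label)
records = [
    ["tcp", 0.5, "normal"],
    ["udp", None, "attack"],  # contains a null value and is dropped
    ["udp", 1.2, "attack"],
    ["tcp", 0.7, "normal"],
]

clean = drop_null_rows(records)
encoded, codebooks = label_encode(clean, categorical_columns=[0, 2])
print(encoded)
print(codebooks)
```

In a streaming setting, the returned codebooks would presumably be kept and reused across batches so that a category seen earlier keeps the same code; this sketch processes a single batch for simplicity.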
Pages: 2101-2116
Page count: 16
Related papers
50 in total
  • [31] Novel Outlier Detection by Integration of Clustering and Classification
    Tripathy, Sarita
    Sahoo, Laxman
    DATA SCIENCE AND BIG DATA ANALYTICS, 2019, 16 : 169 - 176
  • [32] A Robust AUC Maximization Framework With Simultaneous Outlier Detection and Feature Selection for Positive-Unlabeled Classification
    Ren, Ke
    Yang, Haichuan
    Zhao, Yu
    Chen, Wu
    Xue, Mingshan
    Miao, Hongyu
    Huang, Shuai
    Liu, Ji
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (10) : 3072 - 3083
  • [33] A Novel Cloud Intrusion Detection System Using Feature Selection and Classification
    Kannan, Anand
    Venkatesan, Karthik Gururajan
    Stagkopoulou, Alexandra
    Li, Sheng
    Krishnan, Sathyavakeeswaran
    Rahman, Arifur
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2015, 11 (04) : 1 - 15
  • [34] Online streaming feature selection for multigranularity hierarchical classification learning
    Wang, Chenxi
    Zhang, Xiaoqing
    Ye, Liqin
    Mao, Yu
    Li, Shaozi
    Lin, Yaojin
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (17):
  • [35] Microarray classification with hierarchical data representation and novel feature selection criteria
    Bosio, Mattia
    Bellot, Pau
    Salembier, Philippe
    Oliveras Verges, Albert
    IEEE 12TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS & BIOENGINEERING, 2012, : 344 - 349
  • [36] A novel oversampling and feature selection hybrid algorithm for imbalanced data classification
    Feng, Fang
    Li, Kuan-Ching
    Yang, Erfu
    Zhou, Qingguo
    Han, Lihong
    Hussain, Amir
    Cai, Mingjiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3231 - 3267
  • [37] A novel oversampling and feature selection hybrid algorithm for imbalanced data classification
    Fang Feng
    Kuan-Ching Li
    Erfu Yang
    Qingguo Zhou
    Lihong Han
    Amir Hussain
    Mingjiang Cai
    Multimedia Tools and Applications, 2023, 82 : 3231 - 3267
  • [38] Streaming feature selection algorithms for big data: A survey
    AlNuaimi, Noura
    Masud, Mohammad Mehedy
    Serhani, Mohamed Adel
    Zaki, Nazar
    APPLIED COMPUTING AND INFORMATICS, 2022, 18 (1/2) : 113 - 135
  • [39] Optimal and Novel Hybrid Feature Selection Framework for Effective Data Classification
    Venkataraman, Sivakumar
    Selvaraj, Rajalakshmi
    ADVANCES IN SYSTEMS, CONTROL AND AUTOMATION, 2018, 442 : 499 - 514
  • [40] Local Feature Selection for Data Classification
    Armanfard, Narges
    Reilly, James P.
    Komeili, Majid
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (06) : 1217 - 1227