Outlier detection toward high-dimensional industrial data using extreme tensor-train learning machine with compression

被引:0
|
作者
Deng, Xiaowu [1 ]
Shi, Yuanquan
Yao, Dunhong
机构
[1] Huaihua Univ, Sch Comp Sci & Engn, Huaihua, Peoples R China
基金
中国国家自然科学基金;
关键词
High-dimensional industrial data; Outlier detection; Tensorized compression; Extreme tensor-train Learning Machine; RECOGNITION;
D O I
10.1016/j.jksuci.2023.101576
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Outlier detection in a high-dimensional dataset is a significant but challenging task in a number of applications. Extreme learning machine (ELM) is a powerful modeling tool for identifying outlier in an underlying dataset. However, when dealing with outliers in high-dimensional industry data, ELM brings huge storage and computational cost. To address this issue, we propose ELM based on a tensor-train format (ETFLM). Specifically, a tensor-train layer is builded with tensor-train decomposition. The fully connected layers of a neural network are replaced with tensor-train layers. Based on tensor-train layers and ELM, ETFLM is proposed in this study and its training algorithm is further presented. The experimental results show that ETFLM achieves high compression rate on low-dimensional data, and detection accuracy is slightly decreased. However, on high-dimensional data, ETFLM achieves more than 60%, whereas traditional algorithms achieve less than 40%.& COPY; 2023 The Author(s). Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Projected outlier detection in high-dimensional mixed-attributes data set
    Ye, Mao
    Li, Xue
    Orlowska, Maria E.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 7104 - 7113
  • [42] A Method for Measurement Data Modeling and High-Dimensional Outlier Detection Based on Large Dimensional Matrix
    Chen, Gang
    Fan, Huanhuan
    An, Baoran
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2274 - 2279
  • [43] Sparse Modeling-Based Sequential Ensemble Learning for Effective Outlier Detection in High-Dimensional Numeric Data
    Pang, Guansong
    Cao, Longbing
    Chen, Ling
    Lian, Defu
    Liu, Huan
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3892 - 3899
  • [44] A hierarchical structure of extreme learning machine (HELM) for high-dimensional datasets with noise
    He, Yan-Lin
    Geng, Zhi-Qiang
    Xu, Yuan
    Zhu, Qun-Xiong
    NEUROCOMPUTING, 2014, 128 : 407 - 414
  • [45] Data Compression and Prediction Using Machine Learning for Industrial IoT
    Park, Junmin
    Park, Hyunjae
    Choi, Young-June
    2018 32ND INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN), 2018, : 818 - 820
  • [46] High-Dimensional Data Visualisation Methods Using Machine Learning and Their Use in Image Analysis
    Tian, Ying
    Ali, Majid Khan Majahar
    Wu, Lili
    Li, Tao
    TRAITEMENT DU SIGNAL, 2024, 41 (03) : 1355 - 1364
  • [47] Extreme Learning Machine on High Dimensional and Large Data Applications
    Lin, Zhiping
    Cao, Jiuwen
    Chen, Tao
    Jin, Yi
    Sun, Zhan-Li
    Lendasse, Amaury
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [48] Seismic data compression using high-dimensional wavelet transforms
    Villasenor, JD
    Ergas, RA
    Donoho, PL
    DCC '96 - DATA COMPRESSION CONFERENCE, PROCEEDINGS, 1996, : 396 - 405
  • [49] Outlier detection for high dimensional data using the Comedian approach
    Sajesh, T. A.
    Srinivasan, M. R.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2012, 82 (05) : 745 - 757
  • [50] A fast outlier detection strategy for distributed high-dimensional data sets with mixed attributes
    Anna Koufakou
    Michael Georgiopoulos
    Data Mining and Knowledge Discovery, 2010, 20 : 259 - 289