Data clustering: application and trends

被引:65
|
作者
Oyewole, Gbeminiyi John [1 ]
Thopil, George Alex [1 ]
机构
[1] Univ Pretoria, Dept Engn & Technol Management, Pretoria, South Africa
关键词
Clustering; Clustering classification; Clustering components; Industry applications; Clustering algorithms; Clustering trends; PATTERN-CLASSIFICATION; R PACKAGE; ALGORITHMS; SYSTEM; ICT; INFORMATION; EXPLORATION; INDICATORS; CHALLENGES; MANAGEMENT;
D O I
10.1007/s10462-022-10325-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering has primarily been used as an analytical technique to group unlabeled data for extracting meaningful information. The fact that no clustering algorithm can solve all clustering problems has resulted in the development of several clustering algorithms with diverse applications. We review data clustering, intending to underscore recent applications in selected industrial sectors and other notable concepts. In this paper, we begin by highlighting clustering components and discussing classification terminologies. Furthermore, specific, and general applications of clustering are discussed. Notable concepts on clustering algorithms, emerging variants, measures of similarities/dissimilarities, issues surrounding clustering optimization, validation and data types are outlined. Suggestions are made to emphasize the continued interest in clustering techniques both by scholars and Industry practitioners. Key findings in this review show the size of data as a classification criterion and as data sizes for clustering become larger and varied, the determination of the optimal number of clusters will require new feature extracting methods, validation indices and clustering techniques. In addition, clustering techniques have found growing use in key industry sectors linked to the sustainable development goals such as manufacturing, transportation and logistics, energy, and healthcare, where the use of clustering is more integrated with other analytical techniques than a stand-alone clustering technique.
引用
收藏
页码:6439 / 6475
页数:37
相关论文
共 50 条
  • [41] Clustering Data with Temporal Evolution: Application to Electrophysiological Signals
    Medina, Liliana A. S.
    Fred, Ana L. N.
    AGENTS AND ARTIFICIAL INTELLIGENCE, 2011, 129 : 101 - 115
  • [42] A novel memetic algorithm and its application to data clustering
    Ni, JiaCheng
    Li, Li
    Qiao, Fei
    Wu, QiDi
    MEMETIC COMPUTING, 2013, 5 (01) : 65 - 78
  • [43] Analysis and Application of Data Mining Based on Clustering Algorithm
    Lai Honghui
    Lai Xiao Tao
    PROCEEDINGS OF THE 2015 INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE, 2015, 7 : 129 - 133
  • [44] Incremental Spatio Temporal Clustering Application on Hotspot Data
    Sitanggang, I. S.
    Radiatun, N.
    Risal, A. A. Nur
    2ND INTERNATIONAL CONFERENCE ON ENVIRONMENT AND FOREST CONSERVATION (ICEFC2019): ECOSYSTEM RESEARCH AND INNOVATION TO ACHIEVE SUSTAINABLE DEVELOPMENT GOALS, 2020, 528
  • [45] Application of Data Clustering to Railway Delay Pattern Recognition
    Cerreto, Fabrizio
    Nielsen, Bo Friis
    Nielsen, Otto Anker
    Harrod, Steven S.
    JOURNAL OF ADVANCED TRANSPORTATION, 2018,
  • [46] Fuzzy Clustering with ε-Hyperballs and Its Application to Data Classification
    Jezewski, Michal
    Czabanski, Robert
    Leski, Jacek
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2017, PT II, 2017, 10246 : 84 - 93
  • [47] The Application of Clustering Analysis in Medical Expenses Data Mining
    Shen, Pei
    Zhang, Jikai
    Hua, Haiying
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 1987 - +
  • [48] Fuzzy Gaussian Lasso clustering with application to cancer data
    Yang, Miin-Shen
    Ali, Wajid
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2020, 17 (01) : 250 - 265
  • [49] Research and Application of Clustering Algorithm for Text Big Data
    Chen, Zi Li
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [50] Clustering of variables with missing data: Application to preference studies
    Sahmer, K
    Qannari, EM
    Kunert, J
    Classification - the Ubiquitous Challenge, 2005, : 208 - 215