Data clustering: application and trends

被引:65
|
作者
Oyewole, Gbeminiyi John [1 ]
Thopil, George Alex [1 ]
机构
[1] Univ Pretoria, Dept Engn & Technol Management, Pretoria, South Africa
关键词
Clustering; Clustering classification; Clustering components; Industry applications; Clustering algorithms; Clustering trends; PATTERN-CLASSIFICATION; R PACKAGE; ALGORITHMS; SYSTEM; ICT; INFORMATION; EXPLORATION; INDICATORS; CHALLENGES; MANAGEMENT;
D O I
10.1007/s10462-022-10325-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering has primarily been used as an analytical technique to group unlabeled data for extracting meaningful information. The fact that no clustering algorithm can solve all clustering problems has resulted in the development of several clustering algorithms with diverse applications. We review data clustering, intending to underscore recent applications in selected industrial sectors and other notable concepts. In this paper, we begin by highlighting clustering components and discussing classification terminologies. Furthermore, specific, and general applications of clustering are discussed. Notable concepts on clustering algorithms, emerging variants, measures of similarities/dissimilarities, issues surrounding clustering optimization, validation and data types are outlined. Suggestions are made to emphasize the continued interest in clustering techniques both by scholars and Industry practitioners. Key findings in this review show the size of data as a classification criterion and as data sizes for clustering become larger and varied, the determination of the optimal number of clusters will require new feature extracting methods, validation indices and clustering techniques. In addition, clustering techniques have found growing use in key industry sectors linked to the sustainable development goals such as manufacturing, transportation and logistics, energy, and healthcare, where the use of clustering is more integrated with other analytical techniques than a stand-alone clustering technique.
引用
收藏
页码:6439 / 6475
页数:37
相关论文
共 50 条
  • [21] Unsupervised clustering methods for medical data: An application to thyroid gland data
    Albayrak, S
    ARTIFICAIL NEURAL NETWORKS AND NEURAL INFORMATION PROCESSING - ICAN/ICONIP 2003, 2003, 2714 : 695 - 701
  • [22] Recent trends in data mining (DM): Document clustering of DM publications
    Peng, Yi
    Kou, Gang
    Chen, Zhengxin
    Shi, Yong
    2006 INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, VOLS 1 AND 2, PROCEEDINGS, 2006, : 1653 - 1659
  • [23] DETECT: A Hierarchical Clustering Algorithm for Behavioural Trends in Temporal Educational Data
    McBroom, Jessica
    Yacef, Kalina
    Koprinska, Irena
    ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2020), PT I, 2020, 12163 : 374 - 385
  • [24] Application of improved ant algorithm in spatial data clustering
    Hazhong Qian
    Fang Wu
    Lei Ge
    Bo Chen
    Huilian Wang
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13 : 798 - 801
  • [25] Multivariate functional clustering and its application to typhoon data
    Misumi T.
    Matsui H.
    Konishi S.
    Behaviormetrika, 2019, 46 (1) : 163 - 175
  • [26] The Application of Data Mining Clustering Algorithm in Fuzzy Control
    Li Guodong
    Xia Kewen
    SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING: THEORY AND PRACTICE, VOL 1, 2012, 114 : 105 - 113
  • [27] NONLINEAR DIMENSION REDUCTION FOR FUNCTIONAL DATA WITH APPLICATION TO CLUSTERING
    Tan, Ruoxu
    Zang, Yiming
    Yin, Guosheng
    STATISTICA SINICA, 2024, 34 (03) : 1391 - 1412
  • [28] Density of points clustering, application to transcriptomic data analysis
    Wicker, N
    Dembele, D
    Raffelsberger, W
    Poch, O
    NUCLEIC ACIDS RESEARCH, 2002, 30 (18) : 3992 - 4000
  • [29] Weighted consensus clustering and its application to Big data
    Alguliyev, Rasim M.
    Aliguliyev, Ramiz M.
    Sukhostat, Lyudmila V.
    Expert Systems with Applications, 2021, 150
  • [30] Application of Inclined Planes system Optimization on Data Clustering
    Mozaffari, Mohammad Hamed
    Abdy, Hamed
    Zahiri, Seyed Hamid
    2013 FIRST IRANIAN CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS (PRIA), 2013,