Data clustering: application and trends

Cited by: 65
Authors
Oyewole, Gbeminiyi John [1]
Thopil, George Alex [1]
Affiliation
[1] Univ Pretoria, Dept Engn & Technol Management, Pretoria, South Africa
Keywords
Clustering; Clustering classification; Clustering components; Industry applications; Clustering algorithms; Clustering trends; PATTERN-CLASSIFICATION; R PACKAGE; ALGORITHMS; SYSTEM; ICT; INFORMATION; EXPLORATION; INDICATORS; CHALLENGES; MANAGEMENT;
DOI
10.1007/s10462-022-10325-y
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Clustering has primarily been used as an analytical technique to group unlabeled data for extracting meaningful information. The fact that no clustering algorithm can solve all clustering problems has resulted in the development of several clustering algorithms with diverse applications. We review data clustering, intending to underscore recent applications in selected industrial sectors and other notable concepts. In this paper, we begin by highlighting clustering components and discussing classification terminologies. Furthermore, specific and general applications of clustering are discussed. Notable concepts on clustering algorithms, emerging variants, measures of similarity/dissimilarity, issues surrounding clustering optimization, validation, and data types are outlined. Suggestions are made to emphasize the continued interest in clustering techniques by both scholars and industry practitioners. Key findings in this review show that data size serves as a classification criterion and that, as data sizes for clustering become larger and more varied, determining the optimal number of clusters will require new feature extraction methods, validation indices, and clustering techniques. In addition, clustering techniques have found growing use in key industry sectors linked to the sustainable development goals, such as manufacturing, transportation and logistics, energy, and healthcare, where clustering is more often integrated with other analytical techniques than used as a stand-alone technique.
Pages: 6439 - 6475
Page count: 37
Related papers
50 in total
  • [31] Application of Clustering Algorithm in Intelligent Transportation Data Analysis
    Long Qiong
    Yu Jie
    Zhang Jinfang
    INFORMATION AND MANAGEMENT ENGINEERING, PT VI, 2011, 236 : 467 - 473
  • [32] Tight Clustering for Large Datasets with an Application to Microarray Data
    Karmakar, Bikram
    Das, Sarmistha
    Bhattacharya, Sohom
    Sarkar, Rohan
    Mukhopadhyay, Indranil
    GENETIC EPIDEMIOLOGY, 2016, 40 (07) : 644 - 645
  • [33] Clustering of categoric data in medicine - Application of evolutionary algorithms
    Villmann, Thomas
    Albani, Conny
    COMPUTATIONAL INTELLIGENCE: THEORY AND APPLICATIONS, PROCEEDINGS, 2001, 2206 : 619 - 627
  • [34] Weighted consensus clustering and its application to Big data
    Alguliyev, Rasim M.
    Aliguliyev, Ramiz M.
    Sukhostat, Lyudmila, V
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 150
  • [35] Clustering of spatiotemporal signals: Application to the analysis of FMRI data
    Meyer, FG
    Chinrungrueng, J
    PROCEEDINGS OF THE 2003 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING, 2003, : 486 - 489
  • [36] Feature Clustering of Noisy Data and Application in the Currency Market
    Seidpisheh, Mohammad
    Babayi, Salman
    Mohammadpour, Adel
    FLUCTUATION AND NOISE LETTERS, 2022, 21 (06):
  • [37] Clustering, assessment and validation: an application to gene expression data
    Ciaramella, A.
    Cocozza, S.
    Lorio, E.
    Miele, G.
    Napolitano, F.
    Pinelli, M.
    Raiconi, G.
    Tagliaferri, R.
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1613 - +
  • [38] Application of Density Based Clustering to Microarray Data Analysis
    Raczynski, Lech
    Wozniak, Krzysztof
    Rubel, Tymon
    Zaremba, Krzysztof
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2010, 56 (03) : 281 - 286
  • [39] Clustering Application for Streaming Big Data in Smart Grid
    Banga, Alisha
    Sinha, Amrita
    PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 1051 - 1054
  • [40] An Application of Clustering Techniques to Reducing Crystallographic Texture Data
    Ostapovich, Kirill V.
    Trusov, Peter V.
    28TH RUSSIAN CONFERENCE ON MATHEMATICAL MODELLING IN NATURAL SCIENCES, 2020, 2216