Model-based clustering and classification of functional data

被引:34
|
作者
Chamroukhi, Faicel [1 ]
Nguyen, Hien D. [2 ]
机构
[1] Normandie Univ, Dept Math & Comp Sci, UNICAEN, UMR CNRS LMNO, F-14000 Caen, France
[2] La Trobe Univ, Dept Math & Stat, Melbourne, Vic, Australia
基金
澳大利亚研究理事会;
关键词
algorithms; classification; clustering; EM; functional data analysis; mixture models; HIDDEN MARKOV MODEL; DISCRIMINANT-ANALYSIS; MAXIMUM-LIKELIHOOD; MIXTURE MODEL; EM ALGORITHM; REGRESSION; INFERENCE; TUTORIAL;
D O I
10.1002/widm.1298
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Complex data analysis is a central topic of modern statistics and learning systems which is becoming of broader interest with the increasing prevalence of high-dimensional data. The challenge is to develop statistical models and autonomous algorithms that are able to discern knowledge from raw data, which can be achieved through clustering techniques, or to make predictions of future data via classification techniques. Latent data models, including mixture model-based approaches, are among the most popular and successful approaches in both supervised and unsupervised learning. Although being traditional tools in multivariate analysis, they are growing in popularity when considered in the framework of functional data analysis (FDA). FDA is the data analysis paradigm in which each datum is a function, rather than a real vector. In many areas of application, including signal and image processing, functional imaging, bioinformatics, etc., the analyzed data are indeed often available in the form of discretized values of functions, curves, or surfaces. This functional aspect of the data adds additional difficulties when compared to classical multivariate data analysis. We review and present approaches for model-based clustering and classification of functional data. We present well-grounded statistical models along with efficient algorithmic tools to address problems regarding the clustering and the classification of these functional data, including their heterogeneity, missing information, and dynamical hidden structures. The presented models and algorithms are illustrated via real-world functional data analysis problems from several areas of application. This article is categorized under: Fundamental Concepts of Data and Knowledge > Data Concepts Structure Discovery and Clustering
引用
收藏
页数:36
相关论文
共 50 条
  • [41] Bayesian model-based clustering for longitudinal ordinal data
    Roy Costilla
    Ivy Liu
    Richard Arnold
    Daniel Fernández
    Computational Statistics, 2019, 34 : 1015 - 1038
  • [42] BAYESIAN MODEL-BASED CLUSTERING FOR POPULATIONS OF NETWORK DATA
    Mantziou, Anastasia
    Lunagomez, Simon
    Mitra, Robin
    ANNALS OF APPLIED STATISTICS, 2024, 18 (01): : 266 - 302
  • [43] Model-Based Clustering of Inhomogeneous Paired Comparison Data
    Busse, Ludwig M.
    Buhmann, Joachim M.
    SIMILARITY-BASED PATTERN RECOGNITION: FIRST INTERNATIONAL WORKSHOP, SIMBAD 2011, 2011, 7005 : 207 - 221
  • [44] Cloud Model-based Data Attributes Reduction for Clustering
    Xu Ru-zhi
    Nie Pei-yao
    Lin Pei-guang
    Chu Dong-sheng
    PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 33 - 36
  • [45] Model-Based Clustering of Mixed Data With Sparse Dependence
    Choi, Young-Geun
    Ahn, Soohyun
    Kim, Jayoun
    IEEE ACCESS, 2023, 11 : 75945 - 75954
  • [46] Model-based clustering of Gaussian copulas for mixed data
    Marbac, Matthieu
    Biernacki, Christophe
    Vandewalle, Vincent
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (23) : 11635 - 11656
  • [47] Scalable model-based clustering by working on data summaries
    Jin, HD
    Wong, ML
    Leung, KS
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 91 - 98
  • [48] Probabilistic model-based clustering of multivariate and sequential data
    Smyth, P
    ARTIFICIAL INTELLIGENCE AND STATISTICS 99, PROCEEDINGS, 1999, : 299 - 304
  • [49] Model-based clustering for multivariate partial ranking data
    Jacques, Julien
    Biernacki, Christophe
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2014, 149 : 201 - 217
  • [50] Model-based co-clustering for ordinal data
    Jacques, Julien
    Biernacki, Christophe
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 123 : 101 - 115