Volume 9, Issue 4
ADVANCED REVIEW

Model‐based clustering and classification of functional data

Faicel Chamroukhi

Corresponding Author

E-mail address: faicel.chamroukhi@unicaen.fr

Department of Mathematics and Computer Science, Normandie University, UNICAEN, UMR CNRS LMNO, Caen, France

Correspondence

Faicel Chamroukhi, Department of Mathematics and Computer Science, Normandie University, UNICAEN, UMR CNRS LMNO, 14000 Caen, France.


Hien D. Nguyen

Department of Mathematics and Statistics, La Trobe University, Melbourne, Victoria, Australia

First published: 18 January 2019

Funding information: Australian Research Council, Grant/Award Number: DP180101192, DE170101134; Région Normandie, Grant/Award Number: RIN AStERiCs; ANR, Grant/Award Number: SMILES ANR‐18‐CE40‐0014

Abstract

Complex data analysis is a central topic in modern statistics and learning systems, and it is attracting broader interest as high‐dimensional data become more prevalent. The challenge is to develop statistical models and autonomous algorithms that can discern knowledge from raw data, which can be achieved through clustering techniques, or make predictions about future data via classification techniques. Latent data models, including mixture model‐based approaches, are among the most popular and successful approaches in both supervised and unsupervised learning. Although they are traditional tools in multivariate analysis, they are growing in popularity within the framework of functional data analysis (FDA). FDA is the data analysis paradigm in which each datum is a function rather than a real vector. In many areas of application, including signal and image processing, functional imaging, and bioinformatics, the analyzed data are often available as discretized values of functions, curves, or surfaces. This functional aspect of the data introduces difficulties beyond those of classical multivariate data analysis. We review approaches for model‐based clustering and classification of functional data. We present well‐grounded statistical models along with efficient algorithmic tools to address problems in the clustering and classification of these functional data, including their heterogeneity, missing information, and dynamical hidden structures. The presented models and algorithms are illustrated on real‐world functional data analysis problems from several areas of application.

This article is categorized under:

  • Fundamental Concepts of Data and Knowledge > Data Concepts
  • Algorithmic Development > Statistics
  • Technologies > Structure Discovery and Clustering

Abstract

Functional data clustering.

