Distance Metrics and Clustering Methods for Mixed‐type Data
Summary
Despite the abundance of clustering techniques and algorithms, clustering mixed interval (continuous) and categorical (nominal and/or ordinal) scale data remains a challenging problem. To identify the most effective approaches for clustering mixed‐type data, we use both theoretical and empirical analyses to present a critical review of the strengths and weaknesses of the methods identified in the literature. Guidelines on approaches to use under different scenarios are provided, along with potential directions for future research.