Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
126 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Partitioning Clustering algorithms for handling numerical and categorical data: a review (1311.7219v3)

Published 28 Nov 2013 in cs.DB

Abstract: Clustering is widely used in different field such as biology, psychology, and economics. Most traditional clustering algorithms are limited to handling datasets that contain either numeric or categorical attributes. However, datasets with mixed types of attributes are common in real life data mining applications. In this paper, we review partitioning based algorithm such as K-prototype, Extension of K-prototype, K-histogram, Fuzzy approaches, genetic approaches, etc. These algorithm works on both numerical and categorical data. The approaches has been proposed to handle mixed data are based on four different perceptive: i) split data set into two part such that each part contain either numerical or categorical data, then apply separate clustering algorithm on each data set, finally combined the result of both clustering algorithm, ii) converting categorical attribute into numerical attribute and apply numerical attribute clustering algorithm; iii) discrimination of numerical attribute and apply categorical based clustering algorithm; iv) Conversion of the categorical attributes into binary ones and apply any numerical based clustering algorithm

Citations (2)

Summary

We haven't generated a summary for this paper yet.