Number of the records: 1
Evaluation of Categorical Data Clustering
- 1.0356106 - ÚI 2011 RIV DE eng C - Conference Paper (international conference)
Řezanková, H. - Löster, T. - Húsek, Dušan
Evaluation of Categorical Data Clustering.
Advances in Intelligent Web Mastering - 3. Berlin: Springer, 2011 - (Mugellini, E.; Szczepaniak, P.; Pettenati, M.; Sokhn, M.), s. 173-182. Advances in Intelligent and Soft Computing, 86. ISBN 978-3-642-18028-6. ISSN 1867-5662.
[AWIC 2011. Atlantic Web Intelligence Conference /7./. Fribourg (CH), 26.01.2011-28.01.2011]
R&D Projects: GA ČR GAP202/10/0262; GA ČR GA205/09/1079
Institutional research plan: CEZ:AV0Z10300504
Keywords : cluster analysis * nominal variable * determination of cluster numbers * evaluation of clustering
Subject RIV: IN - Informatics, Computer Science
Methods of cluster analysis are well known techniques of multivariate analysis used for many years. Their main applications concern clustering objects characterized by quantitative variables. For this case various coefficients for clustering evaluation and determination of cluster numbers have been proposed. However, in some areas, i.e., for segmentation of Internet users, the variables are often nominal or ordinal as their origin in questionnaire responses. That is why we are dealing with the evaluation criteria for the case of categorical variables here. The criteria based on variability measures are proposed. Instead of variance as a measure for quantitative variables, three measures for nominal variables are considered: the variability measure based on a modal frequency, Gini’s coefficient of mutability, and the entropy. The proposed evaluation criteria are applied to a real-dataset.
Permanent Link: http://hdl.handle.net/11104/0194719
Number of the records: 1