Evaluation of Categorical Data Clustering
0356106 - UIVT-O 2011 RIV DE eng C - Conference paper (foreign conference)
Řezanková, H. - Löster, T. - Húsek, Dušan
Evaluation of Categorical Data Clustering.
Advances in Intelligent Web Mastering - 3. Berlin: Springer, 2011 - (Mugellini, E.; Szczepaniak, P.; Pettenati, M.; Sokhn, M.), pp. 173-182. Advances in Intelligent and Soft Computing, 86. ISBN 978-3-642-18028-6. ISSN 1867-5662.
[AWIC 2011. Atlantic Web Intelligence Conference /7./. Fribourg (CH), 26.01.2011-28.01.2011]
CEP grant: GA ČR GAP202/10/0262; GA ČR GA205/09/1079
Research plan: CEZ:AV0Z10300504
Keywords: cluster analysis * nominal variable * determination of cluster numbers * evaluation of clustering
RIV field code: IN - Informatics
Methods of cluster analysis are well-known techniques of multivariate analysis that have been used for many years. Their main applications concern the clustering of objects characterized by quantitative variables. For this case, various coefficients for evaluating clusterings and determining the number of clusters have been proposed. However, in some areas, e.g., the segmentation of Internet users, the variables are often nominal or ordinal because they originate from questionnaire responses. We therefore address evaluation criteria for the case of categorical variables. Criteria based on variability measures are proposed. Instead of variance, which serves as the measure for quantitative variables, three measures for nominal variables are considered: the variability measure based on the modal frequency, Gini’s coefficient of mutability, and the entropy. The proposed evaluation criteria are applied to a real dataset.
Permanent link: http://hdl.handle.net/11104/0194719
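
The three nominal variability measures named in the abstract can be illustrated with a minimal Python sketch. It uses the common textbook forms: one minus the modal relative frequency, Gini's mutability as one minus the sum of squared relative frequencies, and Shannon entropy with the natural logarithm. The record does not give the paper's exact formulas or normalization, so these forms are assumptions. The function names and the size-weighted aggregation over clusters are also hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def relative_frequencies(values):
    """Relative frequency of each category in a non-empty list of labels."""
    if not values:
        raise ValueError("values must be non-empty")
    counts = Counter(values)
    n = len(values)
    return [c / n for c in counts.values()]


def modal_variability(values):
    """Assumed form: 1 minus the relative frequency of the modal category."""
    return 1.0 - max(relative_frequencies(values))


def gini_mutability(values):
    """Assumed form: 1 minus the sum of squared relative frequencies."""
    return 1.0 - sum(p * p for p in relative_frequencies(values))


def entropy(values):
    """Assumed form: Shannon entropy with the natural logarithm."""
    return sum(p * math.log(1.0 / p) for p in relative_frequencies(values))


def within_cluster_variability(values, labels, measure):
    """Hypothetical aggregation: size-weighted mean of `measure` over clusters."""
    clusters = {}
    for value, label in zip(values, labels):
        clusters.setdefault(label, []).append(value)
    n = len(values)
    return sum(len(members) / n * measure(members) for members in clusters.values())
```

Under these assumptions, a clustering that separates the categories of a variable perfectly yields a within-cluster variability of zero for all three measures, while a clustering that mixes categories evenly leaves the variability unchanged. Comparing such values across candidate numbers of clusters is one way an evaluation criterion of this kind could be used.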