IAA2075302 GA AV ČR - Academy of Sciences of the Czech Republic (AV ČR)
KSK1019101 GA AV ČR - Academy of Sciences of the Czech Republic (AV ČR)
1M0572 GA MŠMT - Ministry of Education, Youth and Sports (MEYS)
CEZ
AV0Z10750506 - UTIA-B (2005-2011)
Annotation
During the last twenty years the number of text documents in digital form is enormously growing in size. As a consequence the need to automatically organize and classify documents is of great practical importance. Text classification aims for partition of an unstructured set of documents into groups that describe the contents of the document. There are two main variants of text classification: text clustering and text categorization. A major characteristic of the problem is the high dimension of text data.