1M0572 GA MŠMT - Ministerstvo školství, mládeže a tělovýchovy
CEZ
AV0Z10750506 - UTIA-B (2005-2011)
Anotace
During the last twenty years the number of text documents in digital form is enormously growing in size. As a consequence the need to automatically organize and classify documents is of great practical importance. Text classification aims for partition of an unstructured set of documents into groups that describe the contents of the document. There are two main variants of text classification: text clustering and text categorization. A major characteristic of the problem is the high dimension of text data.