Počet záznamů: 1
Dimensionality Reduction in Boolean Data: Comparison of Four BMF Methods
- 1.0454876 - ÚI 2016 DE eng C - Konferenční příspěvek (zahraniční konf.)
Bartl, E. - Bělohlávek, R. - Osicka, P. - Řezanková, Hana
Dimensionality Reduction in Boolean Data: Comparison of Four BMF Methods.
Clustering High-Dimensional Data. Berlin: Springer, 2015 - (Masulli, F.; Petrosino, A.; Rovetta, S.), s. 118-133. Lecture Notes in Computer Science, 7627. ISBN 978-3-662-48576-7. ISSN 0302-9743.
[CHDD 2012. Clustering High-Dimensional Data. International Workshop /1./. Naples (IT), 15.05.2012-15.05.2012]
Grant CEP: GA ČR GAP202/10/0262
Grant ostatní: GA MŠk CZ.1.07/2.3.00/20.0059
Klíčová slova: binary data * dimensionality reduction * Boolean factor analysis * matrix decomposition
Kód oboru RIV: BB - Aplikovaná statistika, operační výzkum
We compare four methods for Boolean matrix factorization (BMF). The oldest of these methods is the 8M method implemented in the BMDP statistical software package developed in the 1960s. The three other methods were developed recently. All the methods compute from an input object-attribute matrix I two matrices, namely an object-factor matrix A and a factor-attribute matrix B in such a way that the Boolean matrix product of A and B is approximately equal to I. Such decompositions are utilized directly in Boolean factor analysis or indirectly as a dimensionality reduction method for Boolean data in machine learning. While some comparison of the BMF methods with matrix decomposition methods designed for real valued data exists in the literature, a mutual comparison of the various BMF methods is a severely neglected topic. In this paper, we compare the four methods on real datasets. In particular, we observe the reconstruction ability of the first few computed factors as well as the number of computed factors necessary to fully reconstruct the input matrix, i.e. the approximation to the Boolean rank of I computed by the methods. In addition, we present some general remarks on all the methods being compared.
Trvalý link: http://hdl.handle.net/11104/0255530
Název souboru Staženo Velikost Komentář Verze Přístup a0454876.pdf 1 296.1 KB Vydavatelský postprint vyžádat
Počet záznamů: 1