Number of the records: 1  

A Comparative study of two methodologies for large binary datasets analysis.

  1.
    0387233 - ÚI 2013 RIV CZ eng J - Journal Article
    Frolov, A. - Húsek, Dušan - Polyakov, P.Y. - Řezanková, H.
    A Comparative study of two methodologies for large binary datasets analysis.
    Neural Network World. Vol. 22, No. 6 (2012), pp. 565-582. ISSN 1210-0552
    R&D Projects: GA ČR GAP202/10/0262
    Grant - others: GA MŠk(CZ) ED1.1.00/02.0070
    Program: ED
    Institutional support: RVO:67985807
    Keywords : dimension reduction * statistics * data mining * Boolean factor analysis * Boolean matrix factorization * information gain * likelihood-maximization * bars problem
    Subject RIV: IN - Informatics, Computer Science
    Impact factor: 0.362, year: 2012

    This paper studies the differences between two approaches to revealing latent variables in binary data. Both approaches assume that the observed high-dimensional data are driven by a small number of hidden binary sources combined by Boolean superposition. The first approach is Boolean matrix factorization (BMF) and the second is Boolean factor analysis (BFA). Two BMF methods are used for comparison: the M8 method from the BMDP statistical software package and the method suggested by Belohlavek & Vychodil. These two are compared with BFA, specifically with the Expectation-maximization Boolean Factor Analysis method that we developed earlier, extended here with a binarization step. The well-known bars problem and the mushroom dataset are used to reveal the methods' peculiarities. In particular, the reconstruction ability of the computed factors and the information gain as a measure of dimension reduction were under scrutiny. It was shown that BFA slightly loses to BMF in performance when noise-free signals are analyzed. Conversely, BMF loses considerably to BFA when input signals are noisy.
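
The generative assumption named in the abstract (hidden binary sources combined by Boolean superposition) can be illustrated on the bars problem. The following is only a minimal sketch of the data model and of a reconstruction-error measure. It is not the BMF or BFA algorithm from the paper. All function names, the grid size, the factor-activation probability 0.25, and the independent bit-flip noise with probability 0.05 are assumptions chosen for illustration.

```python
import random

def bar_factors(n):
    """Return the 2n bar patterns of an n x n grid as flat 0/1 lists."""
    factors = []
    for r in range(n):  # horizontal bars: all pixels in row r
        factors.append([1 if i // n == r else 0 for i in range(n * n)])
    for c in range(n):  # vertical bars: all pixels in column c
        factors.append([1 if i % n == c else 0 for i in range(n * n)])
    return factors

def boolean_product(scores, factors):
    """Boolean superposition: x[i][j] = OR over k of (scores[i][k] AND factors[k][j])."""
    width = len(factors[0])
    return [
        [int(any(s and f[j] for s, f in zip(row, factors))) for j in range(width)]
        for row in scores
    ]

def flip_noise(data, p, rng):
    """Flip each bit independently with probability p (assumed noise model)."""
    return [[bit ^ int(rng.random() < p) for bit in row] for row in data]

def reconstruction_error(data, scores, factors):
    """Fraction of entries where the Boolean product differs from the data."""
    approx = boolean_product(scores, factors)
    total = len(data) * len(data[0])
    wrong = sum(a != b for ra, rb in zip(approx, data) for a, b in zip(ra, rb))
    return wrong / total

rng = random.Random(0)
n = 4
factors = bar_factors(n)
# Each of the 2n hidden sources is active with probability 0.25 (illustrative).
scores = [[int(rng.random() < 0.25) for _ in factors] for _ in range(50)]
clean = boolean_product(scores, factors)
noisy = flip_noise(clean, 0.05, rng)

print(reconstruction_error(clean, scores, factors))  # 0.0 by construction
print(reconstruction_error(noisy, scores, factors))  # share of flipped bits
```

With the true factors and scores, the noise-free data are reconstructed exactly, while the error on the noisy data equals the share of flipped bits. A factorization method has to recover `scores` and `factors` from the data alone, which this sketch does not attempt.
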
    Permanent Link: http://hdl.handle.net/11104/0216965

     
    File          Download   Size       Commentary   Version                 Access
    0387233.pdf   5          493.6 KB                Publisher’s postprint   open-access
     