Number of the records: 1
Using Copulas in Data Mining Based on the Observational Calculus
- 1.0447829 - ÚI 2016 RIV US eng J - Journal Article
Holeňa, Martin - Bajer, L. - Ščavnický, M.
Using Copulas in Data Mining Based on the Observational Calculus.
IEEE Transactions on Knowledge and Data Engineering. Roč. 27, č. 10 (2015), s. 2851-2864. ISSN 1041-4347. E-ISSN 1558-2191
R&D Projects: GA ČR GA13-17187S
Grant - others:SLU(CZ) SGS/21/2014
Institutional support: RVO:67985807
Keywords : data mining * observational calculus * generalized quantifiers * joint probability distribution * copulas * hierarchical Archimedean copulas
Subject RIV: IN - Informatics, Computer Science
Impact factor: 2.476, year: 2015
The objective of the paper is a contribution to data mining within the framework of the observational calculus, through introducing generalized quantifiers related to copulas. Fitting copulas to multidimensional data is an increasingly important method for analyzing dependencies, and the proposed quantifiers of observational calculus assess the results of estimating the structure of joint distributions of continuous variables by means of hierarchical Archimedean copulas. To this end, the existing theory of hierarchical Archimedean copulas has been slightly extended in the paper: It has been proven that sufficient conditions for the function defining a hierarchical Archimedean copula to be indeed a copula, which have so far been rigorously established only for the special case of fully nested Archimedean copulas, hold in general. These conditions allow us to define three new generalized quantifiers, which are then thoroughly validated on four benchmark data sets and one data set from a real-world application. The paper concludes by comparing the proposed quantifiers to a more traditional approach—maximum weight spanning trees.
Permanent Link: http://hdl.handle.net/11104/0249603
File Download Size Commentary Version Access a0447829.pdf 8 668.1 KB Publisher’s postprint require
Number of the records: 1