Počet záznamů: 1
A Robust Supervised Variable Selection for Noisy High-Dimensional Data
- 1.
SYSNO ASEP 0444727 Druh ASEP J - Článek v odborném periodiku Zařazení RIV J - Článek v odborném periodiku Poddruh J Článek ve WOS Název A Robust Supervised Variable Selection for Noisy High-Dimensional Data Tvůrce(i) Kalina, Jan (UIVT-O) RID, SAI, ORCID
Schlenker, Anna (UIVT-O) RID, ORCIDČíslo článku 320385 Zdroj.dok. BioMed Research International. - : Hindawi - ISSN 2314-6133
Roč. 2015 (2015)Poč.str. 10 s. Jazyk dok. eng - angličtina Země vyd. US - Spojené státy americké Klíč. slova dimensionality reduction ; variable selection ; robustness Vědní obor RIV BA - Obecná matematika CEP GA13-17187S GA ČR - Grantová agentura ČR Institucionální podpora UIVT-O - RVO:67985807 UT WOS 000356261700001 EID SCOPUS 84934967829 DOI 10.1155/2015/320385 Anotace The Minimum Redundancy Maximum Relevance (MRMR) approach to supervised variable selection represents a successful methodology for dimensionality reduction, which is suitable for high-dimensional data observed in two or more different groups. Various available versions of the MRMR approach have been designed to search for variables with the largest relevance for a classification task while controlling for redundancy of the selected set of variables. However, usual relevance and redundancy criteria have the disadvantages of being too sensitive to the presence of outlying measurements and/or being inefficient. We propose a novel approach called Minimum Regularized Redundancy Maximum Robust Relevance (MRRMRR), suitable for noisy high-dimensional data observed in two groups. It combines principles of regularization and robust statistics. Particularly, redundancy is measured by a new regularized version of the coefficient of multiple correlation and relevance is measured by a highly robust correlation coefficient based on the least weighted squares regression with data-adaptive weights. We compare various dimensionality reduction methods on three real data sets. To investigate the influence of noise or outliers on the data, we perform the computations also for data artificially contaminated by severe noise of various forms. The experimental results confirm the robustness of the method with respect to outliers. Pracoviště Ústav informatiky Kontakt Tereza Šírová, sirova@cs.cas.cz, Tel.: 266 053 800 Rok sběru 2016
Počet záznamů: 1