Number of the records: 1  

A genetic algorithm for multivariate missing data imputation

  1. 1.
    0565679 - ÚI 2024 RIV US eng J - Journal Article
    Figueroa-Garcia, J.C. - Neruda, Roman - Hernandez–Pérez, G.
    A genetic algorithm for multivariate missing data imputation.
    Information Sciences. Roč. 619, January 2023 (2023), s. 947-967. ISSN 0020-0255. E-ISSN 1872-6291
    Institutional support: RVO:67985807
    Keywords : Missing data * Genetic algorithms * Multivariate missing data * Data imputation
    OECD category: Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
    Impact factor: 8.1, year: 2022
    Method of publishing: Open access
    https://dx.doi.org/10.1016/j.ins.2022.11.037

    Some data mining, AI and data processing tasks might have data loss whose estimation/imputation is an important problem to be solved. Genetic algorithms are efficient and flexible global optimization methods able to deal with both multiple missing observations and multiple features such as continuous/discrete/binary data which are often found in multivariate databases unlike classical missing data estimation methods which only deal with univariate–continuous data. This paper presents a genetic algorithm to impute multiple missing observations in multivariate data which minimizes a new multi–objective (fitness) function based on the Minkowski distance of the means, variances, covariances and skewness between available/completed data. To do so, two sets of examples were tested: a continuous/discrete dataset which is compared to both the EM algorithm and auxiliary regressions, and a comparison over seven benchmark datasets.
    Permanent Link: https://hdl.handle.net/11104/0337194

     
    FileDownloadSizeCommentaryVersionAccess
    0565679-aoa.pdf6699.9 KBOA CC BY NC ND 4.0Publisher’s postprintopen-access
     
Number of the records: 1  

  This site uses cookies to make them easier to browse. Learn more about how we use cookies.