Extreme learning machine for missing data using multiple imputations

Dušan Sovilj, Emil Eirola, Yoan Miche, Kaj-Mikael Björk, Rui Nian, Anton Akusok, Amaury Lendasse

Research output: Contribution to journalArticleScientificpeer-review

96 Citations (Scopus)

Abstract

In the paper, we examine the general regression problem under the missing data scenario. In order to provide reliable estimates for the regression function (approximation), a novel methodology based on Gaussian Mixture Model and Extreme Learning Machine is developed. Gaussian Mixture Model is used to model the data distribution which is adapted to handle missing values, while Extreme Learning Machine enables to devise a multiple imputation strategy for final estimation. With multiple imputation and ensemble approach over many Extreme Learning Machines, final estimation is improved over the mean imputation performed only once to complete the data. The proposed methodology has longer running times compared to simple methods, but the overall increase in accuracy justifies this trade-off.
Original languageEnglish
Peer-reviewed scientific journalNeurocomputing
Volume174, Part A
Issue numberJanuary
Pages (from-to)220-231
Number of pages12
ISSN0925-2312
DOIs
Publication statusPublished - 22.01.2016
MoE publication typeA1 Journal article - refereed

Keywords

  • 512 Business and Management
  • Extreme LearningMachine
  • Missing data
  • Multiple imputation
  • Gaussian mixturemodel
  • Mixture ofGaussians
  • Conditional distribution

Fingerprint

Dive into the research topics of 'Extreme learning machine for missing data using multiple imputations'. Together they form a unique fingerprint.

Cite this