Semi-parametric optimization for missing data imputation

作者:Yongsong Qin, Shichao Zhang, Xiaofeng Zhu, Jilian Zhang, Chengqi Zhang

摘要

Missing data imputation is an important issue in machine learning and data mining. In this paper, we propose a new and efficient imputation method for a kind of missing data: semi-parametric data. Our imputation method aims at making an optimal evaluation about Root Mean Square Error (RMSE), distribution function and quantile after missing-data are imputed. We evaluate our approaches using both simulated data and real data experimentally, and demonstrate that our stochastic semi-parametric regression imputation is much better than existing deterministic semi-parametric regression imputation in efficiency and effectiveness.

论文关键词:Missing data, Missing data imputation, Semi-parametric data

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10489-006-0032-0