Robust Learning with Missing Data

Authors: Marco Ramoni, Paola Sebastiani

Abstract

This paper introduces a new method, called the robust Bayesian estimator (RBE), to learn conditional probability distributions from incomplete data sets. The intuition behind the RBE is that, when no information about the pattern of missing data is available, an incomplete database constrains the set of all possible estimates and this paper provides a characterization of these constraints. An experimental comparison with two popular methods to estimate conditional probability distributions from incomplete data—Gibbs sampling and the EM algorithm—shows a gain in robustness. An application of the RBE to quantify a naive Bayesian classifier from an incomplete data set illustrates its practical relevance.
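The intuition that an incomplete database constrains the set of possible estimates can be illustrated with a minimal sketch. It is not the RBE as defined in the paper: it handles only a single categorical variable, uses plain relative frequencies with no prior, and assumes missing entries are encoded as `None`. The function name `probability_bounds` is hypothetical. The lower bound for a category assumes that no missing entry belongs to it, and the upper bound assumes that every missing entry does.

```python
from collections import Counter


def probability_bounds(values, categories):
    """Bound the relative frequency of each category in an incomplete sample.

    Illustrative only (not the paper's estimator). Missing entries are
    assumed to be encoded as None, and nothing is assumed about why
    they are missing.
    """
    n = len(values)
    if n == 0:
        raise ValueError("need at least one record")
    observed = [v for v in values if v is not None]
    n_missing = n - len(observed)
    counts = Counter(observed)
    bounds = {}
    for c in categories:
        lower = counts[c] / n                # no missing entry is c
        upper = (counts[c] + n_missing) / n  # every missing entry is c
        bounds[c] = (lower, upper)
    return bounds


# Hypothetical sample: 8 records, 2 of them missing.
sample = ["a", "b", None, "a", None, "b", "a", "b"]
print(probability_bounds(sample, ["a", "b"]))
# {'a': (0.375, 0.625), 'b': (0.375, 0.625)}
```

In this sketch every interval has width equal to the fraction of missing entries, so the bounds collapse to a single point estimate when the data are complete.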

Keywords: Bayesian learning, Bayesian networks, Bayesian classifiers, probability intervals, missing data

Paper URL: https://doi.org/10.1023/A:1010968702992