Version spaces and the consistency problem

作者：

摘要

A version space is a collection of concepts consistent with a given set of positive and negative examples. Mitchell [Artificial Intelligence 18 (1982) 203–226] proposed representing a version space by its boundary sets: the maximally general (G) and maximally specific consistent concepts (S). For many simple concept classes, the size of G and S is known to grow exponentially in the number of positive and negative examples. This paper argues that previous work on alternative representations of version spaces has disguised the real question underlying version space reasoning. We instead show that tractable reasoning with version spaces turns out to depend on the consistency problem, i.e., determining if there is any concept consistent with a set of positive and negative examples. Indeed, we show that tractable version space reasoning is possible if and only if there is an efficient algorithm for the consistency problem. Our observations give rise to new concept classes for which tractable version space reasoning is now possible, e.g., 1-decision lists, monotone depth two formulas, and halfspaces.

论文关键词：Version spaces,Boundary sets,Consistency problem,Inductive learning

论文评审过程：Received 11 September 2002, Revised 21 March 2003, Accepted 7 April 2003, Available online 10 May 2004.

论文官网地址：https://doi.org/10.1016/j.artint.2003.04.003