Statistical Themes and Lessons for Data Mining

作者:Clark Glymour, David Madigan, Daryl Pregibon, Padhraic Smyth

摘要

Data mining is on the interface of Computer Science andStatistics, utilizing advances in both disciplines to make progressin extracting information from large databases. It is an emergingfield that has attracted much attention in a very short period oftime. This article highlights some statistical themes and lessonsthat are directly relevant to data mining and attempts to identifyopportunities where close cooperation between the statistical andcomputational communities might reasonably provide synergy forfurther progress in data analysis.

论文关键词:statistics, uncertainty, modeling, bias, variance

论文评审过程:

论文官网地址:https://doi.org/10.1023/A:1009773905005