A cost model to estimate the effort of data mining projects (DMCoMo)

作者:

Highlights:

摘要

CRISP-DM is the standard to develop Data Mining projects. CRISP-DM proposes processes and tasks that you have to carry out to develop a Data Mining project. A task proposed by CRISP-DM is the cost estimation of the Data Mining project.In software development a lot of methods are described to estimate the costs of project development (SLIM, SEER-SEM, PRICE-S and COCOMO). These methods are not appropriate in the case of Data Mining projects because in Data Mining software development is not the first goal.Some methods have been proposed to estimate some phases of a Data Mining project, but there is no method to estimate the global cost of a generic Data Mining project. The lack of Data Mining project estimation methods is because of many real-life project failures due to the non-realistic estimation at the beginning of the projects.Consequently, in this paper we propose to design and validate a parametric cost estimation model, similar to COCOMO or SLIM in software development, for Data Mining projects (DMCoMo1). The drivers of the model will be proposed first and later the equation of the model will be proposed.

论文关键词:Data Mining,Knowledge discovery,Cost estimation,Parametric model

论文评审过程:Received 26 February 2007, Accepted 7 July 2007, Available online 20 July 2007.

论文官网地址:https://doi.org/10.1016/j.is.2007.07.004