Combination of genetic network programming and knapsack problem to support record clustering on distributed databases

作者:

Highlights:

• A decision support algorithm for record clustering in databases is proposed.

• Capacity limitation problem is introduced to make a general clustering application.

• Rule extraction from datasets is realized by the proposed evolutionary algorithm.

• Rule clustering considering capacity limitation is solved by knapsack problem.

• The simulations of record clustering show some advantages of the proposed method.

摘要

•A decision support algorithm for record clustering in databases is proposed.•Capacity limitation problem is introduced to make a general clustering application.•Rule extraction from datasets is realized by the proposed evolutionary algorithm.•Rule clustering considering capacity limitation is solved by knapsack problem.•The simulations of record clustering show some advantages of the proposed method.

论文关键词:Genetic network programming,Database clustering,Knapsack problem,Record clustering

论文评审过程:Received 14 August 2014, Revised 30 September 2015, Accepted 1 October 2015, Available online 19 October 2015, Version of Record 3 November 2015.

论文官网地址:https://doi.org/10.1016/j.eswa.2015.10.006