Skypattern mining: From pattern condensed representations to dynamic constraint satisfaction problems

作者:

摘要

Data mining is the study of how to extract information from data and express it as useful knowledge. One of its most important subfields, pattern mining, involves searching and enumerating interesting patterns in data. Various aspects of pattern mining are studied in the theory of computation and statistics. In the last decade, the pattern mining community has witnessed a sharp shift from efficiency-based approaches to methods which can extract more meaningful patterns. Recently, new methods adapting results from studies of economic efficiency and multi-criteria decision analyses such as Pareto efficiency, or skylines, have been studied. Within pattern mining, this novel line of research allows the easy expression of preferences according to a dominance relation. This approach is useful from a user-preference point of view and tends to promote the use of pattern mining algorithms for non-experts. We present a significant extension of our previous work [1], [2] on the discovery of skyline patterns (or “skypatterns”) based on the theoretical relationships with condensed representations of patterns. We show how these relationships facilitate the computation of skypatterns and we exploit them to propose a flexible and efficient approach to mine skypatterns using a dynamic constraint satisfaction problems (CSP) framework.

论文关键词:Skypatterns,Pattern mining,Constraint programming,Dynamic CSP,User preferences

论文评审过程:Revised 7 April 2015, Accepted 14 April 2015, Available online 28 April 2015, Version of Record 9 February 2017.

论文官网地址:https://doi.org/10.1016/j.artint.2015.04.003