Towards a new approach for mining frequent itemsets on data stream
作者:Chedy Raïssi, Pascal Poncelet, Maguelonne Teisseire
摘要
Mining frequent patterns on streaming data is a new challenging problem for the data mining community since data arrives sequentially in the form of continuous rapid streams. In this paper we propose a new approach for mining itemsets. Our approach has the following advantages: an efficient representation of items and a novel data structure to maintain frequent patterns coupled with a fast pruning strategy. At any time, users can issue requests for frequent itemsets over an arbitrary time interval. Furthermore our approach produces an approximate answer with an assurance that it will not bypass user-defined frequency and temporal thresholds. Finally the proposed method is analyzed by a series of experiments on different datasets.
论文关键词:Data streams, Frequent itemsets, Approximate answer
论文评审过程:
论文官网地址:https://doi.org/10.1007/s10844-006-0002-3