Finding tendencies in streaming data using Big Data frequent itemset mining
作者:
Highlights:
•
摘要
The amount of information generated in social media channels or economical/business transactions exceeds the usual bounds of static databases and is in continuous growing. In this work, we propose a frequent itemset mining method using sliding windows capable of extracting tendencies from continuous data flows. For that aim, we develop this method using Big Data technologies, in particular, using the Spark Streaming framework enabling distributing the computation along several clusters and thus improving the algorithm speed. The experimentation carried out shows the capability of our proposal and its scalability when massive amounts of data coming from streams are taken into account.
论文关键词:Streaming data,Big Data,Frequent itemset mining,Tendencies
论文评审过程:Received 17 April 2018, Revised 13 September 2018, Accepted 16 September 2018, Available online 1 October 2018, Version of Record 21 November 2018.
论文官网地址:https://doi.org/10.1016/j.knosys.2018.09.026