APRA: An approximate parallel recommendation algorithm for Big Data

作者:

Highlights:

摘要

Finding relevant and interesting items according to the preferences of each user has become an important challenge in the era of Big Data. Recommender systems have emerged in response to this problem. Collaborative Filtering (CF) is one of the most successful recommender systems used by several big online shopping companies. However, CF is computationally demanding, especially in Big Data context, where the number of users and items are too big to be effectively processed by traditional approaches.In this paper, we propose a new solution based on Spark, which is tailored to handle large-scale data and provide better results. Particularly, we take advantage of the in-memory operations available through Spark, to improve the performance of recommendation systems in context of Big Data. Experimental results on two real-world data sets confirm the claim.

论文关键词:Recommender system,Big Data,Apache Hadoop,Apache Spark,Collaborative Filtering

论文评审过程:Received 16 December 2017, Revised 25 March 2018, Accepted 5 May 2018, Available online 17 May 2018, Version of Record 17 June 2018.

论文官网地址:https://doi.org/10.1016/j.knosys.2018.05.006