A utility-based news recommendation system

作者:

Highlights:

• The news utility model goes beyond click through rate analysis for news recommendations.

• The system is designed based on the MapReduce framework to analyze huge volume of data in parallel.

• News Cold Start Problem has been addressed effectively by a novel probabilistic approach.

• Experiments on 2 billion records demonstrate the effectiveness and efficiency of the system.

摘要

News platforms exhibit both the challenges as well as opportunities for enhancing the functionalities of recommendation systems in today's big data environment. Novel use of big data storage and programming models can improve news recommendation systems through efficient handling and analysis of clickstream data and a better understanding of users' interests. Most existing approaches to news recommendation consider users' clicks as the implicit feedback to understand user behaviors. However, “clicks” may not be an effective indicator of real user interests. We address this problem by developing a novel news recommendation system based on a news utility model. Given the new utility model, we propose a two stage news recommendation framework. The framework first generates article-level recommendation rules based on the utility model, then integrates the notion of utility and probabilistic topic models and generates topic-level recommendation rules. We argue that the proposed utility-based news recommendation system also addresses the news cold start problem which is one of the most challenging obstacles for news agencies. We evaluate the framework on a massive real dataset (two billion records) obtained from a major newspaper (i.e., The Globe and Mail) in Canada and show that it outperforms the existing methods.

论文关键词:News recommendation systems,News utility model,Big clickstream data,MapReduce,Rule mining

论文评审过程:Received 20 July 2018, Revised 5 December 2018, Accepted 5 December 2018, Available online 15 December 2018, Version of Record 20 December 2018.

论文官网地址:https://doi.org/10.1016/j.dss.2018.12.001