Performance of latent factor models with extended linear biases

作者:

Highlights:

摘要

High-dimensional and sparse (HiDS) matrices are frequently encountered in various industrial applications, due to the exploding number of involved entities and great needs to describe the relationships among them. Latent factor (LF) models are highly effective and efficient in extracting useful knowledge from these HiDS matrices. They well represent the known data of a HiDS matrix with high computational and storage efficiency. When building an LF model, the incorporation of linear biases has proven to be effect in further improving its performance on HiDS matrices in many applications. However, prior works all propose to assign a single bias to each entity, i.e., a single bias for each user/movie from a user-movie HiDS matrix. In this work we argue that to extend the linear biases, i.e., to assign multiple biases to each involved entity, can further improve an LF model's performance in some applications. To verify this hypothesis, we first extended the linear biases of an LF model, and then deduced the corresponding training rule of involved LFs. Subsequently, we conducted experiments on ten HiDS matrices generated by different industrial applications, evaluating the resulting LF models’ prediction accuracy for the missing data of involved HiDS matrices. The experimental results indicate that on most testing cases an LF model needs to extend its linear biases to achieve the highest prediction accuracy. Hence, the number of linear biases should be chosen with care to make an LF model achieve the best performance in practice.

论文关键词:Big data,Recommender system,Latent factor model,Linear bias,Industrial application

论文评审过程:Received 19 February 2016, Revised 6 February 2017, Accepted 7 February 2017, Available online 14 February 2017, Version of Record 27 March 2017.

论文官网地址:https://doi.org/10.1016/j.knosys.2017.02.010