Alike people, alike interests? Inferring interest similarity in online social networks

作者:

Highlights:

• Pose a practical research problem: how to infer the interest similarity of two users where we do not know one of the user's interests.

• Reveal the homophily of interest similarity with respect to various social features, relying on a large Facebook dataset (479,048 users and 5,263,351 user-generated interests).

• Devise an interest similarity prediction model based on the learned social features.

• A recommendation system for new users is illustrated to show the practicality of the proposed interest similarity prediction model.

摘要

Understanding how much two individuals are alike in their interests (i.e., interest similarity) has become virtually essential for many applications and services in Online Social Networks (OSNs). Since users do not always explicitly elaborate their interests in OSNs like Facebook, how to determine users' interest similarity without fully knowing their interests is a practical problem. In this paper, we investigate how users' interest similarity relates to various social features (e.g. geographic distance); and accordingly infer whether the interests of two users are alike or unalike where one of the users' interests are unknown. Relying on a large Facebook dataset, which contains 479,048 users and 5,263,351 user-generated interests, we present comprehensive empirical studies and verify the homophily of interest similarity across three interest domains (movies, music and TV shows). The homophily reveals that people tend to exhibit more similar tastes if they have similar demographic information (e.g., age, location), or if they are friends. It also shows that the individuals with a higher interest entropy usually share more interests with others. Based on these results, we provide a practical prediction model under a real OSN environment. For a given user with no interest information, this model can select some individuals who not only exhibit many interests but also probably achieve high interest similarities with the given user. Eventually, we illustrate a use case to demonstrate that the proposed prediction model could facilitate decision-making for OSN applications and services.

论文关键词:Social networks,Interest similarity,Homophily,Prediction model

论文评审过程:Received 16 April 2014, Revised 22 September 2014, Accepted 30 November 2014, Available online 8 December 2014.

论文官网地址:https://doi.org/10.1016/j.dss.2014.11.008