Tracking geographical locations using a geo-aware topic model for analyzing social media data
作者:
Highlights:
• Online discussions can be tracked geographically over time.
• A distributed geo-aware streaming LDA (SLDA) model has been developed.
• Evaluation was performed during the 2016 American presidential primary elections.
• It was shown that locations correlated with the actual election locations.
• SLDA was shown to be an effective model for tracking topics and geographical trends.
摘要
Tracking how discussion topics evolve in social media and where these topics are discussed geographically over time has the potential to provide useful information for many different purposes. In crisis management, knowing a specific topic's current geographical location could provide vital information to where, or even which, resources should be allocated. This paper describes an attempt to track online discussions geographically over time. A distributed geo-aware streaming latent Dirichlet allocation model was developed for the purpose of recognizing topics' locations in unstructured text. To evaluate the model it has been implemented and used for automatic discovery and geographical tracking of election topics during parts of the 2016 American presidential primary elections. It was shown that the locations correlated with the actual election locations, and that the model provides a better geolocation classification compared to using a keyword-based approach.
论文关键词:Social media,Topic modeling,Geo-awareness,Trend analysis,Latent Dirichlet allocation,Streaming media
论文评审过程:Received 20 September 2016, Revised 13 April 2017, Accepted 4 May 2017, Available online 15 May 2017, Version of Record 26 June 2017.
论文官网地址:https://doi.org/10.1016/j.dss.2017.05.006