Using social network and semantic analysis to analyze online travel forums and forecast tourism demand

作者:

Highlights:

• We use social network and semantic analysis to analyze the TripAdvisor travel forum.

• We collect about 2,660,000 posts and study the interactions of about 147,100 users.

• We use online big data to forecast tourist arrivals in 7 European capital cities.

• We present new measures which can be integrated into traditional forecasting models.

• Best predictors are language complexity and the presence of eminent contributors.

摘要

Forecasting tourism demand has important implications for both policy makers and companies operating in the tourism industry. In this research, we applied methods and tools of social network and semantic analysis to study user-generated content retrieved from online communities which interacted on the TripAdvisor travel forum. We analyzed the forums of 7 major European capital cities, over a period of 10 years, collecting more than 2,660,000 posts, written by about 147,000 users. We present a new methodology of analysis of tourism-related big data and a set of variables which could be integrated into traditional forecasting models. We implemented Factor Augmented Autoregressive and Bridge models with social network and semantic variables which often led to a better forecasting performance than univariate models and models based on Google Trend data. Forum language complexity and the centralization of the communication network – i.e. the presence of eminent contributors – were the variables that contributed more to the forecasting of international airport arrivals.

论文关键词:Tourism forecasting,Social network analysis,Semantic analysis,Online community,Text mining,Big data

论文评审过程:Received 24 January 2019, Revised 9 June 2019, Accepted 10 June 2019, Available online 12 June 2019, Version of Record 15 July 2019.

论文官网地址:https://doi.org/10.1016/j.dss.2019.113075