Make your travel smarter: Summarizing urban tourism information from massive blog data

Authors:

Highlights:

Abstract

In this work, we propose a research framework that summarizes tourism information for an unfamiliar city, such as popular tourist locations and the travel sequences (routes) that connect them, from massive travel blog data, with the objective of helping users schedule their trips better. To do this, we first crawl the travel blogs available online for a target city. Then, we transform the textual content of these blogs into a series of word vectors, which form the initial data source. Next, we apply frequent pattern mining to the data to identify the city's popular locations from their sequenced co-occurrences in typical tourism activities; the result can be visualized as a word network. Finally, we develop a max-confidence based method to detect travel routes from the network. We illustrate the benefits of this approach by applying it to data from a blog website run by a Chinese online tourism service company. The results show that the proposed method can efficiently extract popular travel information from massive data.
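The pipeline described in the abstract can be illustrated with a minimal sketch. The abstract gives no algorithmic details, so everything below is an assumption made for illustration: the toy blog data and place names are invented, `build_network` and `greedy_route` are hypothetical helpers, support is counted as the number of blogs containing an adjacent ordered pair of locations, and confidence is taken as count(a then b) / count(a). The greedy walk is a simple stand-in, not necessarily the authors' max-confidence method.

```python
from collections import Counter

# Invented toy input: each blog reduced to the ordered location terms it mentions.
blogs = [
    ["Old Town", "Museum", "Harbor"],
    ["Old Town", "Museum", "Market"],
    ["Museum", "Harbor", "Old Town"],
    ["Old Town", "Harbor"],
]

def build_network(sequences, min_support=2):
    """Build a directed word network from location sequences.

    Returns per-location blog counts and a dict mapping each frequent
    ordered pair (a, b) to its confidence count(a then b) / count(a).
    """
    loc_count = Counter()
    pair_count = Counter()
    for seq in sequences:
        loc_count.update(set(seq))                 # count each location once per blog
        pair_count.update(set(zip(seq, seq[1:])))  # adjacent ordered pairs, once per blog
    edges = {
        (a, b): n / loc_count[a]
        for (a, b), n in pair_count.items()
        if n >= min_support                        # keep only frequent pairs
    }
    return loc_count, edges

def greedy_route(start, edges, max_len=5):
    """Follow the highest-confidence outgoing edge until none remains (assumed heuristic)."""
    route = [start]
    while len(route) < max_len:
        candidates = [
            (conf, b) for (a, b), conf in edges.items()
            if a == route[-1] and b not in route
        ]
        if not candidates:
            break
        route.append(max(candidates)[1])
    return route

loc_count, edges = build_network(blogs)
route = greedy_route("Old Town", edges)
print(route)
```

With this toy data, only the pairs that appear in at least two blogs survive the support threshold, and the route is read off by repeatedly taking the strongest remaining edge from the current location.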

Keywords: Blog mining, Geographic term, Tourist location, Word network, Travel route

Review history: Received 31 May 2015, Revised 27 January 2016, Accepted 23 February 2016, Available online 9 March 2016, Version of Record 27 October 2016.

Official paper URL: https://doi.org/10.1016/j.ijinfomgt.2016.02.009