Efficient top-k recently-frequent term querying over spatio-temporal textual streams
作者:
Highlights:
• Introduce a location-based time-decaying query to retrieve recently frequent terms within a user specified region of interest, and we propose both exact and approximate algorithms to address it efficiently.
• Introduce the time-weighted term list structure to enable both quad-trees and R-trees to index social post streams.
• Demonstrate how to support fast digestion of social streams with a batch insertion of simultaneous Morton encoding and time-weighted frequency pre-calculation.
• Extensive experimental evaluation was performed to evaluate query response performance and index cost. The results show that our methods are highly efficient in terms of query response time, accuracy and scalability.
摘要
•Introduce a location-based time-decaying query to retrieve recently frequent terms within a user specified region of interest, and we propose both exact and approximate algorithms to address it efficiently.•Introduce the time-weighted term list structure to enable both quad-trees and R-trees to index social post streams.•Demonstrate how to support fast digestion of social streams with a batch insertion of simultaneous Morton encoding and time-weighted frequency pre-calculation.•Extensive experimental evaluation was performed to evaluate query response performance and index cost. The results show that our methods are highly efficient in terms of query response time, accuracy and scalability.
论文关键词:Frequent terms,Time-weighted,Spatio-temporal query,Top-k query,Spatial index,Spatio-temporal textual stream
论文评审过程:Received 18 June 2020, Revised 12 October 2020, Accepted 19 November 2020, Available online 5 December 2020, Version of Record 9 December 2020.
论文官网地址:https://doi.org/10.1016/j.is.2020.101687