I-TWEC: Interactive clustering tool for Twitter

Authors:

Highlights:

• Open-source, scalable Twitter clustering tool for technical and nontechnical experts.

• Interactive clustering with a web interface for lexical and semantic clustering.

• Suffix tree index structure for fast clustering.

• Clusters 60K tweets in 23.8 to 25.5 s and 1 million tweets in 1500 s.
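
To make the idea of lexical, LCS-based grouping concrete, here is a minimal Python sketch. It is not the I-TWEC implementation: it assumes "LCS" means the longest common run of consecutive words, it computes that with plain dynamic programming instead of a suffix tree index, and the names `lcs_length`, `similarity`, `cluster`, and the `threshold` value are all hypothetical choices made for illustration.

```python
from typing import List, Tuple


def lcs_length(a: List[str], b: List[str]) -> int:
    """Length of the longest run of consecutive tokens shared by a and b."""
    best = 0
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                # Extend the common run that ended at the previous tokens.
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def similarity(a: List[str], b: List[str]) -> float:
    """Shared run length normalized by the longer text (assumed measure)."""
    if not a or not b:
        return 0.0
    return lcs_length(a, b) / max(len(a), len(b))


def cluster(tweets: List[str], threshold: float = 0.5) -> List[List[str]]:
    """Greedy single pass: join the first cluster whose representative
    (its first tweet) is similar enough, otherwise start a new cluster."""
    clusters: List[Tuple[List[str], List[str]]] = []
    for tweet in tweets:
        tokens = tweet.lower().split()
        for representative, members in clusters:
            if similarity(tokens, representative) >= threshold:
                members.append(tweet)
                break
        else:
            clusters.append((tokens, [tweet]))
    return [members for _, members in clusters]
```

This pairwise comparison costs time proportional to the product of the two text lengths for every pair it checks, which is the kind of cost an index structure such as a suffix tree is meant to avoid on large collections.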


Keywords: Tweet clustering, Short text clustering, Suffix tree based clustering, LCS based clustering

Article history: Received 15 May 2017, Revised 28 November 2017, Accepted 29 November 2017, Available online 1 December 2017, Version of Record 1 December 2017.

Paper URL: https://doi.org/10.1016/j.eswa.2017.11.055