Keyword selection and processing strategy for applying text mining to patent analysis

Authors:

Highlights:

• We focused on keyword strategies for applying text-mining to patent data.

• Four factors were evaluated through k-means clustering and entropy values.

• Using an abstract based on TF–IDF represents the best keyword selection strategy.

• Using a Boolean expression represents the best keyword processing strategy.
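
The highlights above name three building blocks: TF–IDF keyword selection from abstracts, Boolean keyword processing, and entropy as a clustering quality measure. The following is a minimal sketch of how these could fit together, not the authors' implementation. The toy abstracts, the function names (`tfidf_keywords`, `boolean_vectors`, `clustering_entropy`), the whitespace tokenizer, and the particular TF–IDF and entropy formulas are all assumptions made for illustration. The cluster labels are assumed to come from a separate k-means run, which is not shown here.

```python
import math
from collections import Counter

def tfidf_keywords(docs, top_k):
    """Rank terms by TF-IDF summed over all documents; return the top_k terms."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = Counter()
    for tokens in tokenized:
        for term, count in Counter(tokens).items():
            idf = math.log(n_docs / df[term])
            scores[term] += (count / len(tokens)) * idf
    # Sort by score (descending), then alphabetically so ties are deterministic.
    return sorted(scores, key=lambda t: (-scores[t], t))[:top_k]

def boolean_vectors(docs, keywords):
    """Boolean processing: 1 if the keyword occurs in the document, else 0."""
    vectors = []
    for doc in docs:
        terms = set(doc.lower().split())
        vectors.append([1 if kw in terms else 0 for kw in keywords])
    return vectors

def clustering_entropy(cluster_labels, true_classes):
    """Size-weighted mean entropy (in bits) of the class mix inside each cluster."""
    n = len(cluster_labels)
    total = 0.0
    for cluster in set(cluster_labels):
        members = [cls for lab, cls in zip(cluster_labels, true_classes) if lab == cluster]
        counts = Counter(members)
        h = -sum((k / len(members)) * math.log2(k / len(members)) for k in counts.values())
        total += (len(members) / n) * h
    return total

# Hypothetical toy abstracts from two technology areas (illustration only).
abstracts = [
    "battery electrode lithium cell",
    "lithium battery anode cell",
    "antenna signal wireless receiver",
    "wireless antenna receiver signal",
]
keywords = tfidf_keywords(abstracts, top_k=4)
vectors = boolean_vectors(abstracts, keywords)

# Labels as a k-means run might return them (assumed, not computed here).
pure = clustering_entropy([0, 0, 1, 1], ["A", "A", "B", "B"])
mixed = clustering_entropy([0, 0, 1, 1], ["A", "B", "A", "B"])
```

In this sketch a lower entropy means each cluster is dominated by a single class, so `pure` is lower than `mixed`. The Boolean vectors would be the input to the clustering step.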

Abstract

• We focused on keyword strategies for applying text-mining to patent data.

• Four factors were evaluated through k-means clustering and entropy values.

• Using an abstract based on TF–IDF represents the best keyword selection strategy.

• Using a Boolean expression represents the best keyword processing strategy.

Keywords: Patent analysis, Text-mining, Keyword selection, Keyword processing, Document clustering

Review history: Available online 2 February 2015.

Official paper URL: https://doi.org/10.1016/j.eswa.2015.01.050