Keyword selection and processing strategy for applying text mining to patent analysis
Authors:
Highlights:
• We focused on keyword strategies for applying text-mining to patent data.
• Four factors were evaluated through k-means clustering and entropy values.
• Selecting keywords from abstracts based on TF–IDF is the best keyword selection strategy.
• Using a Boolean expression is the best keyword processing strategy.
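
The pipeline named in the highlights can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: whitespace tokenization, ranking terms by summed TF–IDF, and the size-weighted cluster entropy over known class labels are all assumptions, and the function names (`tfidf_keywords`, `boolean_vectors`, `cluster_entropy`) are hypothetical. The k-means step itself is left out; any clustering routine could consume the Boolean vectors.

```python
import math
from collections import Counter


def tfidf_keywords(docs, top_k):
    """Return the top_k terms ranked by TF-IDF summed over all documents.

    Assumption: documents (e.g. patent abstracts) are tokenized on whitespace.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = Counter()
    for tokens in tokenized:
        for term, count in Counter(tokens).items():
            tf = count / len(tokens)
            idf = math.log(n_docs / df[term])
            scores[term] += tf * idf
    # Highest score first; ties broken alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:top_k]]


def boolean_vectors(docs, keywords):
    """Boolean processing: 1 if the keyword occurs in the document, else 0."""
    vectors = []
    for doc in docs:
        present = set(doc.lower().split())
        vectors.append([1 if kw in present else 0 for kw in keywords])
    return vectors


def cluster_entropy(cluster_ids, class_labels):
    """Size-weighted mean entropy (base 2) of class labels within each cluster.

    Assumption: lower values mean purer clusters with respect to the labels.
    """
    by_cluster = {}
    for cid, label in zip(cluster_ids, class_labels):
        by_cluster.setdefault(cid, []).append(label)
    total = len(cluster_ids)
    entropy = 0.0
    for labels in by_cluster.values():
        size = len(labels)
        h = -sum((c / size) * math.log2(c / size)
                 for c in Counter(labels).values())
        entropy += (size / total) * h
    return entropy
```

In this sketch, `tfidf_keywords` stands in for the keyword selection step and `boolean_vectors` for the keyword processing step; `cluster_entropy` would then score the cluster assignments produced from those vectors against a reference labelling.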
Keywords: Patent analysis, Text-mining, Keyword selection, Keyword processing, Document clustering
Review history: Available online 2 February 2015.
Official URL: https://doi.org/10.1016/j.eswa.2015.01.050