A tool to discover the main themes in a Spanish or English document

作者:

Highlights:

摘要

While most work on Knowledge Discovery in databases has been concerned with structured databases, there has been little work on handling the huge amount of information that is available only in unstructured textual form. In this paper a system based on information retrieval and text mining methods is presented. In addition, it is shown how the system analyzes a document containing natural language sentences in order to recognize its main topics or themes. The knowledge base used for the system is conformed by trees of concept. The architecture and the main algorithms of the system are discussed in this work.

论文关键词:Text mining,Concept trees,Text analysis,Natural language processing,Knowledge discovering

论文评审过程:Available online 30 November 2000.

论文官网地址:https://doi.org/10.1016/S0957-4174(00)00043-9