Content management in the SYNDIKATE system – How technical documents are automatically transformed to text knowledge bases

作者:

Highlights:

摘要

SYNDIKATE is a family of natural language understanding systems for automatically acquiring knowledge from real-world texts (e.g., information technology test reports, medical finding reports), and for transferring their content to formal representation structures which constitute a corresponding text knowledge base. We present a general system architecture which integrates requirements from the analysis of single sentences, as well as those of referentially linked sentences forming cohesive texts. Properly accounting for text cohesion phenomena is a prerequisite for the soundness and validity of the generated text representation structures. It is also crucial for any information system application making use of automatically generated text knowledge bases in a reliable way, e.g., by inferentially supported fact retrieval.

论文关键词:Natural language processing,Text understanding,Knowledge acquisition from texts

论文评审过程:Received 31 January 2000, Revised 31 January 2000, Accepted 25 May 2000, Available online 6 September 2000.

论文官网地址:https://doi.org/10.1016/S0169-023X(00)00031-8