Content management in the SYNDIKATE system – How technical documents are automatically transformed to text knowledge bases
作者:
Highlights:
•
摘要
SYNDIKATE is a family of natural language understanding systems for automatically acquiring knowledge from real-world texts (e.g., information technology test reports, medical finding reports), and for transferring their content to formal representation structures which constitute a corresponding text knowledge base. We present a general system architecture which integrates requirements from the analysis of single sentences, as well as those of referentially linked sentences forming cohesive texts. Properly accounting for text cohesion phenomena is a prerequisite for the soundness and validity of the generated text representation structures. It is also crucial for any information system application making use of automatically generated text knowledge bases in a reliable way, e.g., by inferentially supported fact retrieval.
论文关键词:Natural language processing,Text understanding,Knowledge acquisition from texts
论文评审过程:Received 31 January 2000, Revised 31 January 2000, Accepted 25 May 2000, Available online 6 September 2000.
论文官网地址:https://doi.org/10.1016/S0169-023X(00)00031-8