An Adaptable IE System to New Domains

作者：J. Turmo, N. Català, H. Rodríguez

摘要

The most extended way of acquiring information for knowledge based systems is to do it manually. However, the high cost of this approach and the availability of alternative Knowledge Sources has lead to an increasing use of automatic acquisition approaches. In this paper we present M-TURBIO, a Text-Based Intelligent System (TBIS) that extracts information contained in restricted-domain documents. The system acquires part of its knowledge about the structure of the documents and the way the information is presented (i.e., syntactic-semantic rules) from a training set of these. Then, a database is created by means of applying these syntactic-semantic rules to extract the information contained in the whole document.

论文关键词：information extraction, automatic pattern acquisition, machine learning, EuroWordNet

论文评审过程：

论文官网地址：https://doi.org/10.1023/A:1008332021052