Ontology-driven discourse analysis for information extraction

作者:

Highlights:

摘要

This paper presents a novel approach to discourse analysis within information extraction systems. It makes use of DRT as formal representation of the linguistic context as well as of a domain-specific ontology as a basis to compute conceptual relations between extracted events thus establishing discourse coherence. The approach has been implemented within GenIE, an information extraction system with the aim of extracting information about biochemical pathways, about sequences, structures and functions of genomes and proteins. The approach is evaluated against a semantically hand-annotated set of Swiss-Prot protein function descriptions and shows very promising results.

论文关键词:Information extraction,Discourse analysis,Event ontology,Biomedical NLP

论文评审过程:Received 4 November 2004, Accepted 4 November 2004, Available online 21 December 2004.

论文官网地址:https://doi.org/10.1016/j.datak.2004.11.009