Ontology-based information extraction of regulatory networks from scientific articles with case studies for Escherichia coli

作者:

Highlights:

摘要

The amount of scientific papers in the Molecular Biology field has experienced an enormous growth in the last years, prompting the need of developing automatic Information Extraction (IE) systems. This work is a first step towards the ontology-based domain-independent generalization of a system that identifies Escherichia coli regulatory networks. First, a domain ontology based on the RegulonDB database was designed and populated. After that, the steps of the existing IE system were generalized to use the knowledge contained in the ontology, so that it could be potentially applied to other domains. The resulting system has been tested both with abstract and full articles that describe regulatory interactions for E. coli, obtaining satisfactory results.

论文关键词:Ontology-based information extraction,Knowledge representation,Regulatory networks

论文评审过程:Available online 3 January 2013.

论文官网地址:https://doi.org/10.1016/j.eswa.2012.12.090