OntoPlus: Text-driven ontology extension using ontology content, structure and co-occurrence information

作者:

Highlights:

摘要

This paper addresses the process of semi-automatic text-driven ontology extension using ontology content, structure and co-occurrence information. A novel OntoPlus methodology is proposed for semi-automatic ontology extension based on text mining methods. It allows for the effective extension of the large ontologies, providing a ranked list of potentially relevant concepts and relationships given a new concept (e.g., glossary term) to be inserted in the ontology. A number of experiments are conducted, evaluating measures for ranking correspondence between existing ontology concepts and new domain concepts suggested for the ontology extension. Measures for ranking are based on incorporating ontology content, structure and co-occurrence information. The experiments are performed using a well known Cyc ontology and textual material from two domains – finances and, fisheries & aquaculture. Our experiments show that the best results are achieved by combining content, structure and co-occurrence information. Furthermore, ontology content and structure seem to be more important than co-occurrence for our data in the financial domain. At the same time, ontology content and co-occurrence seem to have higher importance for our fisheries & aquaculture domain.

论文关键词:Knowledge engineering methodologies,Ontology extension,Large-scale ontology,Text mining,Semantic technologies

论文评审过程:Received 1 July 2010, Revised 20 April 2011, Accepted 1 June 2011, Available online 12 June 2011.

论文官网地址:https://doi.org/10.1016/j.knosys.2011.06.002