DBkWik: extracting and integrating knowledge from thousands of Wikis

作者:Sven Hertling, Heiko Paulheim

摘要

Popular cross-domain knowledge graphs, such as DBpedia and YAGO, are built from Wikipedia, and therefore similar in coverage. In contrast, Wikifarms like Fandom contain Wikis for specific topics, which are often complementary to the information contained in Wikipedia, and thus DBpedia and YAGO. Extracting these Wikis with the DBpedia extraction framework is possible, but results in many isolated knowledge graphs. In this paper, we show how to create one consolidated knowledge graph, called DBkWik, from thousands of Wikis. We perform entity resolution and schema matching, and show that the resulting large-scale knowledge graph is complementary to DBpedia. Furthermore, we discuss the potential use of DBkWik as a benchmark for knowledge graph matching.

论文关键词:Knowledge graph creation, Information extraction, Linked open data, Knowledge graph matching

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-019-01415-5