What is a Knowledge Graph?

James P. McCusker,
Deborah L. McGuinness,
John S. Erickson,
Katherine Chastain

Abstract

Google introduced its Knowledge Graph project in 2012 and has used it to improve query result relevancy and its overall search experience. Google has leveraged existing knowledge graphs, such as DBpedia and Freebase, and has also opened up the process of contributing to the graph by ingesting RDFa and microdata formats from the Web pages it indexes, based on the vocabularies published by schema.org. The success of the Google Knowledge Graph, and its use of semantic technologies, has led to a resurgence in the use of the term in semantic research to describe similar projects. However, the term “knowledge graph” remains underspecified and, in many cases, simply refers to any directed labeled graph. We surveyed and synthesized current literature on knowledge graphs and the historical use of the term. The pre-Semantic Web conceptualization of knowledge graphs provides us with guidance as to what might currently “count” as a knowledge graph and also describes capabilities that do not yet exist in current knowledge graphs. From this synthesis, we propose an updated definition along with a set of knowledge graph requirements. We include an implicit requirement: that knowledge graphs represent knowledge, as opposed to bare assertions with no justification or provenance. We discuss how knowledge graphs, as defined, are a crucial component of the future of the Web and have great potential for transformational change in data science and domain sciences.