Context Graph vs Knowledge Graph: Same Shape, Different Scope

By DataHub with Expert Insights from Sat Duggal

04.29.26

TL;DR

Context graphs and knowledge graphs share the same underlying structure: Entities connected by typed relationships. Most attempts to draw a sharper line between them invent a distinction that does not exist.

The real difference is scope and grounding. A knowledge graph is usually a world-model, capturing what is generally true about a domain. A context graph is an organization-model, grounded in one company’s data assets, owners, decisions, and institutional knowledge.

DataHub’s context platform is built on a graph that started as a metadata graph and has extended into a context graph, making it the structure underneath impact analysis, lineage traversal, and grounded reasoning for AI agents to access and update.

If you have spent any time in the data infrastructure or AI agent conversation lately, you’ve probably noticed that “context graph” and “knowledge graph” are starting to get used interchangeably. Sometimes by the same person, in the same sentence.

Most of the time, the people using them that way are right: The two terms describe structurally similar things, and most attempts to draw a sharp line between them end up inventing a distinction that does not actually exist.

The honest reality is that context graphs and knowledge graphs share the same underlying shape. But there is a meaningful difference: Scope and grounding.

A knowledge graph models entities and semantic relationships in a domain to support general reasoning.
A context graph models a specific organization’s data, decisions, and institutional knowledge to support reasoning grounded in that organization.

Once you see the difference, the vocabulary stops being the interesting part of the question.

What is a knowledge graph?

Quick definition: What is a knowledge graph?

A knowledge graph is a structured representation of entities and the typed relationships between them, usually grounded in a shared ontology. It is used to model what is generally true about a domain, so that humans and machines can reason over those facts.

The term comes from the semantic web tradition. Its lineage runs through RDF, OWL, SPARQL, and the linked data movement that grew up around the W3C in the early 2000s. The concept hit mainstream awareness in 2012, when Google introduced its Knowledge Graph as the structure behind the answer panels that started appearing alongside search results. Wikidata, schema.org, and a generation of biomedical and enterprise knowledge graphs came out around the same time.

What unites all of these is the structural pattern. A knowledge graph stores entities (a person, a company, a product, a city) as nodes, and connects them through typed semantic relationships (works at, manufactured by, headquartered in). Most knowledge graphs are grounded in an ontology that defines what kinds of entities and relationships are allowed. The classic job of a knowledge graph is to answer questions about what is generally true: What is a customer, what cities are in Canada, which companies belong to which industries.

That generality is the key. A traditional knowledge graph is usually a world-model. It tries to capture facts about a domain in a way that holds across organizations, applications, and contexts.

What is a context graph?

Quick definition: What is a context graph?

A context graph is an entity-relationship graph grounded in a specific organization’s data ecosystem. It includes structural metadata about assets, pipelines, and ownership, and extends to the institutional knowledge, business definitions, and documentation that explains why those assets exist and how they should be used.

Structurally, a context graph looks the same as a knowledge graph. Entities and typed relationships, often grounded in a schema. The shape is not what makes it different. What makes it different is what is in it and where it is grounded.

A context graph is built around a specific organization’s data ecosystem and the enterprise context that surrounds it. Its nodes are not generic entities like “company” or “city” in the abstract. They are the actual data assets, pipelines, dashboards, models, glossary terms, owners, and teams that exist inside one company.

And it goes further than structural metadata. A well-maintained context graph extends into knowledge management at scale: the documents that explain those assets, the decisions that produced them, the discussions that shaped how they should be used.

We’ve written a longer piece on what a context graph is and what goes into building one. The short version, for this post, is that a context graph is the graph an organization builds about itself.

Why the ‘context graph’ emerged when it did

The language did not appear out of nowhere. It surfaced because general-purpose knowledge graphs were not enough for the use cases that AI agents are now creating inside specific businesses.

An AI agent answering a question like “which dashboard should I trust for Q3 revenue” does not need a model of what a dashboard is in general. It needs to know which dashboards exist in your warehouse, which ones are certified, which team owns them, when they were last refreshed, and what known issues your data engineers flagged in Slack last week. That is grounded, organization-specific reasoning, and a world-model knowledge graph cannot provide it.

The “context” framing surfaced because that is what was missing. It is also a useful corrective to the assumption that context for an AI agent means stuffing more documents into a retrieval pipeline. As Kurt Cagle argued recently in The Ontologist, a context graph is structured knowledge that needs to be deterministic and governable, not a vibes-based retrieval blob. LLMs can help build it, but the graph itself is infrastructure.

We have written more on this distinction in retrieval augmented generation (RAG) vs context management.

The confusion: Why this has become a hotly debated distinction

Both definitions on the table now describe entity-relationship graphs. The graph structure is identical. So why are people arguing about it?

The cleanest explanation is that the two terms come from different communities that ended up in the same room:

“Knowledge graph” is the AI and semantic web community’s term, with a couple of decades of familiarity behind it.
“Context graph” is newer, and it is surfacing from the LLM and data infrastructure crowd.

When an ML team says “knowledge graph” and a data platform team says “context graph,” they often mean structurally similar things. They are just using the vocabulary their team has been using.

Cagle, who has spent years in the semantic web world, puts it about as plainly as anyone has: Context graphs are largely what the AI community has been calling knowledge graphs all along, just with the vocabulary catching up to a new set of use cases. We think that is the right read of the structural question. Where it gets interesting is not whether the structures are the same, but whether the jobs they are doing are the same. They are not.

The real distinction: Scope and grounding

Here is the distinction that actually matters, and that most discussions keep missing.

The difference between a context graph and a knowledge graph is not graph technology, schema choice, query language, or generational hierarchy. It is scope and grounding.

A traditional knowledge graph is a world-model: It answers questions about what is generally true about a kind of thing. Wikidata can tell you what a city is, which countries border France, and what year a film was released. An enterprise knowledge graph can tell you which suppliers serve which industries in general. These are facts that hold across organizations.
A context graph is an organization-model: It answers questions about what is true about this organization’s data, right now, and why it exists the way it does. It does not try to model the world. It tries to model one company’s data ecosystem and the institutional knowledge wrapped around it.

Both kinds of graphs are valid. They serve different jobs. The vocabulary tracks the community that built each tradition more than it tracks any deep architectural difference. If you want to call your context graph a knowledge graph, nobody is going to stop you. The question worth asking is not which word to use. It is whether the graph you have is grounded in the world or in your organization, and whether that grounding matches the job you need it to do.

What a context graph enables that a generic knowledge graph doesn’t

Once the scope question is settled, the practical consequences fall out naturally. Here is what a context graph adds that a world-model knowledge graph, on its own, cannot do.

Impact analysis grounded in your actual systems: Before you change a column in a source table, you can traverse the graph to see which downstream pipelines, dashboards, models, and reports depend on it. A general-purpose knowledge graph cannot tell you that, because it does not know your pipelines exist.
Root cause investigation across your stack: When a metric looks wrong, you can trace it backward through the specific transformations, joins, and sources that produced it. The graph knows the path because the path is part of the graph.
Reasoning that AI agents can actually rely on: An agent working inside your business needs to know which assets are trusted, which are deprecated, which documents explain a metric definition, and who owns the answer when it does not. That is grounded reasoning over your meaning and your trust signals, not pattern matching against general world knowledge.
Onboarding for humans and AI tools alike: A new hire and a new agent both face the same problem on day one: most of the answers they need used to live in the heads of specific people. A context graph captures that institutional knowledge in a structure both can query.
These are not hypothetical use cases: They are the everyday work of data teams, and they are exactly the work that breaks down when the only thing your graph knows is structural metadata.

Where DataHub fits

DataHub is built as a graph for your enterprise data. Every asset, team, pipeline, glossary term, business definition, and lineage edge is a connected node. That graph started its life as a metadata graph, focused on the technical relationships between data assets. It has now extended into a context graph by bridging structured technical metadata with the unstructured institutional knowledge that surrounds it, ingesting external knowledge sources like Confluence pages, Notion docs, and Context Documents and connecting them to the assets they describe.

We go deeper on the technical side of how that graph works in our post on the metadata knowledge graph.

The relevant point here is just that the abstract argument has a working instantiation: A context graph is not a thought experiment. It is the structure underneath every impact analysis query, every lineage traversal, and every grounded answer an agent gives when it is reasoning about your data.

But the question worth spending time on is not which word to use. It is whether your graph is grounded in the world or in your organization, and whether the grounding matches what your business and your AI agents actually need. If you are building something that needs to answer questions about your data, your decisions, and your meaning, you are building a context graph, regardless of what your team calls it.

FAQs

Mostly, yes. In an enterprise data architecture, a context graph and a knowledge graph often describe the same underlying semantic network. The distinction is primarily functional. A knowledge graph is the foundation that connects entities and relationships into a single queryable structure, while a context graph describes the application of that foundation to a specific organization’s data, decisions, and institutional knowledge, including for the job of grounding AI agents. If you scope a knowledge graph tightly enough to one company’s assets, owners, and documentation, you have built a context graph.

In practice, the difference shows up in what the graph can do for you. A general-purpose knowledge graph can tell you what a customer is in the abstract. A context graph can tell you which customer table your finance team trusts, when it was last refreshed, who owns it, and which downstream dashboards will break if you change it. The first is reference knowledge. The second is operational context for the work your team is actually doing

For agents working inside a specific business, a context graph is what they need. A world-model knowledge graph cannot tell an agent which of your dashboards to trust, which metric definitions are current, or which data sources have known issues. That kind of grounded, organization-specific reasoning is exactly what a context graph is built for. The structured context it provides is the foundation that makes agent answers reliable instead of plausible.

No. GraphRAG is a retrieval pattern that uses a graph to improve how an LLM finds relevant information. A context graph is the underlying structure being retrieved from. You can use GraphRAG techniques on top of a context graph, but the two are not interchangeable. The context graph is infrastructure. GraphRAG is one of the things you can do with it.

A data catalog is one of the things you can use to build a context graph, but the graph is the more general structure. A traditional data catalog focuses on inventorying assets and their metadata. A context graph extends that inventory with the relationships, documents, decisions, and institutional knowledge that explain what the assets mean and why they exist. If a catalog tells you what data you have, a context graph tells you what your data means and how to use it.

Yes, and many organizations effectively do. If you already have a knowledge graph that models your domain, you can extend it with the organization-specific entities, relationships, and institutional knowledge that turn it into a context graph. The work is less about replacing what you have and more about narrowing the scope, grounding the graph in your specific data ecosystem, and connecting it to the documents and decisions that explain your data.

There is no single answer. Context graphs have been built on RDF stacks, on property graph databases like Neo4j, and on custom storage layers optimized for specific workloads. The graph database is an implementation choice, not a definitional one. What matters more is whether the graph is grounded in your organization, whether it captures the institutional knowledge around your assets, and whether it can support the queries your teams and your agents need to run.

Context graphs work for AI agents by giving them a grounded, organization-specific structure to reason over and write back to. Instead of pattern matching against general world knowledge, an agent can traverse the graph to find which data assets exist in the company, which are trusted, and which documents and decisions explain how those assets should be used. The result is an answer the agent can defend, not just one it can plausibly generate.