What’s Next for DataHub?
A Sneak Peek into the 2025 Roadmap DataHub
But there’s a lot more to come. In this post, we’ll review the key features and initiatives in our 2025 DataHub roadmap.
Our focus for the year ahead will center around four core areas: data discovery, data observability, data governance, and improving the developer experience within DataHub’s platform.

1. Data Discovery
In 2025, we’re focusing on making data discovery more intuitive, insightful, and robust. Our goal is to streamline how users find, understand, and trust their data through:
- Human-centered Insights: Enhancing the way metadata, annotations, and discussions are integrated to capture and surface human-generated context
- Intelligent Exploration: Improving search and navigation to surface the most relevant, well-documented, and widely used assets.
- A Metrics Catalog: Enabling teams to register, associate, and document key metrics directly within DataHub.

- End-to-end Lineage: Enhancing end-to-end lineage to focus on deeper integrations with dashboarding, analytics, and AI/ML tools, along with hierarchical lineage (at the container, domain, data product platform level) for users to zoom out and analyze dependencies at a macro level.

Potential Data Discovery Features on the Horizon
We’re also evaluating several new capabilities based on community feedback, including:
- Data Explorer: Browse and query live data directly within DataHub.
- Discussions: Capture tribal knowledge about data assets via threaded discussions on asset profiles.
- Asset Announcements: Pin key updates or alerts to data assets.
- Expanded integrations: Support for Segment, Amplitude, Mixpanel, Hightouch, and Census.
Have suggestions to improve data discovery in DataHub? Share your ideas with us in our feature request portal.
2. Data Governance
In our focus on making governance more seamless, scalable, and actionable, we’re doubling down on three key areas:
- A Universal Data Registry: Ensuring complete visibility into every dataset, model, transformation, and dashboard so that DataHub remains the go-to place for discovering and understanding all your data assets in one unified view.
- Centralized Compliance: Building DataHub as a compliance hub, tracking ownership, segment monitoring, GDPR documentation, PII classification, and sensitivity tagging — ensuring that data is not only well-managed but also meets regulatory and organizational standards.
- Policy Enforcement: Introducing logical datasets and parent-child asset relationships so you can manage metadata for multiple physical assets in one place. This approach allows you to define your metadata once and automatically apply rich context — such as documentation, tags, and annotations — across all relevant assets, ensuring consistency wherever your data is stored.

Taking a Zoomed-Out Look at Data Governance
But governance doesn’t stop within DataHub. You might be familiar with the DataHub Actions Framework that enables you to automate tasks and workflows in response to metadata changes.
We’re expanding this framework to push tags, glossary terms, and classifications back to source systems. This will make tag-based policy enforcement more efficient — you can define policies centrally in DataHub and let them be applied where the data actually lives.

Other Potential Features for Data Governance
We’re also exploring additional governance enhancements, including:
- Metadata Completion Analytics v2: View historical trends for ownership, documentation, term, and data product completion.
- Glossary Synonyms: Link glossary terms to other synonym terms. In search, enable searching matching terms and synonyms of the term.
If you have ideas to improve governance in DataHub, we’re all ears — share your ideas on our feature request portal.
3. Data Observability
To make data quality more accessible, collaborative, and contextual across the organization, we want to ensure that data reliability is a shared responsibility — not just a concern for data engineers.
With these goals in mind, our data governance efforts will focus on:
- Redesigned DataHub Assertions Experience: A new interface for searching, filtering, and grouping data quality assertions, along with rich historical context to track assertion runs and runtime details.

- Enhanced Incident Management: Expanding incident tracking with priorities, stages, assignees, and activity history, so teams can track issues from detection to resolution.
- Asset & Column Statistics: Providing insights into table usage, key statistics, change history, and update patterns — so you can quickly assess data health at a glance.
Candidate Features for Advancing Data Observability
We’re also considering further enhancements, such as:
- Custom Asset Metrics: Allowing users to define and monitor custom quality metrics for specific assets.
- New Observability Integrations: Ingesting monitor and assertion status from Soda.io, Monte Carlo, Anomalo, and more to centralize data quality insights.
- PagerDuty Integration: Using the DataHub Actions framework to connect DataHub incidents with PagerDuty for seamless incident resolution workflows.
Think this list needs more/other features? Let us know what you think.
4. Platform Layer Improvements
Beyond our core verticals, we’re also investing in foundational improvements with a focus on:
- Developer Experience: Expanding APIs and SDKs to make metadata registration and retrieval more seamless via API.
- Quality of Service: Ensuring a smooth user experience even during heavy ingestion jobs and high system loads.
- Audit Logging & Tracing: Providing better visibility into key operations and user activity to enhance governance and security.
Two other impactful projects include:
- Python SDK v2: Incorporating community feedback to deliver higher-level APIs for registering, enriching, and retrieving data assets, to reduce time-to-value for developers.
- Service Accounts: Making it easier to create and manage service users within DataHub’s UI, enabling more secure programmatic workflows and automation.
Tell Us What You Think!
We would love your input as we refine these features. If there are specific enhancements that would improve your experience, please head over to the DataHub feature request portal and help shape the future of DataHub in 2025.
The journey ahead looks incredibly exciting, and we’re truly grateful to have you along for the ride!
Connect with DataHub
Join us on Slack • Sign up for our Newsletter • Subscribe to our Calendar
