Chime Connects Teams with Data Lineage

INDUSTRY

Financial Technology

SIZE

1,400+ employees

DATA STACK

Snowflake, Spark, Flink, Airflow, Protocol Buffer schema

SOLUTION

DataHub Cloud

USE CASE

Metadata Management, Discovery, Lineage, Governance

GOALS

Eliminate organizational silos between data producers (product engineers) and data consumers (analysts)
Establish clear data ownership and accountability across teams
Create a common “water cooler” for cross-team data collaboration
Implement shift-left data quality practices by collecting metadata at the source

See what DataHub Cloud can do for your team

Meet With Us

The Topline

Challenge
Siloed teams where data producers and consumers weren’t communicating, leading to hidden data issues that impacted business insights

Solution
Implemented DataHub Cloud as a central platform with X-platform lineage, ownership tracking, and proactive data quality monitoring

Impact
Improved collaboration, ownership, and seamless integration of metadata with a centralized, scalable platform

Note: This story was originally published October 2023.

Challenge

As Chime scaled, so did its data and the complexity of managing it. Multiple teams were managing data independently, with an ever-increasing number of tools adding to the complexity.

Business-critical metrics were scattered across systems. Source-of-truth datasets were hidden or duplicated. And when dashboards broke, no one knew whether it was a real business issue or just bad data.

The root cause? A disconnect between the teams producing data and the teams using it.

“In a lot of organizations, the producers (product engineering) and consumers (analytics teams) are in separate orgs… Because these two groups are not talking to each other, there are a lot of problems related to consumer expectations, producers not knowing how their data is being used, and so on and so forth.”

— Sherin Thomas, Software Engineer, Chime

Chime needed a unified approach to data management; one that would streamline workflows, improve data discovery, and foster better collaboration across teams.

Solution

To break down silos and restore trust in their data, Chime implemented DataHub Cloud as its centralized discovery and governance platform.

Instead of making data engineers the middlemen between data consumers and producers, Chime brought everyone (engineers, PMs, analysts, BI teams) into DataHub Cloud. In doing so, they established a “water cooler” for all data stakeholders: a shared space where everyone could access, contribute to, and collaborate around metadata.

A key reason for this approach’s effectiveness lies in DataHub’s X-platform lineage functionality. It bridges the gap between data producers and consumers. With lineage, producers can see exactly who is using their data and how. Consumers, on the other hand, can trace where the data comes from and how it’s been transformed. Besides ensuring transparency, this also simplifies accountability. Everyone knows what’s happening and why.

“My favorite part about DataHub is the lineage because this is one really easy way of connecting the producers to the consumers. Now the producers know who is using their data. Consumers know where the data is coming from. And it is easier to have accountability mechanisms.”

— Sherin Thomas, Software Engineer, Chime

Chime also embraced a shift-left approach to collecting metadata at the source. Using DataHub SDKs, vital context like schema definitions, documentation, and tags are transformed into searchable glossary terms, tags, and descriptions, making it accessible across teams. Effectively making metadata a first-class citizen at Chime.

My favorite part about DataHub is the lineage because this is one really easy way of connecting the producers to the consumers. Now the producers know who is using their data. Consumers know where the data is coming from. And it is easier to have accountability mechanisms.

SHERIN THOMAS

Software Engineer, Chime

Impact

With DataHub Cloud, Chime manages data across the organization, improving collaboration, efficiency, and data quality.

Key outcomes included:

Centralized platform for all teams, ensuring alignment and better collaboration
Enhanced visibility with lineage to track data flows and quickly spot issues
Streamlined metadata management using crowdsourced metadata ingestion and schema integration
Clear ownership and accountability by designating data stewards for each dataset and enforcing ownership policies
Proactive data quality monitoring with assertions to set and monitor data quality standards, automatically detecting issues early

“Now our engineers, PMs, analysts, and BI folks, everybody is using the same tool … They can just look at the lineage, and they can find if there is any node that has an active incident there, find their owners, and reach out to them.”

— Sherin Thomas, Software Engineer, Chime

Start your own success story with DataHub

Meet with us

See how DataHub Cloud can support enterprise needs and accelerate your journey toward context-rich, AI-ready data. Request a custom demo.

Join our open source community

Explore the project, contribute ideas, and connect with thousands of practitioners in the DataHub Slack community.

Chime’s Data Now Works in Harmony With Their Teams