Humans of DataHub: Liu Xianglong

We are excited to share our fourth installment of Humans of DataHub. This week we are joined by Liu Xianglong, of the Centre for Strategic Infocomm Technologies, Singapore, where he is a Data Platform Engineer.

Liu Xianglong (@xl on DataHub Slack)

How did you first learn about DataHub?

“We were evaluating open-source data discovery solutions and DataHub was one of the projects we found.”

What do you enjoy most about the DataHub Community?

“Compared to some of the other open-source projects that I follow, I like the fact that the community is very active and I can get help and advice from the developers and other users very quickly.”

What has DataHub enabled within your organization?

“We plan to use DataHub to consolidate data definitions since it has the ability to accept metadata from a wide range of data sources as well as allow developers to do custom implementations.”

What are you most excited to see happen with DataHub in 2022?

“View ACL would be very much welcomed as we have requirements to control access to certain datasets. Also, I hope that version 1.0 of DataHub can be launched this year.”

What’s your favorite DataHub feature/use case?

“BrowsePaths, where you can specify where a dataset is located for browsing, is a very handy tool to cater to different user groups as we can put soft-links to common datasets at different points in the catalogue.”

Thank you, Liu, for speaking with the team and for all of your contributions to the DataHub Community.


If you are new to DataHub, just beginning to understand what “metadata” and “modern data stack” mean, or you’ve just read these words for the first time (welcome aboard! 🚀), let us take a moment to introduce ourselves and share a little history.

DataHub is an extensible metadata platform, enabling data discovery, data observability, and federated governance to tame the complexity of increasingly diverse data ecosystems. Originally built at LinkedIn, DataHub was open-sourced under the Apache 2.0 License in 2020. It now has a thriving community with over 2.3k members and 100+ code contributors, and many companies are actively using DataHub in production.

We believe that data-driven organizations need a reimagined developer-friendly data catalog to tackle the diversity and scale of the modern data stack. Our goal is to provide the most reliable and trusted enterprise data graph to empower data teams with best-in-class search and discovery and enable continuous data quality based on DataOps practices. This allows central data teams to scale their effectiveness and companies to maximize the value they derive from data.

Want to learn more about DataHub and how to join our community? Visit Datahub.com and say hello on Slack. 👋
