DiscoverOpen Source Archives - Software Engineering DailyDatahub: Open Source Data Lake with Pardhu Gunnam and Mars Lan
Datahub: Open Source Data Lake with Pardhu Gunnam and Mars Lan

Datahub: Open Source Data Lake with Pardhu Gunnam and Mars Lan

Update: 2021-03-19
Share

Description


As the volume and scope of data collected by an organization grow, tasks such as data discovery and data management grow in complexity. Simply put, the more data there is, the harder it is for users such as data analysts to find what they’re looking for. A metadata hub helps manage Big Data by providing metadata search and discovery tools, and a centralized hub which presents a holistic view of the data ecosystem. DataHub is Linkedin’s open-sourced metadata search and discovery tool. It is Linkedin’s second generation of metadata hubs after WhereHows. 


Pardhu Gunnam and Mars Lan join us today from Metaphor, a company they co-founded to build out the DataHub ecosystem. Pardhu and Mars, and the other co-founders of Metaphor, were part of the team at Linkedin that built the DataHub project. They join the show today to talk about how DataHub democratizes data access for an organization, why the new DataHub architecture was critical to Linkedin’s growth, and what we can expect to see from the DataHub project moving forwards.


Sponsorship inquiries: sponsor@softwareengineeringdaily.com


The post Datahub: Open Source Data Lake with Pardhu Gunnam and Mars Lan appeared first on Software Engineering Daily.

Comments 
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Datahub: Open Source Data Lake with Pardhu Gunnam and Mars Lan

Datahub: Open Source Data Lake with Pardhu Gunnam and Mars Lan

SE Daily