DataNation - Podcast for Data Engineers, Analysts and Scientists

Welcome to "Datanation," the podcast where your host, Alex Merced, takes you on a captivating journey through the fascinating world of data. In each episode, we explore a wide range of data topics, from data engineering and data analytics to the art and science of data-driven decision-making. In the age of information, data is the currency that drives innovation and progress. "Datanation" is your passport to this ever-evolving landscape, where we unravel the mysteries, dissect the trends, and celebrate the breakthroughs shaping the data-driven future. Join Alex Merced, a seasoned data enthusiast and educator, as he engages in enlightening discussions, informative interviews, and thought-provoking explorations of data concepts and practices. Whether you're a seasoned data professional, a curious tech enthusiast, or someone simply intrigued by the power of data, this podcast offers valuable insights and knowledge. Find all episodes at: https://host.alexmercedpodcast.com/series/datanation/ Follow Alex on Twitter @amdatalakehouse. Find articles Alex has written on data-related topics at Dremio.com/Subsurface

63 – Reinvent, AWS S3 Table Buckets and Apache Iceberg

Alex Merced discusses his experience at AWS re:Invent. Follow Alex at AlexMered.com/data

12-06
--:--

BONUS: Data Lakehouse Crash Course (Polaris, Nessie, Unity, Gravitino, Lakekeeper and more!)

Register for the catalog Course: https://drmevn.fyi/catalogcourse1024 Watch the Iceberg Crash Course: https://drmevn.fyi/icebergcourse1024 London Meetup: https://lu.ma/Lakehouselinkups Paris Meetup: https://drmevn.fyi/1120-france-meetup My Calendar of Events: https://lu.ma/Lakehouselinkups

11-05
00:57

62 – Why Catalogs are so hot right now in the data space?

Alex Merced discusses why catalogs are so important in data.

10-30
07:00

61 – What’s New In dbt? (dbt coalesce 2024)

Alex Merced discusses the news and announcements from dbt Coalesce 2024. Announcements Alex didn’t mention: – dbt Apache Iceberg support, which is done by working with Iceberg-supporting query engines like Dremio – Healthtiles with more information on your dashboard about the health of your models – Auto-exposures in Tableau triggering BI Dashboard updates when models […]

10-11
03:25

FREE Apache Iceberg Crash Course

Register for the Course: https://bit.ly/am-2024-iceberg-live-crash-course-1 Free Copy of Apache Iceberg Book: https://bit.ly/am-iceberg-book My social and blog links: https://bio.alexmerced.com/data

07-09
01:05

60 – Interoperability of Data Lake Table Format (Apache Iceberg, Apache Hudi, Delta Lake)

Alex Merced discusses where interoperability tools like Apache XTable and UniForm

06-28
17:32

#59 – Apache Iceberg Catalogs (Nessie) vs Enterprise Data Catalogs (Collibra)

Alex Merced discusses the difference between Apache Iceberg catalogs and enterprise data catalogs to help clarify the discussions around catalogs in today’s data trends. Follow Alex -> https://bio.alexmerced.com/data

06-25
06:40

58 – Databricks Announcements (Open Source Unity Catalog, Liquid Clustering, Nvidia)

Alex Merced discusses some of the Databricks announcements at the Data + AI Summit. Follow Alex by visiting https://bio.alexmerced.com/data

06-12
07:36

57 – Databricks buys Tabular

I talk about the big news of the day. Follow on Twitter @amdatalakehouse

06-05
19:08

56 – Open Source Apache Iceberg Catalogs (Nessie, Polaris, Gravitino)

Alex Merced discusses the value of open source Apache Iceberg catalogs in creating a truly open lakehouse environment without vendor lock-in. Check out my article on the subject: https://open.substack.com/pub/amdatalakehouse/p/open-source-table-format-open-source?r=h4f8p&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true Follow me on Twitter at @amdatalakehouse

06-04
08:54

55 – Discussing the Apache Iceberg Kafka Connect Connector

In this episode, we delve into the Apache Iceberg Kafka Connector, a critical tool for streaming data into your data lakehouse. We’ll explore how this connector facilitates seamless data ingestion from Apache Kafka into Apache Iceberg, enhancing your real-time analytics capabilities and data lakehouse efficiency. We’ll cover: Join us to understand how the Apache Iceberg […]

05-16
--:--

54 – Major Architectural Differences between Apache Iceberg and Delta Lake (Partition Evolution and Hidden Partitioning)

Alex Merced discusses some of the major differences in how Apache Iceberg and Delta Lake work that lead to: Follow me on social https://bio.alexmerced.com/data

04-20
08:31

53 – Why Do Snowflake Bills Get So Large?

Alex Merced discusses the mistakes that make Snowflake bills get so large. Hands-On Lakehouse Laptop Exercises: – MongoDB with Dremio: https://bit.ly/am-mongodb-dashboard – SQLServer with Dremio: https://bit.ly/am-sqlserver-dashboard – Postgres with Dremio: https://bit.ly/am-postgres-to-dashboard https://bio.alexmerced.com/data

04-17
--:--

52 – Apache Iceberg, Dremio and PuppyGraph

Alex Merced discusses the benefits of Apache Iceberg’s open data ecosystem! Build a Data Lakehouse on Your Laptop Deploy Deploy into Production

03-28
--:--

#1 – intro to catalogs, manifests and metadata. Oh my!

In this episode, Alex Merced introduces his new podcast “Catalogs, Manifests, and Metadata. Oh my!” covering open-source data projects like Apache Iceberg and others. Make sure to subscribe; this podcast will show up in podcast directories within a week or so of this episode being published. Follow Alex Merced, find all links […]

03-25
01:22

51 – Open Data Standards (Apache Iceberg, Apache Parquet, Apache Arrow, Apache Ibis, Apache Substrait)

Alex Merced discusses many of the open source projects aiming to reduce friction in the heavily fragmented data world. Follow me on Socials: https://bio.alexmerced.com/data

03-18
09:51

50 – Thinking about the flow of Streaming/Real-Time Data

Alex reflects on the development of real-time data pipelines.

02-21
--:--

48 – Understanding how Lakehouse Table Formats are Implemented in your Favorite Tools

Alex Merced discusses how formats like Apache Iceberg, Apache Hudi and Delta Lake work and are implemented in your favorite tools, distinguishing what is the responsibility of the format and what is the responsibility of the engine. Follow Alex on social, find all links at: https://bio.alexmerced.com/data

02-02
--:--

47 – Understanding your cloud costs (Storage, Egress, Compute, Serverless, etc.)

Alex Merced discusses cloud costs. Alex’s Links: https://bio.alexmerced/data

01-21
07:26

Bonus: New Youtube Channel, State of the Data Lakehouse

Find all my data resources below: https://bio.alexmerced.com/data Listen to the State of the Data Lakehouse Podcast Here: https://em360tech.com/podcast/dremio-state-data-lakehouse?utm_source=podcasts&utm_medium=podcast&utm_content=content&utm_campaign=alexmercedcontent&utm_term=iceberg+lakehouse+nessie

01-20
--:--
