Plumbers of Data Science

Data Engineering is the plumbing of data science. Almost invisible, but super important and a big mess when done wrong. We talk about interesting Data Engineering trends and topics. I also teach Data Engineering in my Data Engineering Academy at LearnDataEngineering.com.

#90 Taylor McGrath - The Future of the Modern Data Stack

Super happy to have Taylor with me on this stream. She is the VP of Data Labs at Rivery and therefore has a lot of experience with data platforms. We'll talk about the modern data stack and where it's going. I'm excited to hear about her experience with the changes that are happening in the data space, and what they mean for data engineers & data teams.

01-25
47:00

#89 Piyush Sachdeva - Getting Into Google After Eight Rejections from Amazon!

In this video I talk to Piyush, who's an engineer at Google and has his own YouTube channel: "Tech Tutorials with Piyush". He's a really good guy and I love how dedicated he is to teaching engineering. We talk about some awesome topics like:
Is LinkedIn a must for getting a job?
Tips for recording yourself
Cloud Engineering vs Data Engineering
Which Cloud Platform should you choose right now?
The amazing Google work culture explained
Everybody should learn how to use Kubernetes
How getting rejected over and over at Amazon got him into Google
The hiring process at Google
Have fun! You can also check this one out on YouTube: https://youtu.be/FZemVaqQcnM
If you want to get into Data Engineering, check out my Academy at https://learndataengineering.com

01-16
44:26

#88 - Wouter Trappers - How to Realize a Data Strategy Like a Pro!

I have seen people get this wrong a few times. Luckily, Wouter Trappers, who helps companies with this professionally, can help. We talked about the steps you need to take from value proposition to dashboards. Wouter is really knowledgeable, and it was super fun talking with him and hearing his approach.

04-12
39:48

#87 - Dhruba Borthakur - From Hadoop to real time analytics

Dhruba Borthakur is CTO at Rockset and a passionate Data Engineer. Before co-founding Rockset he played a big role in the development of Hadoop HDFS at Yahoo as well as HBase and RocksDB at Facebook. His current project is the serverless Rockset platform, where you can gain real-time analytics insights into your data. I tried it out before our talk and really liked it.

04-12
01:05:37

#86 The Ultimate Data Engineering Introduction

The Podcast is back!!!! I promise I am going to keep it up to date this time ;) In this episode I talk about my newest Data Engineering course. I think it's the ultimate 1 hour 15 minutes introduction to Data Engineering. There were also a ton of questions from the chat that I answered. I think you will really enjoy this.

01-14
01:14:35

#085 Big Data and Data Science Landscape plus trying to read Tweets with Nifi

We are looking into the network communication protocol map. I first saw this like 10 years ago and it's awesome. Then we check out the Big Data and Data Science Landscape image. It shows you all the tools available to do data science, machine learning and data engineering, which is very helpful if you are researching tools to use. Before using the Twitter API you have to create a developer account, so I show you how I created one. After that I tried to get Nifi to download Tweets, but it is not working.

05-28
43:06

#084 Behind the scenes: Audio podcast, free transcriptions and GitHub

Today's podcast is a bit of a behind the scenes: what it takes to do an audio podcast and how you can get audio to text transcriptions for free. Also, GitHub questions on how to work with branches on the Cookbook.

05-27
51:21

#083 Data Engineering at OLX Case Study

Today we have a case study about OLX with a guest. It was super fun! Here are the slides Alexey and I talked about: https://www.slideshare.net/mobile/AlexeyGrigorev/image-models-infrastructure-at-olx

05-27
01:10:53

#082 Reading Tweets With Apache Nifi & IaaS vs PaaS vs SaaS

In this episode we install the Nifi Docker container and look into how we can extract the Twitter data. We also talk about the differences between infrastructure as a service, platform as a service and software as a service.

05-27
01:19:06

#081 How to get tweets from the Twitter API

In this episode we look into the Twitter API documentation, which I love by the way. How can we get old tweets for certain hashtags, and how can we get current live tweets for these hashtags?

05-27
01:09:47

#080 How To Find A Job In Germany & Answering Mails

Tips on how you find a job in Germany and two super interesting mails.

05-27
54:54

#079 Trying to stay true to myself and making the cookbook public on GitHub

The cookbook and my YouTube will be free, forever! Check out the data engineering cookbook on GitHub: https://github.com/andkret/Cookbook

05-27
24:34

#078 Cookbook collaboration and updates

Updates of the cookbook and how to collaborate on it

05-27
31:08

#077 Lambda and Kappa Architecture

In this episode we talk about the Lambda Architecture with stream and batch processing, as well as an alternative, the Kappa Architecture, which consists only of streaming. We also cover data engineer vs data scientist and discuss Andrew Ng's AI Transformation Playbook.

05-27
01:22:01

#076 Cloud vs On Premise How To Decide

How do you choose between Cloud and On-Premise? We go over the pros and cons and what you have to think about, because there are good reasons to not go cloud. Also thoughts on how to choose between the cloud providers by just comparing instance prices. Otherwise the comparison will drive you insane.

05-27
01:15:56

#075 Creating the Course Structure For My Data Engineering Course

In this episode we go over the ideas I have for the data engineering course structure. It was your chance to influence what we put in there.

05-27
53:18

#074 Starting My Data Engineering Online Course

In this video we go over some of the 100+ comments I received on LinkedIn about a data engineering training. 

05-27
01:01:19

#073 Data Engineering At LinkedIn Case Study

Let's check out how LinkedIn is processing data

05-27
01:12:21

#072 Data Engineering At Twitter Case Study

How is Twitter doing Data Engineering? Oh man, they have a lot of cool things to share about these tweets.

05-27
56:27

#071 Data Engineering At Spotify Case Study

In this episode we are looking at the data engineering at Spotify, my favorite music streaming service. How do they process all that data?

05-27
43:04
