The Joe Reis Show

The official podcast of tech/data nerd and "recovering data scientist" Joe Reis. He provides refreshingly candid thoughts on the world of technology and data. Each week, he broadcasts from somewhere in the world, sometimes ranting solo and sometimes talking with the smartest people in the business.

Tanya Bragin - ClickHouse, Open Source vs Commercial, and More

Tanya Bragin and I have a wide-ranging chat about the tension between open source and commercial products, ClickHouse, aligning marketing and product, and how she manages her time.

11-20
56:43

Freestyle Fridays - Obscurity is Your Enemy

People often ask me for career advice. In a tough job market where people are sending out thousands of resumes and hearing nothing back, I notice a lot of people have weak networks and are unknown to the companies they're applying to. This results in lots of frustration and disappointment for job seekers. Is there a better way? Yes. People need to know who you are. Obscurity is your enemy. Also, the name of the Friday show changed because I can't seem to keep things to five minutes ;) My works: 📕Fundamentals of Data Engineering: https://www.oreilly.com/library/view/fundamentals-of-data/9781098108298/ 🎥 Deeplearning.ai Data Engineering Certificate: https://www.coursera.org/professional-certificates/data-engineering 🔥Practical Data Modeling: https://practicaldatamodeling.substack.com/ 🤓 My SubStack: https://joereis.substack.com/

11-15
20:39

Chris Riccomini - Building (and Writing About) Data Intensive Applications

Chris Riccomini and I chat about building his latest project SlateDB, building data intensive infrastructure, writing, investing, and much more.

11-13
47:45

Beers and Data with Friends in Helsinki, Finland

In this episode, I have a chat with Antti Rask, Juha Korpella, Niko Korvenlaita, Russell Willis, and Kosti Hokkanen. We chat about data, startups, and business in Finland and Europe.

11-12
51:52

5 Minute Friday - The Quality Paradox

Let's do things the right way, not just the fast way.

11-08
11:16

5 Minute Friday - Asking Good Questions at Conferences

I speak at a lot of conferences, and I've lost track of how many questions I've answered. Since conferences are top of mind for me right now, here are some tips for asking good (and bad) questions of speakers.

11-01
12:58

Wes McKinney

Wes McKinney and I chat about Positron, how he created Pandas and Arrow, and what makes him tick.

10-30
59:03

5 Minute Friday - Is AI a Hail Mary for Tech Debt?

I've seen a TON of horror stories with tech debt and code migrations. It's estimated that 15% to 60% of every dollar in IT spend goes toward tech debt (that's a big range, I know). Regardless, most of this tech debt will not be paid down without a radical change in how we do things. Might AI be the Hail Mary we need to pay down tech debt? I don't see why not...

10-25
07:56

Anne-Claire Baschet and Yoann Benoit - The Data Death Cycle

Anne-Claire Baschet and Yoann Benoit recently wrote a wonderful article called The Data Death Cycle, which describes the feedback loop of doom that many data teams find themselves in. Here, we discuss the Data Death Cycle in detail. Article: https://medium.com/craftingdataproducts/the-data-death-cycle-6b10ef261d8e

10-24
14:12

Larry Burns - Reimagining How Data Teams Can Add Value

Larry Burns and I chat about all things data teams—how they fail, their challenges, and how they can add value. To add value, we need to reimagine not only how we think about data but also how we manage knowledge. Larry brings a fresh and battle-worn perspective to the data field, and if you work on or manage a data team, this conversation is worth a listen. LinkedIn: https://www.linkedin.com/in/larryburnsdba/

10-23
01:01:17

5 Minute Friday - Speaking at Conferences

This week I posted about how some major conferences charge a bunch of money for tickets and sponsorship, but don't pay speakers. As a speaker, I find this unethical and exploitative. Here, I unpack my thoughts on speaking at conferences. If you're a speaker, or want to become one, this is worth your time to listen. My post: https://www.linkedin.com/posts/josephreis_this-morning-i-had-to-decline-a-speaking-activity-7252331326287011841-NPG6

10-18
09:27

Vijay Yadav - GenAI-Ready Data

Vijay Yadav (Director of Data Science at Merck) joins me to chat about a very interesting project he launched at Merck involving LLMs in production. A big part of this discussion is how to make data ready for generative AI. LLM-native use cases in production are rare right now, and this is a great example of one. Lots to learn from here. Enjoy! LinkedIn: https://www.linkedin.com/in/vijay-yadav-ds/

10-16
01:03:56

5 Minute Friday - Playing Not to Lose

In my newsletter last week, I wrote "Data’s still a mess. Most data initiatives fail. Data teams are seen as a cost center and not getting the support they deserve. Same as it ever was." Here, I unpack those four sentences. Data teams need to stop playing not to lose. Instead, they need to play to win!

10-11
12:05

Navnit Shukla - Data Wrangling and Architecting Solutions on AWS, Writing Books, and More

Navnit Shukla is a solutions architect with AWS. He joins me to chat about data wrangling and architecting solutions on AWS, writing books, and much more. Navnit is also in the Coursera Data Engineering Specialization, dropping knowledge on data engineering on AWS. Check it out! Data Wrangling on AWS: https://www.amazon.com/Data-Wrangling-AWS-organize-analysis/dp/1801810907 LinkedIn: https://www.linkedin.com/in/navnitshukla/

10-09
56:33

5 Minute Friday - Field Notes, Early Fall 2024 Edition

I've spent the last three weeks visiting the UK, Australia, and New Zealand. Here are my observations and anecdotes about the data and ML/AI industry from countless chats with executives, practitioners, and pundits.

10-04
10:02

Ilya Reznik - How to Lead New and Existing ML Teams and More

Ilya Reznik has been in the ML game for ages, having worked at Adobe and Twitter and led teams at Meta, among others. We chat about leading ML teams, AI today, creating content, and much more. LinkedIn: https://www.linkedin.com/in/ibreznik/

10-01
01:03:06

5 Minute Friday - Boring is Good

As I travel this Fall, I'm reminded that most people don't work at fancy tech companies. Most people work at traditional companies with "boring" data and tech stacks. And that's OK. Boring is good.

09-27
05:33

Jordan Morrow - How to Write Amazing Books

Jordan Morrow has written a ton, including four books. We chat about the process of writing books, the ins and outs of working with a publisher, the role of AI in writing, and much more. If you're interested in writing a book, this is a crash course in what you should know. Enjoy!

09-26
36:42

Venkat Subramaniam - Moving Beyond Agile as a Buzzword, Learning to Do Less, and More

Venkat Subramaniam is a programmer, author, speaker, and founder of Agile Developer, Inc. I've seen him speak several times, and I've always been blown away by his passion and technical depth. So, I was excited to have him on the podcast. We chat about agile development in the real world, learning to do less, and much more. Venkat is extremely wise, and I very much enjoyed our discussion. Enjoy! LinkedIn: https://www.linkedin.com/in/vsubramaniam Twitter: https://x.com/venkat_s

09-24
57:56

5 Minute Friday - Uncle Rico

Uncle Rico is a character in the movie Napoleon Dynamite who is stuck in the past, reminiscing about his days as a high school football star. If only he'd won the game and gone to the state championship. Some of the data industry reminds me of Uncle Rico. During a recent panel, there was a question about whether AI can help with data management (governance, modeling, etc.). Some people were quick to dismiss this, saying that machines are no substitute for humans in understanding and translating "the business" to data. Yet why are we still perpetually stuck in the mode of "80% of data projects fail"? Might AI/ML help data management move out of its rut? Or will it stay stuck in the past? Also, please check out my new data engineering course on Coursera! https://www.coursera.org/learn/intro-to-data-engineering

09-20
05:50
