DiscoverThe Python Podcast.__init__Update Your Model's View Of The World In Real Time With Streaming Machine Learning Using River
Update Your Model's View Of The World In Real Time With Streaming Machine Learning Using River

Update Your Model's View Of The World In Real Time With Streaming Machine Learning Using River

Update: 2022-12-12
Share

Description

Preamble


This is a cross-over episode from our new show The Machine Learning Podcast, the show about going from idea to production with machine learning.


Summary


The majority of machine learning projects that you read about or work on are built around batch processes. The model is trained, and then validated, and then deployed, with each step being a discrete and isolated task. Unfortunately, the real world is rarely static, leading to concept drift and model failures. River is a framework for building streaming machine learning projects that can constantly adapt to new information. In this episode Max Halford explains how the project works, why you might (or might not) want to consider streaming ML, and how to get started building with River.


Announcements



  • Hello and welcome to the Machine Learning Podcast, the podcast about machine learning and how to bring it from idea to delivery.

  • Building good ML models is hard, but testing them properly is even harder. At Deepchecks, they built an open-source testing framework that follows best practices, ensuring that your models behave as expected. Get started quickly using their built-in library of checks for testing and validating your model’s behavior and performance, and extend it to meet your specific needs as your model evolves. Accelerate your machine learning projects by building trust in your models and automating the testing that you used to do manually. Go to themachinelearningpodcast.com/deepchecks today to get started!

  • Your host is Tobias Macey and today I’m interviewing Max Halford about River, a Python toolkit for streaming and online machine learning


Interview



  • Introduction

  • How did you get involved in machine learning?

  • Can you describe what River is and the story behind it?

  • What is "online" machine learning?

    • What are the practical differences with batch ML?

    • Why is batch learning so predominant?

    • What are the cases where someone would want/need to use online or streaming ML?



  • The prevailing pattern for batch ML model lifecycles is to train, deploy, monitor, repeat. What does the ongoing maintenance for a streaming ML model look like?

    • Concept drift is typically due to a discrepancy between the data used to train a model and the actual data being observed. How does the use of online learning affect the incidence of drift?



  • Can you describe how the River framework is implemented?

    • How have the design and goals of the project changed since you started working on it?



  • How do the internal representations of the model differ from batch learning to allow for incremental updates to the model state?

  • In the documentation you note the use of Python dictionaries for state management and the flexibility offered by that choice. What are the benefits and potential pitfalls of that decision?

  • Can you describe the process of using River to design, implement, and validate a streaming ML model?

    • What are the operational requirements for deploying and serving the model once it has been developed?



  • What are some of the challenges that users of River might run into if they are coming from a batch learning background?

  • What are the most interesting, innovative, or unexpected ways that you have seen River used?

  • What are the most interesting, unexpected, or challenging lessons that you have learned while working on River?

  • When is River the wrong choice?

  • What do you have planned for the future of River?


Contact Info



Parting Question



  • From your perspective, what is the biggest barrier to adoption of machine learning today?


Closing Announcements



  • Thank you for listening! Don’t forget to check out our other shows. The Data Engineering Podcast covers the latest on modern data management. Podcast.__init__ covers the Python language, its community, and the innovative ways it is being used.

  • Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.

  • If you’ve learned something or tried out a project from the show then tell us about it! Email hosts@themachinelearningpodcast.com) with your story.

  • To help other people find the show please leave a review on iTunes and tell your friends and co-workers


Links



The intro and outro music is from Hitman’s Lovesong feat. Paola Graziano by The Freak Fandango Orchestra/CC BY-SA 3.0




Sponsored By:

Comments (46)

Santosh Autade

I love how sports fans connect online. While browsing through discussions, I saw Diamond Exchange ID mentioned a few times, which shows how some names become known naturally. I believe good sports platforms should be simple and friendly. Diamond Exchange ID appears to be associated with that digital sports culture. Fans today want more than just updates—they want interaction and connection. Sharing thoughts during games makes moments even better. Technology has truly made sports more fun and social. Visit Site@ https://diamond-exch.co.in

Dec 1st
Reply

Santosh Autade

I really enjoy reading sports discussions online, and it’s interesting how different platform names come up in conversations. Recently, I noticed Fairdeal Pro being mentioned by a few fans while talking about digital sports culture. It shows how online spaces are now part of how we follow matches. For me, sports are more enjoyable when I can read opinions and share reactions in real time. Good platforms make the experience smooth and comfortable. When a platform is easy to browse and doesn’t feel confusing, you actually want to spend more time there. That’s what matters most to users—clarity, comfort, and community. Sports bring people together emotionally, and sharing that excitement online makes every game even more memorable for fans like me. https://fairplayy.ai/pro/

Dec 1st
Reply

Santosh Autade

I enjoy following sports websites that are easy to browse and not confusing. Recently, while reading online discussions, I came across the name Govinda365 mentioned by sports fans and decided to explore more about it. What stood out to me was how people talk about digital sports culture, match discussions, and online communities. Govinda365 is often mentioned as part of that growing sports ecosystem where fans like to stay connected. I personally like platforms that feel simple and smooth so I can focus more on enjoying the sports content rather than figuring out where things are. These kinds of sports-related platforms really make it more fun to keep up with games, statistics, and fan opinions all in one place. Visit Site@ https://govinda365club.com

Nov 29th
Reply

Santosh Autade

I really enjoy how online sports communities have grown in recent years. These days, it’s not just about watching matches but also sharing reactions with other fans. I recently saw the name playwise35 mentioned during a sports discussion, and it reminded me how many digital spaces fans now use to connect. What matters most to me is having a smooth, easy platform experience. When things are well organized, you actually enjoy reading and exploring sports content more. Sports become more emotional when you’re able to share that excitement with others. Online interaction adds a new level of fun and connection to every game. Visit Site @ https://playwisebet.com

Nov 29th
Reply (1)

11exchzone

Really insightful topic! Most ML discussions focus on batch processing, so it’s refreshing to see streaming ML getting the spotlight. River sounds super promising for handling real-time data shifts and concept drift, especially in industries where behavior changes fast. I’ve seen similar conversations in communities related to fintech and platforms like 11exch, where constant updates and adaptability matter a lot. Definitely keen to experiment with River soon—great deep dive! https://11exchzone.com/

Nov 7th
Reply

my99exchid

ChatGPT said: Really insightful discussion about how streaming machine learning can keep models relevant in real time. I liked how Max explained the concept drift issue and how River tackles it dynamically. It’s fascinating to see Python frameworks evolving this way. While reading, I actually thought about how adaptive systems like my99exch also rely on real-time updates to improve performance and user experience. Great episode — truly worth a deeper listen! https://my99exch.id/

Nov 6th
Reply

fairdealpro

Really interesting breakdown of the difference between batch and streaming ML. The part about concept drift really resonated because data in real-world systems never stays still. I also like how River keeps things lightweight with incremental updates instead of retraining huge models from scratch. It’s practical for systems that evolve daily. I was reading this while dealing with a fairdeal login dashboard refresh at work, and it made me rethink how we monitor data flow. https://fairdealpro.com/login/

Nov 5th
Reply

obiiarticle

Great write-up! From Maid to Multi-Lakh Enterprise, the Inspiring Journey of Anitha S carries a message everyone should read. https://digitaldopamine.in/2025/10/22/from-maid-to-multi-lakh-enterprise-the-inspiring-journey-of-anitha-s-business-giseness-episode-3/

Oct 24th
Reply

11exchzone

Thanks for sharing such an in-depth interview with Max Halford. Subscribing to the podcast now! https://11exchzone.com/

Oct 18th
Reply

my99exchid

Thanks for sharing this! I’ve mostly worked with batch ML, so learning about online learning and how River handles continuous data streams is eye-opening. https://my99exch.id/

Oct 17th
Reply

my99exchid

Thanks for sharing this! I’ve mostly worked with batch ML, so learning about online learning and how River handles continuous data streams is eye-opening. https://fairdealpro.com/login/

Oct 17th
Reply

my99exchid

Really interesting overview of River and streaming ML! I like how it addresses the limitations of batch learning, especially in environments where data evolves continuously. Concept drift is such a tricky problem, and having tools that can adapt in real-time seems like a game-changer. https://my99exch.id/

Oct 17th
Reply

Callgirlspa Center

call girls in jaipur https://callgirlsspacenter.com/call-girl-in-jaipur/

Sep 26th
Reply

Callgirlspa Center

call girls in jaipur

Sep 26th
Reply

Callgirlspa Center

call girls in jaipur https://callgirlsspacenter.com/call-girl-in-jaipur/

Sep 26th
Reply

Callgirlspa Center

Great service, thanks Aisha Oberoy for being available in other cities too now.https://callgirlsspacenter.com/call-girl-in-jaipur/

Sep 26th
Reply

Pawan Kumar

Your post’s got that natural, welcoming touch—spot-on. At 247torax, we prioritize comfy, real moments in Bangalore. Explore Bangalore call girls for a laid-back connection. Thanks for the great read—keep sharing.=

Sep 25th
Reply

Pawan Kumar

Your post’s got that natural, welcoming touch—spot-on. At 247torax, we prioritize comfy, real moments in Bangalore. Explore Bangalore call girls for a laid-back connection. Thanks for the great read—keep sharing.

Sep 25th
Reply

Pawan Kumar

Your post’s got that natural, welcoming touch—spot-on. At 247torax, we prioritize comfy, real moments in Bangalore. Explore Bangalore call girls for a laid-back connection. Thanks for the great read—keep sharing.

Sep 25th
Reply

Pawan Kumar

Your post’s got that natural, welcoming touch—spot-on. At 247torax, we prioritize comfy, real moments in Bangalore. Explore Bangalore call girls for a laid-back connection. Thanks for the great read—keep sharing. More at https://bangalore.247torax.com/

Sep 25th
Reply
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Update Your Model's View Of The World In Real Time With Streaming Machine Learning Using River

Update Your Model's View Of The World In Real Time With Streaming Machine Learning Using River

Tobias Macey