Modal and Scaling AI Inference with Erik Bernhardsson

Update: 2025-07-31

Description


Modal is a serverless compute platform that’s specifically focused on AI workloads. The company’s goal is to enable AI teams to quickly spin up GPU-enabled containers, and rapidly iterate and autoscale.


It was founded by Erik Bernhardsson, who previously spent seven years at Spotify, where he built the music recommendation system and the popular Luigi workflow scheduler.


In this episode, Erik joins Sean Falconer to talk about the motivation for founding his company, the market gap in ML and AI tooling, optimizing container cold start, Modal’s interface design, and more.




Sean’s been an academic, startup founder, and Googler. He has published works covering a wide range of topics from AI to quantum computing. Currently, Sean is an AI Entrepreneur in Residence at Confluent where he works on AI strategy and thought leadership. You can connect with Sean on LinkedIn.


 






The post Modal and Scaling AI Inference with Erik Bernhardsson appeared first on Software Engineering Daily.

Comments (1)

Camilo

Excellent 😃

Sep 16th