DiscoverMLOps.communityAccelerating Multimodal AI // Ethan Rosenthal // #242
Accelerating Multimodal AI // Ethan Rosenthal // #242

Accelerating Multimodal AI // Ethan Rosenthal // #242

Update: 2024-06-21
Share

Description

Join us at our first in-person conference on June 25 all about AI Quality: https://www.aiqualityconference.com/

Accelerating Multimodal AI // MLOps podcast #241 with Ethan Rosenthal, Member of Technical Staff of Runway.

Huge thank you to AWS for sponsoring this episode. AWS - https://aws.amazon.com/

// Abstract
We’re still trying to figure out systems and processes for training and serving “regular” machine learning models, and now we have multimodal AI to contend with! These new systems present unique challenges across the spectrum, from data management to efficient inference. I’ll talk about the similarities, differences, and challenges that I’ve seen by moving from tabular machine learning, to large language models, to generative video systems. I’ll also talk about the setups and tools that I have seen work best for supporting and accelerating both the research and productionization process.

// Bio
Ethan works at Runway building systems for media generation. Ethan's work generally straddles the boundary between research and engineering without falling too hard on either side. Prior to Runway, Ethan spent 4 years at Square. There, he led a small team of AI Engineers training large language models for Conversational AI. Before Square, Ethan freelance consulted and worked at a couple ecommerce startups. Ethan found his way into tech by way of a Physics PhD.

// MLOps Jobs board
https://mlops.pallet.xyz/jobs

// MLOps Swag/Merch
https://mlops-community.myshopify.com/

// Related Links
Website: https://www.ethanrosenthal.com
Ethan's mangum opus: https://www.ethanrosenthal.com/2020/08/25/optimal-peanut-butter-and-banana-sandwiches/
Real-time Model Inference in a Video Streaming Environment // Brannon Dorsey // Coffee Sessions #98: https://youtu.be/TNO6rYwP3yg
Feature Stores for Self-Service Machine Learning: https://www.ethanrosenthal.com/2021/02/03/feature-stores-self-service/
Gen-1: The Next Step Forward for Generative AI: https://research.runwayml.com/gen1
Machine Learning: The High Interest Credit Card of Technical Debt by D. Sculley et al.: https://research.google/pubs/machine-learning-the-high-interest-credit-card-of-technical-debt/

--------------- ✌️Connect With Us ✌️ -------------
Join our slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletters, and more: https://mlops.community/

Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Ethan on LinkedIn: https://bsky.app/profile/ethanrosenthal.com



Timestamps:
[00:00 ] Ethan's preferred coffee
[00:11 ] Takeaways
[02:07 ] Falling into LLMs
[03:16 ] Advanced AI Tech Capabilities
[04:40 ] AI-powered video editing tool
[06:56 ] Transition to AI: Diffusion Models
[09:09 ] Multimodal Feature Store breakdown
[15:33 ] Multimodal Feature Stores Evolution
[18:09 ] Benefits of Multimodal Feature Store
[25:09 ] Centralized Training Data Repository
[27:33 ] Large-scale distributed training
[32:37 - 33:39 ] AWS Ad
[33:45 ] Dealing with researchers on productionizing
[43:52 ] Infrastructure for Researchers and Engineers
[47:04 ] Generative DevOps movement
[49:21 ] Structuring teams
[52:06 ] Multimodal Feature Stores Efficiency
[54:02 ] Wrap up

Comments 
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Accelerating Multimodal AI // Ethan Rosenthal // #242

Accelerating Multimodal AI // Ethan Rosenthal // #242

Demetrios Brinkmann