nCast: The Cloud Optimization Podcast from nOps

#8 - Operating ML and GenAI at scale: the latest on Kubernetes, Karpenter and Bedrock

Update: 2023-10-11

Description

In Episode 8, we’re joined by AWS Partner Solutions Architects Andrew Park and Mike McDonald to discuss the complexity and cost of running today’s ML and AI workloads in the cloud.


From anecdotes of the bad old days before container orchestration, our panelists move to the present-day challenge of running infrastructure efficiently and simply — with the aim of freeing up data scientists and engineers to focus on building and innovating.


Our panelists discuss the merits, pitfalls, and potential of various cost-optimizing tools and approaches (Ray, Karpenter, Spot, timeslicing) — key to meeting the demand for expensive compute that ML and AI models generate at scale.


Watch the full episode for:



  • The lowdown on Amazon Bedrock and where it fits into the current stack of AWS ML and AI offerings — how it works, its use cases, and the access it grants to new generative AI models



  • How Karpenter can make your life easier and save you serious money (especially if you set it and forget it with nKS)



  • And hot takes on the controversial question: is ECS dead?!

