DiscoverKubeFMHow Policies Saved us a Thousand Headaches, with Alessandro Pomponio
How Policies Saved us a Thousand Headaches, with Alessandro Pomponio

How Policies Saved us a Thousand Headaches, with Alessandro Pomponio

Update: 2025-08-12
Share

Description

Alessandro Pomponio from IBM Research explains how his team transformed their chaotic bare-metal clusters into a well-governed, self-service platform for AI and scientific workloads. He walks through their journey from manual cluster interventions to a fully automated GitOps-first architecture using ArgoCD, Kyverno, and Kueue to handle everything from policy enforcement to GPU scheduling.

You will learn:

  • How to implement GitOps workflows that reduce administrative burden while maintaining governance and visibility across multi-tenant research environments

  • Practical policy enforcement strategies using Kyverno to prevent GPU monopolization, block interactive pod usage, and automatically inject scheduling constraints

  • Fair resource sharing techniques with Kueue to manage scarce GPU resources across different hardware types while supporting both specific and flexible allocation requests

  • Organizational change management approaches for gaining stakeholder buy-in, upskilling admin teams, and communicating policy changes to research users

Sponsor

This episode is brought to you by Testkube—the ultimate Continuous Testing Platform for Cloud Native applications. Scale fast, test continuously, and ship confidently. Check it out at testkube.io

More info

Comments 
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

How Policies Saved us a Thousand Headaches, with Alessandro Pomponio

How Policies Saved us a Thousand Headaches, with Alessandro Pomponio