Discover
KubeFM

49 Episodes
Reverse
Managing microservices in Kubernetes at scale often leads to inconsistent deployments and maintenance overhead. This episode explores a practical solution that standardizes service deployments while maintaining team autonomy.Calin Florescu discusses how a unified Helm chart approach can help platform teams support multiple development teams efficiently while maintaining consistent standards across services.You will learn:Why inconsistent Helm chart configurations across teams create maintenance challenges and slow down deploymentsHow to implement a unified Helm chart that balances standardization with flexibility through override functionsHow to maintain quality through automated documentation and testing with tools like Helm Docs and Helm unittestSponsorThis episode is sponsored by Learnk8s — get started on your Kubernetes journey through comprehensive online, in-person or remote training.More infoFind all the links and info for this episode here: https://ku.bz/mcPtH5395Interested in sponsoring an episode? Learn more.
Learn how ByteDance manages computing resources at scale with custom Kubernetes scheduling solutions that handle millions of pods across thousands of nodes.Yue Yin, Software Engineer at ByteDance, discusses their open-source Gödel scheduler and Katalyst resource management system. She explains how these tools address the challenges of managing online and offline workloads in large-scale Kubernetes deployments.You will learn:How Gödel's distributed architecture with dispatcher, scheduler, and binder components enables the scheduling of 5,000 pods per secondWhy NUMA-aware scheduling and two-layer architecture are crucial for handling complex workloads at scaleHow Katalyst provides node-level resource insights to enable efficient workload co-location and improve CPU utilizationSponsorThis episode is sponsored by Learnk8s — get started on your Kubernetes journey through comprehensive online, in-person or remote training.More infoFind all the links and info for this episode here: https://ku.bz/lMpNng_33Interested in sponsoring an episode? Learn more.
Platform Engineer Artem Lajko breaks down observability into three distinct layers and explains how tools like Prometheus, Grafana, and Falco serve different purposes. He also shares practical insights on implementing the right level of monitoring based on team requirements and capabilities.You will learn:How to implement the three-layer model (external, internal, and OS-level) and why each layer serves different stakeholdersHow to choose and scale observability tools using a label-based approach (low, medium, high)How to manage observability costs by collecting only relevant metrics and logsSponsorThis episode is sponsored by Learnk8s — get started on your Kubernetes journey through comprehensive online, in-person or remote training.More infoFind all the links and info for this episode here: https://ku.bz/9sGxhmm8sInterested in sponsoring an episode? Learn more.
In this episode, William Morgan, CEO of Buoyant, explores the complex trade-offs between cost optimization and reliability in Kubernetes networking. The discussion focuses on Topology-aware routing and why its implementation might not be the silver bullet for managing cross-zone traffic costs.William shares practical insights from real-world implementations and explains why understanding these trade-offs is crucial for platform teams managing multi-zone Kubernetes clusters.You will learn:How Topology-aware routing attempts to reduce cross-zone traffic costs but can compromise reliability by limiting inter-zone communicationWhy Layer 7 load balancing offers better traffic management through protocol awareness compared to topology-aware routing's Layer 4 approachHow HAZL (High Availability Zonal Load Balancing) provides a more nuanced solution by balancing cost savings with reliability guarantees through intelligent traffic routingSponsorThis episode is sponsored by Learnk8s — get started on your Kubernetes journey through comprehensive online, in-person or remote training.More infoFind all the links and info for this episode here: https://ku.bz/CBwn51pl-Interested in sponsoring an episode? Learn more.
Are you running PostgreSQL on Kubernetes and need to choose the right operator? In this episode, David Pech, Staff Cloud Ops Engineer, shares his experience implementing database platforms on Kubernetes and guides teams through operator selection and platform requirements.You will learn:The core requirements for a PostgreSQL platform on Kubernetes, including autopilot capabilities, security practices, and observabilityHow to evaluate PostgreSQL operators based on their architecture — from single-instance deployments to cloud-native implementationsWhat teams should consider before building their own database-as-a-service and common pitfalls to avoidThe distinction between being production-ready (running single instances) versus platform-ready (operating at scale with proper tooling)SponsorThis episode is sponsored by Learnk8s — get started on your Kubernetes journey through comprehensive online, in-person or remote training.More infoFind all the links and info for this episode here: https://ku.bz/rGMF2ktdbInterested in sponsoring an episode? Learn more.
Stefan Roman shares his experience building Labs4Grabs, a platform that gives students root access to Kubernetes clusters. He discusses the journey from evaluating simple namespace-based isolation to implementing full VM-based isolation with KubeVirt.You will learn:Why namespace isolation isn't sufficient for untrusted users and the limitations of tools like vCluster when running privileged workloads.How to use KubeVirt to achieve complete workload isolation and the trade-offs.Practical approaches to implementing network security with NetworkPolicies and managing resource allocation across multiple student environments.Follow Stefan's journey from simple to complex isolation strategies, focusing on the technical decisions and trade-offs he encountered.SponsorThis episode is sponsored by Kusari — gain complete visibility into your software components and secure your supply chain through comprehensive tracking and analysis.More infoFind all the links and info for this episode here: https://ku.bz/Xz-TrmX2FInterested in sponsoring an episode? Learn more.
Michael Levan explains how specialized teams and smart abstractions can lead to better outcomes. Drawing from cognitive science and his experience in platform engineering, Michael presents practical strategies for building effective engineering organizations.You will learn:Why specialized teams (or "silos") can improve productivity and why the real enemy is ego, not specialization.How to use Internal Developer Platforms (IDPs) and abstractions to empower teams without requiring everyone to be a Kubernetes expert.How to balance specialization and collaboration using platform engineering practices and smart abstractionsPractical strategies for managing cognitive load in engineering teams and why not everyone needs to know YAML.SponsorThis episode is brought to you by Testkube — scale all of your tests with Kubernetes, integrate seamlessly with CI/CD and centralize test troubleshooting and reporting.More infoFind all the links and info for this episode here: https://ku.bz/qlZPfM-zrInterested in sponsoring an episode? Learn more.
Xe Iaso shares their journey in building a "compute as a faucet" home lab where infrastructure becomes invisible and tasks can be executed without manual intervention. The discussion covers everything from operating system selection to storage architecture and secure access patterns.You will learn:How to evaluate operating systems for your home lab — from Rocky Linux to Talos Linux, and why minimal, immutable operating systems are gaining traction.How to implement a three-tier storage strategy combining Longhorn (replicated storage), NFS (bulk storage), and S3 (cloud storage) to handle different workload requirements.How to secure your home lab with certificate-based authentication, WireGuard VPN, and proper DNS configuration while protecting your home IP address.SponsorThis episode is sponsored by Nutanix — innovate faster with a complete and open cloud-native stack for all your apps and data anywhere.More infoFind all the links and info for this episode here: https://ku.bz/2kzj2MgfHInterested in sponsoring an episode? Learn more.
If you're trying to make sense of when to use Kubernetes and when to avoid it, this episode offers a practical perspective based on real-world experience running production workloads.Paul Butler, founder of Jamsocket, discusses how to identify necessary vs unnecessary complexity in Kubernetes and explains how his team successfully runs production workloads by being selective about which features they use.You will learn:The three compelling reasons to use Kubernetes are managing multiple services across machines, defining infrastructure as code, and leveraging built-in redundancy.Why to be cautious with features like CRDs, StatefulSets, and Helm and how to evaluate if you really need them.How to stay on the "happy path" in Kubernetes by focusing on stable and simple resources like Deployments, Services, and ConfigMaps.When to consider alternatives like Google Cloud Run for simpler deployments that don't need the full complexity of KubernetesSponsorThis episode is sponsored by Syntasso, the creators of Kratix, a framework for building composable internal developer platformsMore infoFind all the links and info for this episode here: https://ku.bz/VB-0WYqtbInterested in sponsoring an episode? Learn more.
This episode explores Admission Controllers and Webhooks with Gordon Myers, who shares his experience implementing webhook solutions in production. Gordon explains the lifecycle of Kubernetes API requests and how webhooks can intercept and modify resources before they are stored in etcd.You will learn:How the Kubernetes API processes requests through authentication, authorization, and Admission Controllers.The difference between Validating and Mutating webhooks and how to implement them using JSON Patch.Best practices for testing webhooks and avoiding common pitfalls that can break cluster deployments.Real-world examples of webhook implementations, including injecting secrets from HashiCorp Vault into containers.SponsorThis episode is sponsored by Learnk8s — get started on your Kubernetes journey through comprehensive online, in-person or remote training.More infoFind all the links and info for this episode here: https://ku.bz/Dmn93dd7MInterested in sponsoring an episode? Learn more.
Are you facing challenges with pre-production environments in Kubernetes?This KubeFM episode shows how to implement efficient deployment previews and solve data seeding bottlenecks.Nick Nikitas, Senior Platform Engineer at Blueground, shares how his team transformed their static pre-production environments into dynamic previews using ArgoCD Application Sets, Wave and Velero.He explains their journey from managing informal environment sharing between teams to implementing a scalable preview system that reduced environment creation time from 19 minutes to 25 seconds.You will learn:How to implement GitOps-based preview environments with Argo CD Application Sets and PR generators for automatic environment creation and cleanup.How to control cloud costs with TTL-based termination and FIFO queues to manage the number of active preview environments.How to optimize data seeding using Velero, AWS EBS snapshots, and Kubernetes PVC management to achieve near-instant environment creation.SponsorThis episode is sponsored by Loft Labs — simplify Kubernetes with vCluster, the leading solution for Kubernetes multi-tenancy and cost savings.More infoFind all the links and info for this episode here: https://ku.bz/tt4VFslxDInterested in sponsoring an episode? Learn more.
Discover how a seemingly simple 502 error in Kubernetes can uncover complex interactions between Go and containerized environments.Emin Laletović, a solution architect at Hybird Technologies, shares his experience debugging a production issue in which a specific API endpoint failed due to out-of-memory errors.He walks through the systematic investigation process, from initial log checks to uncovering the root cause in Go's memory management within Kubernetes.You will learn:How Go's garbage collector interacts with Kubernetes resource limits, potentially leading to unexpected OOMKilled errors.The importance of the GOMEMLIMIT environment variable in Go 1.19+ for managing memory usage in containerized environments.Debugging techniques for memory-related issues in Kubernetes, including GODEBUG for garbage collector tracing.Considerations for optimizing Go applications in Kubernetes, balancing performance and resource utilization.SponsorThis episode is sponsored by StormForge – Double your Kubernetes resource utilization and unburden developers from sizing complexity with the first HPA-compatible vertical pod rightsizing solution. Try it for free.More infoFind all the links and info for this episode here: https://ku.bz/7fnF-tJ8RInterested in sponsoring an episode? Learn more.
This episode offers a rare glimpse into the design decisions that shaped the world's most popular container orchestration platform.Brian Grant, CTO of ConfigHub and former tech lead on Google's Borg team discusses the Kubernetes Resource Model (KRM) and its profound impact on the Kubernetes ecosystem.He explains how KRM's resource-centric API patterns enable Kubernetes' flexibility and extensibility and influence the entire cloud native landscape.You will learn:How the Kubernetes API evolved from inconsistency to a uniform structure, enabling support for thousands of resource types.Why Kubernetes' self-describing resources and Server-side Apply simplify client implementations and configuration management.The evolution of Kubernetes configuration tools like Helm, Kustomize, and GitOps solutions.Current trends and future directions in Kubernetes configuration, including potential AI-driven enhancements.SponsorThis episode is sponsored by StormForge – Double your Kubernetes resource utilization and unburden developers from sizing complexity with the first HPA-compatible vertical pod rightsizing solution. Try it for free.More infoFind all the links and info for this episode here: https://ku.bz/_ZLj6ZV-9Interested in sponsoring an episode? Learn more.
Dive into the world of GitOps and compare two of the most popular tools in the CNCF landscape: Argo CD and Flux CD.Andrei Kvapil, CEO and Founder of Aenix, breaks down the strengths and weaknesses of Argo CD and Flux CD, helping you understand which tool might best fit your team's needs.You will learn:The different philosophies behind the tools.How they handle access control and deployment restrictions.Their trade-offs in usability and conformance to infrastructure as code.Why there is no one-size-fits-all in the GitOps world.SponsorThis episode is sponsored by DigitalOcean — learn how GPUs for DigitalOcean Kubernetes can enable your AI/ML workloads.More infoFind all the links and info for this episode here: https://ku.bz/0mvh5s4LdInterested in sponsoring an episode? Learn more.
Eric Jalal, an independent consultant and Kubernetes developer, explains how Kubernetes is fundamentally built on familiar Linux features. He discusses why understanding Linux is crucial for working with Kubernetes and how this knowledge can simplify your approach to cloud-native technologies.You will learn:Why Eric considers Kubernetes to be "just Linux" and how it wraps existing Linux technologies.The importance of understanding Linux fundamentals (file systems, networking, storage).How Kubernetes provides a standard and consistent interface for managing Linux-based infrastructure.Why learning Linux deeply can make Kubernetes adoption an incremental step rather than a giant leapSponsorThis episode is sponsored by Learnk8s — get started on your Kubernetes journey through comprehensive online, in-person or remote training.More infoFind all the links and info for this episode here: https://ku.bz/-jCTfgqRCInterested in sponsoring an episode? Learn more.
Alexandre Souza, a senior platform engineer at Getir, shares his expertise in managing large-scale environments and configuring requests, limits, and autoscaling.He explores the challenges of over-provisioning and under-provisioning and discusses strategies for optimizing resource allocation using tools like Horizontal Pod Autoscaler (HPA) and Vertical Pod Autoscaler (VPA).You will learn:How to set appropriate resource requests and limits to balance application performance and cost-efficiency in large-scale Kubernetes environments.Strategies for implementing and configuring Horizontal Pod Autoscaler (HPA), including scaling policies and behavior management.The differences between CPU and memory management in Kubernetes and their impact on workload performance.Techniques for leveraging tools like KubeCost and StormForge to automate resource optimization.SponsorThis episode is sponsored by VictoriaMetrics - request a free trial for VictoriaMetrics enterprise today.More infoFind all the links and info for this episode here: https://ku.bz/z2Vj9PBYhInterested in sponsoring an episode? Learn more.
In this KubeFM episode, Kensei Kanada discusses Tortoise, an open-source project he developed at Mercari to tackle Kubernetes resource optimization challenges. He explains the limitations of existing solutions like Horizontal Pod Autoscaler (HPA) and Vertical Pod Autoscaler (VPA), and how Tortoise aims to provide a more comprehensive and automated approach to resource management in Kubernetes clusters.You will learn:The complexities of resource optimization in Kubernetes, including the challenges of managing HPA, VPA, and manual tuning of resource requests and limitsHow Tortoise automates resource optimization by replacing HPA and VPA, reducing the need for manual intervention and continuous tuningThe technical implementation of Tortoise, including its use of Custom Resource Definitions (CRDs) and how it interacts with existing Kubernetes componentsStrategies for adopting and migrating to new tools like Tortoise in a large-scale Kubernetes environmentSponsorThis episode is sponsored by Learnk8s — estimate the perfect cluster node with the Kubernetes Instance CalculatorMore infoFind all the links and info for this episode here: https://ku.bz/bRd0243xQInterested in sponsoring an episode? Learn more.
In this KubeFM episode, Ángel Barrera discusses Adidas' strategic shift to a GitOps-based container platform management system, initiated in May 2022, and its impact on their global infrastructure.You will learn:The initial state and challenges: Understand the complexities and inefficiencies of Adidas' pre-GitOps infrastructure.The transition process: Explore the steps and strategies used to migrate to a GitOps-based system, including tool changes and planning.Technical advantages: Learn about the benefits of the pull mechanism, unified configuration, and improved visibility into cluster states.Developer and business feedback: Gain insights into the feedback from developers and the business side, and how they were convinced to invest in the migration.SponsorThis episode is sponsored by ControlPlane — empower your Kubernetes deployments with ControlPlane Enterprise for Flux CD.More infoFind all the links and info for this episode here: https://ku.bz/-5QbzQXJgInterested in sponsoring an episode? Learn more.
In this KubeFM episode, Miguel Luna discusses the intricacies of Observability in Kubernetes, including its components, tools, and future trends.You will learn:The fundamental components of Observability: metrics, logs, and traces, and their roles in understanding system performance and health.Key tools and projects: insights into Keptn and OpenTelemetry and their significance in the Observability ecosystem.The integration of AI technologies: how AI is shaping the future of Observability in Kubernetes.Practical steps for implementing Observability: starting points, what to monitor, and how to manage alerts effectively.SponsorThis episode is sponsored by Learnk8s — estimate the perfect cluster node with the Kubernetes Instance CalculatorMore infoFind all the links and info for this episode here: https://ku.bz/WwS04jYvvInterested in sponsoring an episode? Learn more.
In this KubeFM episode, Harsha explores the intricacies of Kubernetes security, focusing on the benefits and misconceptions of Distroless container images and the broader aspects of container security.You will learn:The advantages and limitations of Distroless container images: understand why these images are smaller, have a reduced attack surface and are not inherently secure.Best practices for container security: gain insights into selecting base images, managing dependencies, and fortifying your infrastructure at every layer.Supply chain security: explore how the supply chain can be an attack vector and the importance of signing artifacts and validating sources.Emerging Kubernetes tools and future projects: discover the latest tools Harsha is monitoring and get a sneak peek into his upcoming projects, including a new podcast and a tool for simulating multistage attacks in cloud-native environments.SponsorThis episode is sponsored by Learnk8s — estimate the perfect cluster node with the Kubernetes Instance CalculatorMore infoFind all the links and info for this episode here: https://ku.bz/n_sJ04xMYInterested in sponsoring an episode? Learn more.
Comments
Top Podcasts
The Best New Comedy Podcast Right Now – June 2024The Best News Podcast Right Now – June 2024The Best New Business Podcast Right Now – June 2024The Best New Sports Podcast Right Now – June 2024The Best New True Crime Podcast Right Now – June 2024The Best New Joe Rogan Experience Podcast Right Now – June 20The Best New Dan Bongino Show Podcast Right Now – June 20The Best New Mark Levin Podcast – June 2024