Humans of Reliability
You Can’t Fix What You Don’t Measure: Observability in the Age of AI with Conor Bronsdon

Update: 2025-11-05
Description

Only 50% of companies monitor their ML systems. Building observability for AI is not simple: it goes well beyond 200 OK pings. In this episode, Sylvain Kalache sits down with Conor Bronsdon (Galileo) to unpack why observability, monitoring, and human feedback are the missing links to making large language models (LLMs) reliable in production.

Conor dives into the shift from traditional test-driven development to evaluation-driven development, where metrics like context adherence, completeness, and action advancement replace binary pass-fail checks. He also shares how teams can blend human-in-the-loop feedback, automated guardrails, and small language models to keep AI accurate, compliant, and cost-efficient at scale.
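For illustration, here is a minimal Python sketch of what evaluation-driven development can look like in practice. The metric implementations, names, and thresholds below are hypothetical stand-ins, not Galileo's product or API: real evaluators typically use LLM judges or trained scorers rather than token overlap.

```python
# Minimal sketch of evaluation-driven development (hypothetical, not
# Galileo's API). Instead of a binary assert, each model output is scored
# on graded metrics and compared against tunable thresholds.

from dataclasses import dataclass


def _tokens(text: str) -> list[str]:
    # Naive tokenizer: lowercase, split on whitespace, strip punctuation.
    return [t.strip(".,;:!?") for t in text.lower().split()]


@dataclass
class EvalResult:
    context_adherence: float  # 0.0-1.0: is the answer grounded in the retrieved context?
    completeness: float       # 0.0-1.0: does the answer address the whole question?

    def passes(self, adherence_min: float = 0.7, completeness_min: float = 0.7) -> bool:
        # Graded scores against thresholds, rather than exact-match pass/fail.
        return (self.context_adherence >= adherence_min
                and self.completeness >= completeness_min)


def score_context_adherence(answer: str, context: str) -> float:
    """Toy proxy: fraction of answer tokens found in the context."""
    answer_tokens = _tokens(answer)
    context_tokens = set(_tokens(context))
    if not answer_tokens:
        return 0.0
    return sum(t in context_tokens for t in answer_tokens) / len(answer_tokens)


def score_completeness(answer: str, required_points: list[str]) -> float:
    """Toy proxy: share of expected key points mentioned in the answer."""
    if not required_points:
        return 1.0
    answer_lower = answer.lower()
    return sum(p.lower() in answer_lower for p in required_points) / len(required_points)


if __name__ == "__main__":
    context = "Our SLA guarantees 99.9% uptime with credits issued for breaches."
    answer = "The SLA guarantees 99.9% uptime, and credits are issued for breaches."
    result = EvalResult(
        context_adherence=score_context_adherence(answer, context),
        completeness=score_completeness(answer, ["uptime", "credits"]),
    )
    # A test-driven check would be `assert answer == expected` (brittle);
    # an evaluation-driven check tracks graded scores over time instead.
    print(f"adherence={result.context_adherence:.2f} "
          f"completeness={result.completeness:.2f} pass={result.passes()}")
```

The same scores can be logged per request in production, which is what turns these evaluations into observability: thresholds gate deployments, while drift in the underlying metrics triggers alerts or human review.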


