Listen Top Shows Blog

GPT-5 Unboxed: What Changed, What Broke, and What’s Next

GPT-5 Unboxed: What Changed, What Broke, and What’s Next

Update: 2025-09-19

Share

Description

In this special episode of the ODSC Ai X Podcast, host Sheamus McGovern dives into the real-world impact of GPT-5—from routing and hallucination issues to cost savings and open-weight models.

Joining him are two expert guests:

Ivan Lee: Founder and CEO of Datasaur, who helps enterprises build private LLM stacks and has deep experience evaluating model upgrades.
Nir Gazit: Co-founder and CEO of Traceloop, and co-creator of the OpenTelemetry Generative AI SIG, who brings insight into model routing, evaluation strategies, and observability tooling.

Together, they unpack what GPT-5 actually changed—and what teams should do next.

Key Topics Covered:

Why GPT-5’s biggest shift is routing, not reasoning
What casual vs. power users gained (or lost) with the rollout
Hallucination benchmarks vs. real-world results
Evaluation strategies using open-source tools like Phoenix and LangChain
OpenAI’s OSS model release and its enterprise implications
Why developers worry about black-box routing and lack of traceability
How to migrate safely: pinning snapshots, running evals, shadow testing
Whether GPT-5 gets us closer to AGI—or just better infrastructure
What to expect from agent workflows, tool selection, and model specialization

Memorable Outtakes:

Ivan Lee: “GPT-5 is an upgrade for 98% of users—but for the power users, the loss of model choice felt like control was taken away.”
Nir Gazit: “Of course every new model crushes it on benchmarks—they’re optimizing for the benchmarks. That doesn’t mean it works for your use case.”
Ivan Lee: “OpenAI’s OSS release might be the bigger story than GPT-5. Suddenly, enterprises are back at the table.”

References & Resources:

Guests

Ivan Lee – CEO of Datasaur
Website: https://www.datasaur.ai
LinkedIn: https://www.linkedin.com/in/iylee/
Nir Gazit – CEO of Traceloop
Website: https://www.traceloop.com
Blog: https://www.traceloop.com/blog
LinkedIn: https://www.linkedin.com/in/nirga/

Resources Mentioned

OpenAI GPT-5 https://openai.com/gpt-5
OpenTelemetry Project: https://opentelemetry.io
Traceloop OpenLLMetry: https://www.traceloop.com/openllmetry
Phoenix (Arize AI open-source evals): https://github.com/Arize-ai/phoenix
LangChain Evals: https://python.langchain.com/api_reference/langchain/evaluation.html
GPT-OSS Open Weight Models by OpenAI: https://platform.openai.com/docs/models/gpt-oss
Claude + Model Context Protocol (Anthropic): https://docs.anthropic.com/en/docs/tool-use
ARC-AGI Leaderboard: https://arcprize.org/leaderboard

Sponsored by:

🔥 ODSC AI West 2025 – The Leading AI Training Conference

Join us in San Francisco from October 28th–30th for expert-led sessions on generative AI, LLMOps, and AI-driven automation.

Use the code podcast for 10% off any ticket.

Learn more: https://odsc.ai

Comments

In Channel

CrewAI and the Rise of Autonomous Agents in Enterprise AI with João (Joe) Moura

CrewAI and the Rise of Autonomous Agents in Enterprise AI with João (Joe) Moura

2025-09-1938:00

Inside Google’s New AI Stack with Paige Bailey

Inside Google’s New AI Stack with Paige Bailey

2025-09-1939:50

From Turing’s Chess to Neural Game Engines: AI in Video Games Today with Julian Togelius

From Turing’s Chess to Neural Game Engines: AI in Video Games Today with Julian Togelius

2025-09-1901:07:41

Your Brain on ChatGPT with Nataliya Kosmyna

Your Brain on ChatGPT with Nataliya Kosmyna

2025-09-1954:46

GPT-5 Unboxed: What Changed, What Broke, and What’s Next

GPT-5 Unboxed: What Changed, What Broke, and What’s Next

2025-09-1956:45

Minimum Viable AI: Redefining How We Build Products with Dan Huss

Minimum Viable AI: Redefining How We Build Products with Dan Huss

2025-09-1801:00:39

The Hardest Problem in AI: Evaluation in 2025 with Ian Cairns

The Hardest Problem in AI: Evaluation in 2025 with Ian Cairns

2025-09-1245:59

The Most Neglected Tasks in Data Engineering with Veronika Durgin

The Most Neglected Tasks in Data Engineering with Veronika Durgin

2025-09-1248:32

Nick Walton: Creating Unique Narrative Experiences with AI in Gaming

Nick Walton: Creating Unique Narrative Experiences with AI in Gaming

2025-07-2437:16

AI Agents in Action: Memory, Messaging, and MCP with Michael Lanham

AI Agents in Action: Memory, Messaging, and MCP with Michael Lanham

2025-07-2444:58

What No One Tells You About AI Infrastructure with Hugo Shi

What No One Tells You About AI Infrastructure with Hugo Shi

2025-07-0435:27

Beyond Real: The Case for Synthetic Data + How to Win $100K with Alexandra Ebert

Beyond Real: The Case for Synthetic Data + How to Win $100K with Alexandra Ebert

2025-06-2647:07

ODSC East 2025 Minisodes

ODSC East 2025 Minisodes

2025-06-1746:50

"AI Can Predict Disease—So Why Aren’t Doctors Using It?" with Regina Barzilay

"AI Can Predict Disease—So Why Aren’t Doctors Using It?" with Regina Barzilay

2025-05-2231:29

The AI Superintelligence Myth with Arvind Narayanan

The AI Superintelligence Myth with Arvind Narayanan

2025-05-1250:41

Inside Probabilistic AI: Bayesian Modeling and PyMC with Thomas Wiecki

Inside Probabilistic AI: Bayesian Modeling and PyMC with Thomas Wiecki

2025-05-0548:26

Can AI Simulate Life? Exploring World Models and Digital Organisms with Eric Xing

Can AI Simulate Life? Exploring World Models and Digital Organisms with Eric Xing

2025-05-0159:23

Rethinking RAG: Why AI Search Needs a New Architecture with Sid Probstein

Rethinking RAG: Why AI Search Needs a New Architecture with Sid Probstein

2025-04-1853:29

AI Agents: The Shift from AI Assistants to Intelligent Automation

AI Agents: The Shift from AI Assistants to Intelligent Automation

2025-04-1642:53

Making AI Make Sense with Graphs: Context, Connections, and GraphRAG

Making AI Make Sense with Graphs: Context, Connections, and GraphRAG

2025-04-0947:56

00:00

00:00

1.0x

GPT-5 Unboxed: What Changed, What Broke, and What’s Next

GPT-5 Unboxed: What Changed, What Broke, and What’s Next