Episode 40: AI Observabilty and Evaluation with Arize AI

Update: 2025-05-07

Description

AI can still sometimes hallucinate and give less than optimal answers. To address this, we are joined by Arize AI’s Co-Founder a Aparna Dhinakaran for a discussion on Observability and Evaluation for AI. We begin by discussing the challenges AI Observability and Evaluation. For example, how does “LLM as a Judge” work? We conclude with some valuable advice from Aparna for first time entrepreneurs.

Begin Observing and Evaluating your AI Applications with Open Source Phoenix:

https://phoenix.arize.com/

AWS Hosts: Nolan Chen & Malini Chatterjee

Email Your Feedback: rethinkpodcast@amazon.com

Comments

In Channel

Episode 45: re:Invent 2025 Recap: AI’s Continuing Impact on Individuals and Organizations

2025-12-1844:18

Episode 44: The Future Of Wellness, Powered by Gen AI at mindbodygreen

2025-11-2032:11

Episode 43: Rethinking Data Infrastructure for AI Agents with Tacnode Context Lake

2025-11-1829:43

Episode 42: Rethinking AI Assistants for Businesses with Amazon Quick Suite

2025-11-1142:16

Episode 41: Rethinking Data Strategy for Healthcare and Life Sciences with TileDB

2025-07-1030:10

Episode 40: AI Observabilty and Evaluation with Arize AI

2025-05-0739:04

Episode 39: Rethink your AI Agents with MCP

2025-05-0522:01

Episode 38: AI Native Databases with Weaviate

2025-04-3024:47

Episode 37: AI Model Merging with Arcee AI

2025-04-2346:08

Episode 36: AI Data Pipelines with Komprise

2025-02-1321:20

Episode 35: What it Takes to Win in 2025

2025-01-2928:20

Episode 34: re:Invent 2024 Recap

2024-12-2040:53

Episode 33: Rethinking Network Connectivity and Security in the Cloud with Tailscale

2024-11-1829:03

Episode 32: How Criteria Corp unleashes Next-Gen Interviewing with AWS Gen AI

2024-11-0417:09

Episode 31: Future of GenAI driven Bulk Image Editing with Crop.Photo

2024-10-1818:12

Episode 30: FinOps and Insuring your Cloud Investment with Archera

2024-10-1530:04

Episode 29: Choosing the Right Large Language Model for your AI Projects

2024-09-0326:56

Episode 28: Real Time Analytics with Apache Pinot and Startree

2024-08-2042:50

Episode 27: Enhancing RAG based Gen AI Applications with Unstructured Data

2024-07-0232:52

Episode 26: The Fourth Industrial Revolution & 100 Years of AI

2024-06-1341:14

00:00

Episode 40: AI Observabilty and Evaluation with Arize AI

#box-pro-ellipsis-176673803684891{-webkit-line-clamp:2;}Episode 40: AI Observabilty and Evaluation with Arize AI

Episode 40: AI Observabilty and Evaluation with Arize AI

Episode 40: AI Observabilty and Evaluation with Arize AI