Episode 40: AI Observability and Evaluation with Arize AI

Update: 2025-05-07
Description

AI can still hallucinate and give less-than-optimal answers. To address this, we are joined by Arize AI’s Co-Founder Aparna Dhinakaran for a discussion on Observability and Evaluation for AI. We begin by discussing the challenges of AI Observability and Evaluation. For example, how does “LLM as a Judge” work? We conclude with some valuable advice from Aparna for first-time entrepreneurs.

Begin Observing and Evaluating your AI Applications with Open Source Phoenix:

https://phoenix.arize.com/

AWS Hosts: Nolan Chen & Malini Chatterjee

Email Your Feedback: rethinkpodcast@amazon.com
