DiscoverPaul, Weiss Waking Up With AIModel Metrics: Benchmarking AI
Model Metrics: Benchmarking AI

Model Metrics: Benchmarking AI

Update: 2025-05-15
Share

Description

In this episode of "Paul, Weiss Waking Up With AI," Katherine Forrest and Anna Gressel discuss AI benchmarking, exploring how these standardized tests evaluate AI models against each other and human capabilities, helping developers and deployers assess performance, safety and progress toward artificial general intelligence.




##


Learn More About Paul, Weiss’s Artificial Intelligence practice:

https://www.paulweiss.com/industries/artificial-intelligence

Comments 
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Model Metrics: Benchmarking AI

Model Metrics: Benchmarking AI

Paul, Weiss