Evaluations: Trust, performance, and price (bonus, announcing RewardBench)

Update: 2024-03-21

Description

Evaluation is not only getting harder with modern LLMs; it is getting harder because it now means something different.
This is AI-generated audio made with Python and 11Labs. Music generated by Meta's MusicGen.
Source code: https://github.com/natolambert/interconnects-tools
Original post: https://www.interconnects.ai/p/evaluations-trust-performance-and-price

00:00 Evaluations: Trust, performance, and price (bonus, announcing RewardBench)
03:14 The rising price of evaluation
05:40 Announcing RewardBench: The first reward model evaluation tool
08:37 Updates to RLHF evaluation tools

YouTube code intro: https://youtu.be/CAaHAfCqrBA

Figure 1: https://huggingface.co/datasets/natolambert/interconnects-figures/resolve/main/evals/img_026.png
Figure 2: https://huggingface.co/datasets/natolambert/interconnects-figures/resolve/main/evals/img_030.png
Figure 3: https://huggingface.co/datasets/natolambert/interconnects-figures/resolve/main/evals/img_034.png
Figure 4: https://huggingface.co/datasets/natolambert/interconnects-figures/resolve/main/evals/img_040.png

Nathan Lambert