DiscoverMisreading Chat#124: GAIA: a benchmark for General AI Assistants
#124: GAIA: a benchmark for General AI Assistants

#124: GAIA: a benchmark for General AI Assistants

Update: 2023-12-22
Share

Description

LLM に解かせる難問集と採点結果を向井が睨みました。ご意見感想などは Reddit やおたより投書箱にお寄せください。iTunes のレビューや星もよろしくね。










<figure class="wp-block-audio"></figure>













Comments 
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

#124: GAIA: a benchmark for General AI Assistants

#124: GAIA: a benchmark for General AI Assistants

Jun Mukai