Super Prompt: Generative AI
AI Safety: Constitutional AI vs Human Feedback

Update: 2024-06-17

Description

With great power comes great responsibility. How do leading AI companies implement safety and ethics as language models scale? OpenAI uses its Model Spec combined with RLHF (Reinforcement Learning from Human Feedback). Anthropic uses Constitutional AI. This solo episode on AI alignment covers the technical approaches to maximizing usefulness while minimizing harm.

REFERENCES

OpenAI Model Spec

https://cdn.openai.com/spec/model-spec-2024-05-08.html#overview

Anthropic Constitutional AI

https://www.anthropic.com/news/claudes-constitution



To stay in touch, sign up for our newsletter at https://www.superprompt.fm

Tony Wan