Palisade Research Podcast


Author: Palisade Research


Description

Interviews with AI researchers talking about the latest AI research
1 Episode
Marius Hobbhahn is the CEO and co-founder of Apollo Research. Through a joint research project with OpenAI, his team discovered that as models become more capable, they are developing the ability to hide their true reasoning from human oversight.

Jeffrey Ladish, Executive Director of Palisade Research, talks with Marius about this work. They discuss the difference between hallucination and deliberate deception and the urgent challenge of aligning increasingly capable AI systems.

Links:
Marius’ Twitter: https://twitter.com/mariushobbhahn
Apollo Research Twitter: https://twitter.com/apolloaievals
Apollo Research: https://www.apolloresearch.ai
Palisade Research: https://palisaderesearch.org/
Twitter/X: https://x.com/PalisadeAI
Anti-Scheming Project: https://www.antischeming.ai
Research paper “Stress Testing Deliberative Alignment for Anti-Scheming Training”: https://www.arxiv.org/pdf/2509.15541
Blog posts from OpenAI and Apollo:
https://openai.com/index/detecting-and-reducing-scheming-in-ai-models/
https://www.apolloresearch.ai/research/stress-testing-deliberative-alignment-for-anti-scheming-training/