#14 Aligning AI models for healthcare | Understanding Reinforcement Learning from Human Feedback (RLHF)

Update: 2024-02-14

Description

How do we align AI models for healthcare? 👨‍⚕️ And importantly, the moral codes and ethics that we practice everyday, how does the LLM deal with ethical scenarios like the trolley problem for example? This is a fascinating topic and one we spend a lot of time thinking about.

In this episode Dev and Doc, Zeljko Kraljevic and I cover all the up to date topics around reinforcement learning, the benefits and where it can go wrong. We also discuss different RL methods including the algorithms used to train ChatGPT (RLHF).

Dev and Doc is a Podcast where developers and doctors join forces to deep dive into AI in healthcare. Together, we can build models that matter.

👨🏻‍⚕️Doc - Dr. Joshua Au Yeung - https://www.linkedin.com/in/dr-joshua...
🤖Dev - Zeljko Kraljevic https://twitter.com/zeljkokr

The podcast 🎙️
🔊Spotify: https://open.spotify.com/show/3QO5Lr3...
📙Substack: https://aiforhealthcare.substack.com/

Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :)

🎞️ Editor-
Dragan Kraljević https://www.instagram.com/dragan_kral...

🎨Brand design and art direction -
Ana Grigorovici
https://www.behance.net/anagrigorovic...00:00 Highlights
01:27 start
4:38 aligning ethics of ai models
7:04 doctors ethical choices daily
8:00 RLHF and AI training methods
16:29 reinforcement learning
19:35 Preference model -rewarding models correctly can make or break the success
27:05 exploiting reward function, model degradation (and how to fix it)

Ref
AI intro paper - https://pn.bmj.com/content/23/6/476
Open AI RLHF paper - https://arxiv.org/abs/1909.08593
War and peace of LLMs! - https://arxiv.org/abs/2311.17227

Comments

In Channel

#32 2025 in Review: Our AI Healthcare Predictions and Hot Takes

2025-12-2742:59

#31 AI & Digital Twins: The Next Evolution for Personalised Medicine

2025-12-1953:07

#30 The Age of AI agents in healthcare (Live Podcast at HETT 2025)

2025-10-2236:32

Everything you need to know about LLM benchmarks- Turing Test, OpenAI's Healthbench, ARC prize, LM arena

2025-08-2255:19

#28 AI agents explained - Manus AI, computer control, Agentic workflows (healthcare)

2025-05-0901:00:48

#27 Exploring Claude Sonnet 3.7 for healthcare

2025-02-2658:03

#26 Is it still worth doing a PhD in 2025? (Computer Science / Machine Learning)

2025-02-2156:41

#25 Testing Deepseek R1 on Complex Medical Tasks. Here's what we found. (GRPO explainer)

2025-02-0701:20:45

#24 Significantly advancing LLMs with RAG (Google's Gemini 2.0, Deep Research, notebookLM)

2025-01-1057:46

#23 Can OpenAI's GPT o1 solve complex medical problems?

2024-09-2039:44

#22 Explaining Explainable AI (for healthcare) with Dr Annabelle Painter (RSM digital health section Podcast)

2024-08-1558:40

#21 Foundational Models in Digital Pathology: Enhancing Cancer detection and outcomes

2024-08-0201:01:43

#20 How to build a successful healthTech/ BioTech start-up (2024 roadmap) - Derrick Khor

2024-07-1801:08:33

#19 Tracking health with technology and AI - demystifying digital biomarkers

2024-07-0401:03:36

#18 Keith Grimes - Startups and doctors, HealthTech consulting, Babylon's demise, Leadership theory

2024-05-3001:09:33

#17 How to build a clinically safe Large Language Model - Hippocratic AI, Llama3, Biollama

2024-05-0943:24

#16 Dev&Doc x Rewired - LLMs, Clinical foundation models and automating administrative tasks (live)

2024-03-2146:59

#15 The death of Prompt Engineering

2024-02-2934:52

#14 Aligning AI models for healthcare | Understanding Reinforcement Learning from Human Feedback (RLHF)

2024-02-1442:01

#13 Research begins when hype ends - Doc's adventure, LlaMa3 , Code LlaMa, Gemini Ultra

2024-02-0118:04

00:00

#14 Aligning AI models for healthcare | Understanding Reinforcement Learning from Human Feedback (RLHF)

#box-pro-ellipsis-176755210703936{-webkit-line-clamp:2;}#14 Aligning AI models for healthcare | Understanding Reinforcement Learning from Human Feedback (RLHF)

#14 Aligning AI models for healthcare | Understanding Reinforcement Learning from Human Feedback (RLHF)

Dev and Doc

#14 Aligning AI models for healthcare | Understanding Reinforcement Learning from Human Feedback (RLHF)