Deep Reinforcement Learning in the Real World with Anna Goldie

Update: 2024-03-26

Description

In this episode, you’ll explore the field of deep reinforcement learning and the ways it influences the real world with Anna Goldie, Senior Staff Research Scientist at Google DeepMind.

Anna’s current role has her working on Large Language Model (LLM) research for Gemini and Bard. Prior to that, she worked on reinforcement learning for LLMs and retrieval-augmented LLMs at Anthropic, and was co-founder/lead of the ML for Systems team in Google Brain.

During this wide-ranging discussion, you’ll learn about her contributions to the field of reinforcement learning, and how we can leverage reinforcement learning effectively for real world applications going forward.

Sponsored by: https://odsc.com/

Find more ODSC lightning interviews, webinars, live trainings, certifications, bootcamps here – https://aiplus.training/

Topics:

1. Professional journey and the key moments

2. Core principles of deep reinforcement learning

3. Deep reinforcement learning for chip design vs traditional approaches

4. Key complexities in modern chip design and how deep reinforcement learning can address these complexities

5. Discuss Google’s TPUs - Tensor Processing Units - built specifically for accelerating machine learning workloads.

6. The potential of Deep Reinforcement learning in computer systems or other domains within Google Deepmind.

7. Deep reinforcement learning use in Large Language Models (LLMs)

8. Reinforcement Learning from Human Feedback (RLHF), designing effective rewards and providing feedback at scale

9. Scalable supervision techniques, for developing methods to efficiently gather feedback that aligns the LLM with human preferences

10. Implement the Constitutional AI framework where AI models are guided by a set of foundational principles or 'constitutional' directives

11. How Retrieval Augment Generation (RAG) systems improve the accuracy and relevance of responses compared to standard large language models even LLMs with large retrieval context windows

12. How “RAPTOR: RECURSIVE ABSTRACTIVE PROCESSING FOR TREE-ORGANIZED RETRIEVALcompares to traditional RAG approaches that retrieve short, contiguous chunks?

13. Hierarchical summaries with RAPTOR

14. LLM Finetuning With Low-Rank Adaptation (LoRA)

15. Google's Gemini 1.5 next-generation LLM model and mixture of experts architecture

16. CALM—Composition to Augment Language Models

17. How dual undergraduate degrees in both computer science and linguistics from MIT has contributed to your innovative work in machine learning,

18. Constitutional AI at Antropic https://www.anthropic.com/news/claudes-constitution

19. What is the best way to follow your work?

20. Keynote address at ODSC East in mid-April.

SHOW NOTES

More about Anna Godie

https://www.linkedin.com/in/adgoldie/

https://www.annagoldie.com/

More about Constitutional AI at Antropic

https://www.anthropic.com/news/claudes-constitution

Constitutional AI: Harmlessness from AI Feedback

https://arxiv.org/pdf/2212.08073.pdf

More about Anna’s Paper

RAPTOR: RECURSIVE ABSTRACTIVE PROCESSING FOR TREE-ORGANIZED RETRIEVAL

https://openreview.net/pdf?id=GN921JHCRw

The official Code implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

https://github.com/parthsarthi03/raptor

More about Large Language Models

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

https://arxiv.org/abs/2305.18290

LLM Finetuning With Low-Rank Adaptation (LoRA)

https://lightning.ai/pages/community/article/lora-llm

CALM—Composition to Augment Language Models

https://arxiv.org/pdf/2401.02412.pdf

https://www.anthropic.com/news/claudes-constitution

Comments

In Channel

The Hardest Problem in AI: Evaluation in 2025 with Ian Cairns

2025-09-1245:59

The Most Neglected Tasks in Data Engineering with Veronika Durgin

2025-09-1248:32

Nick Walton: Creating Unique Narrative Experiences with AI in Gaming

2025-07-2437:16

AI Agents in Action: Memory, Messaging, and MCP with Michael Lanham

2025-07-2444:58

What No One Tells You About AI Infrastructure with Hugo Shi

2025-07-0435:27

Beyond Real: The Case for Synthetic Data + How to Win $100K with Alexandra Ebert

2025-06-2647:07

ODSC East 2025 Minisodes

2025-06-1746:50

"AI Can Predict Disease—So Why Aren’t Doctors Using It?" with Regina Barzilay

2025-05-2231:29

The AI Superintelligence Myth with Arvind Narayanan

2025-05-1250:41

Inside Probabilistic AI: Bayesian Modeling and PyMC with Thomas Wiecki

2025-05-0548:26

Can AI Simulate Life? Exploring World Models and Digital Organisms with Eric Xing

2025-05-0159:23

Rethinking RAG: Why AI Search Needs a New Architecture with Sid Probstein

2025-04-1853:29

AI Agents: The Shift from AI Assistants to Intelligent Automation

2025-04-1642:53

Making AI Make Sense with Graphs: Context, Connections, and GraphRAG

2025-04-0947:56

Beyond Prompt and Pray: Why Now is the Time for Everyone to Build with AI

2025-03-3155:05

Reasoning Models: Practical Insights from Ivan Lee

2025-03-2440:29

The AI-Powered Analyst: Skills You Need to Stay Relevant

2025-03-1255:50

How to Make More Reliable Predictions in Machine Learning with Brian Lucena

2025-03-0646:28

A Look at the Modern Data Science Practitioner in 2025 with Marck Vaisman

2025-02-2042:06

Hype vs. Reality: How DeepSeek R1 Is Reshaping AI – Insights from Sinan Ozdemir

2025-02-1354:19

00:00

Deep Reinforcement Learning in the Real World with Anna Goldie

#box-pro-ellipsis-17578151729355{-webkit-line-clamp:2;}Deep Reinforcement Learning in the Real World with Anna Goldie

Deep Reinforcement Learning in the Real World with Anna Goldie

Deep Reinforcement Learning in the Real World with Anna Goldie