RAFT: Adapting Language Model to Domain Specific RAG

Update: 2024-06-28

Description

Adapting LLMs to specialized domains (e.g., recent news, enterprise private documents) is essential, so we discuss a paper that asks how to adapt pre-trained LLMs for RAG in those domains. SallyAnn DeLucia is joined by Sai Kolasani, researcher at UC Berkeley’s RISE Lab (and Arize AI intern), to talk about his work on RAFT: Adapting Language Model to Domain Specific RAG.

RAFT (Retrieval-Augmented FineTuning) is a training recipe that improves an LLM’s ability to answer questions in “open-book”, in-domain settings. Given a question and a set of retrieved documents, the model is trained to ignore documents that don’t help in answering the question (aka distractor documents). This, coupled with RAFT’s chain-of-thought-style response, helps improve the model’s ability to reason. In domain-specific RAG, RAFT consistently improves the model’s performance across the PubMed, HotpotQA, and Gorilla datasets, presenting a post-training recipe for adapting pre-trained LLMs to in-domain RAG.
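
To make the recipe above more concrete, here is a minimal Python sketch of how one fine-tuning example might be assembled: a question, a helpful ("golden") document mixed in with distractor documents, and a chain-of-thought-style answer as the training target. This is an illustration under stated assumptions, not the authors' implementation. The names `RaftExample` and `build_raft_example`, the prompt layout, and the `keep_golden_prob` parameter (dropping the golden document from some examples) are all hypothetical choices made for the sketch.

```python
import random
from dataclasses import dataclass


@dataclass
class RaftExample:
    prompt: str  # retrieved context plus the question, as shown to the model
    target: str  # chain-of-thought-style answer the model is trained to produce


def build_raft_example(question, golden_doc, distractor_pool, cot_answer,
                       num_distractors=3, keep_golden_prob=0.8, rng=None):
    """Assemble one training example (hypothetical helper, not from the paper's code).

    keep_golden_prob is an assumed knob: with this probability the golden
    document is included; otherwise the context holds only distractors.
    """
    rng = rng or random.Random()

    # Sample distractors: documents that do not help answer the question.
    docs = rng.sample(distractor_pool, num_distractors)

    # Include the golden document in most, but not necessarily all, examples.
    if rng.random() < keep_golden_prob:
        docs.append(golden_doc)

    # Shuffle so the model cannot rely on the position of the golden document.
    rng.shuffle(docs)

    context = "\n".join(f"[doc {i}] {d}" for i, d in enumerate(docs, start=1))
    prompt = f"{context}\n\nQuestion: {question}\nAnswer:"
    return RaftExample(prompt=prompt, target=cot_answer)
```

A dataset built this way would then be passed to an ordinary supervised fine-tuning loop. The intent is that the target answer reasons from the relevant document, so the model learns to disregard the distractors sitting alongside it in the prompt.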

Read it on the blog: https://arize.com/blog/raft-adapting-language-model-to-domain-specific-rag/

Learn more about AI observability and evaluation in our course, join the Arize AI Slack community or get the latest on LinkedIn and X.


