Orca: Progressive Learning from Complex Explanation Traces of GPT-4

Update: 2023-07-21

Description

Deep Papers is a podcast series featuring deep dives on today’s seminal AI papers and research. Hosted by AI Pub creator Brian Burns and Arize AI founders Jason Lopatecki and Aparna Dhinakaran, each episode profiles the people and techniques behind cutting-edge breakthroughs in machine learning.

In this episode, we talk about Orca. Recent research focuses on improving smaller models through imitation learning using outputs from large foundation models (LFMs). Challenges include limited imitation signals, homogeneous training data, and a lack of rigorous evaluation, leading to overestimation of small model capabilities.

To address this, Orca is a 13-billion parameter model that learns to imitate LFMs’ reasoning process. Orca leverages rich signals from GPT-4, surpassing state-of-the-art models by over 100% in complex zero-shot reasoning benchmarks. It also shows competitive performance in professional and academic exams without CoT. Learning from step-by-step explanations, generated by humans or advanced AI models, enhances model capabilities and skills.

Full transcript and more here: https://arize.com/blog/orca-progressive-learning-from-complex-explanation-traces-of-gpt-4-paper-reading/

Learn more about AI observability and evaluation, join the Arize AI Slack community or get the latest on LinkedIn and X.

Comments

In Channel

TUMIX: Multi-Agent Test-Time Scaling with Tool-Use Mixture

2025-11-2423:44

Meta AI Researcher Explains ARE and Gaia2: Scaling Up Agent Environments and Evaluations

2025-11-1022:34

Georgia Tech's Santosh Vempala Explains Why Language Models Hallucinate, His Research With OpenAI

2025-10-1431:24

Atropos Health’s Arjun Mukerji, PhD, Explains RWESummary: A Framework and Test for Choosing LLMs to Summarize Real-World Evidence (RWE) Studies

2025-09-2226:22

Stan Miasnikov, Distinguished Engineer, AI/ML Architecture, Consumer Experience at Verizon Walks Us Through His New Paper

2025-09-0648:11

Small Language Models are the Future of Agentic AI

2025-09-0531:15

Watermarking for LLMs and Image Models

2025-07-3042:56

Self-Adapting Language Models: Paper Authors Discuss Implications

2025-07-0831:26

The Illusion of Thinking: What the Apple AI Paper Says About LLM Reasoning

2025-06-2030:35

Accurate KV Cache Quantization with Outlier Tokens Tracing

2025-06-0425:11

Scalable Chain of Thoughts via Elastic Reasoning

2025-05-1628:54

Sleep-time Compute: Beyond Inference Scaling at Test-time

2025-05-0230:24

LibreEval: The Largest Open Source Benchmark for RAG Hallucination Detection

2025-04-1827:19

AI Benchmark Deep Dive: Gemini 2.5 and Humanity's Last Exam

2025-04-0426:11

Model Context Protocol (MCP)

2025-03-2515:03

AI Roundup: DeepSeek’s Big Moves, Claude 3.7, and the Latest Breakthroughs

2025-03-0130:23

How DeepSeek is Pushing the Boundaries of AI Development

2025-02-2129:54

Multiagent Finetuning: A Conversation with Researcher Yilun Du

2025-02-0430:03

Training Large Language Models to Reason in Continuous Latent Space

2025-01-1424:58

LLMs as Judges: A Comprehensive Survey on LLM-Based Evaluation Methods

2024-12-2328:57

00:00

Orca: Progressive Learning from Complex Explanation Traces of GPT-4

#box-pro-ellipsis-176683627637921{-webkit-line-clamp:2;}Orca: Progressive Learning from Complex Explanation Traces of GPT-4

Orca: Progressive Learning from Complex Explanation Traces of GPT-4

Arize AI

Orca: Progressive Learning from Complex Explanation Traces of GPT-4