DiscoverNo Priors: Artificial Intelligence | Technology | StartupsAndrej Karpathy on Code Agents, AutoResearch, and the Loopy Era of AI
Andrej Karpathy on Code Agents, AutoResearch, and the Loopy Era of AI

Andrej Karpathy on Code Agents, AutoResearch, and the Loopy Era of AI

Update: 2026-03-206
Share

Digest

The podcast explores the profound impact of AI agents on various aspects of technology and society. It begins with the concept of "AI psychosis," a feeling of being overwhelmed by rapid AI advancements, and how agents are transforming software development by allowing engineers to delegate tasks and focus on higher-level problem-solving. The discussion delves into the capabilities and limitations of these agents, introducing concepts like "claws" for persistent operations and "macro actions" for complex task orchestration. The conversation highlights the shift from being compute-bound to skill-bound, emphasizing the importance of mastering agent interaction. The future of AI agents is envisioned as increasingly sophisticated, with potential for advanced memory systems and distinct personalities. Personal projects like "Dobby D," an AI home automation system, showcase practical applications. The podcast speculates on an "agentic web" where agents orchestrate services, potentially rendering many current applications obsolete, and an "agent-first" approach to tool development. A significant portion of the discussion focuses on "auto research," where AI autonomously conducts research, leading to surprising discoveries and recursive self-improvement. This process is seen as outperforming human tuning in certain areas, prompting a re-evaluation of researchers' roles. The "jaggedness" of AI intelligence is examined, noting its brilliance in some domains and ineptitude in others, leading to discussions about AI speciation and the potential for specialized models. The conversation also touches upon the systemic risks of centralized AI development, advocating for a balance with open-source alternatives. The challenges and opportunities in robotics, the interface between physical and digital worlds, and the potential for information markets are explored. Finally, the podcast discusses the evolving nature of education, with agents becoming tools for learning and teachers focusing on unique human insights, and touches upon the economic implications, including the Jevons paradox and the ephemeral nature of code. The discussion concludes with promotional content for the podcast.

Outlines

00:00:00
The AI Revolution: Agents, Psychosis, and New Workflows

The conversation introduces "AI psychosis," a state of awe and disorientation due to rapid AI advancements, particularly with agents. This leads to a fundamental shift in workflows, where engineers delegate extensively to AI agents, transforming how code is written and tasks are managed. The discussion highlights the urgency to adapt to these evolving capabilities and the emergence of agents as virtual colleagues.

00:03:59
Macro Actions, Collaboration, and Resource Maximization

The concept of "macro actions" is explored, where agents handle complex software development tasks beyond single lines of code, involving multiple agents working across repositories. This leads to a discussion on maximizing token usage and subscription value, shifting the focus from computational power to token throughput as a key metric for AI interaction.

00:06:01
From Compute-Bound to Skill-Bound: Mastering AI Agents

The industry's perception shifts from being compute-bound to resource-bound by individual skill. This empowers individuals, as improvement is directly tied to learning and mastering agent interaction. The future of AI agents is envisioned as requiring mastery in managing multiple agents and sophisticated memory systems.

00:07:53
Agent Personalities and Practical Applications: Dobby D

The importance of agent personality in user experience is highlighted, contrasting dry coding agents with more engaging ones. A personal project, "Dobby D," an AI home automation system, is presented as a practical example of an AI "claw" controlling various home subsystems via natural language.

00:11:45
The Agentic Web and the Future of Tools

AI is reshaping user experience towards natural language interaction. The proliferation of single-purpose apps is questioned, suggesting agents could unify functionality through APIs, leading to an "agentic web." This implies an "agent-first" approach to tools, where APIs are the primary interface and agents orchestrate their use.

00:15:00
Vibe Coding, Distractions, and Auto Research

The current phase of "vibe coding" is acknowledged, where users guide AI, but the prediction is for AI to handle tasks autonomously soon. Distractions from rapid advancements and cautious integration due to security concerns are discussed. "Auto research" is introduced as a method to maximize token throughput by removing humans from the loop.

00:17:44
Recursive Self-Improvement and Automated Research Processes

The effectiveness of auto research is surprising, with LLM training seen as a playground for recursive self-improvement. Auto research has outperformed human tuning in discovering better model configurations. The ideal scenario involves systems where research proceeds autonomously, enabling continuous, hands-off operation.

00:19:43
Efficiency, Researcher Roles, and Meta-Optimization

Auto research efforts are expected to increase efficiency and provide direction for scaling AI models. Researchers may transition from enacting ideas to contributing them to a queue, with automated scientists generating and testing hypotheses. "Program MD" is introduced as a way to meta-optimize research organization and efficiency.

00:22:51
Layers of Abstraction, Caveats, and AI Intelligence Jaggedness

AI development is presented as layers of abstraction, leading to "infinite" possibilities and "skill issues." However, "AI psychosis" is tempered by the need for verifiable metrics and the acknowledgment that current models have rough edges and inconsistencies. AI intelligence exhibits "jaggedness," excelling in some areas while being inept in others.

00:25:12
Inefficiencies, Reinforcement Learning, and Decoupled Capabilities

Frustration arises from agent inefficiencies and unproductive loops. AI models trained via reinforcement learning struggle with nuanced tasks, excelling in verifiable domains. The decoupling of AI capabilities is observed, where optimization focuses on specific tasks, leaving softer skills underdeveloped.

00:27:22
Generalization, Monolithic Models, and AI Speciation

The question of whether code generation prowess translates to broader intelligence is raised, suggesting a decoupling of AI capabilities. The monolithic interface of current AI models is questioned, proposing specialization. The future may see more "speciation" of AI intelligences, driven by efficiency and compute constraints.

00:31:35
Frontier Models, AI Manipulation, and Cost-Effectiveness

Frontier labs may develop large, multitasking models, potentially hindering speciation. The science of manipulating AI models beyond context windows is developing, with fine-tuning being complex. For AI speciation to be worthwhile, it must be cost-effective, considering the difficulty in deep model adjustments.

00:32:58
Open Ground, Blockchain-Inspired AI Collaboration, and Distributed Computing

"Open ground" focuses on creating collaboration surfaces for AI research, facing challenges in parallelizing auto research and enabling untrusted workers. Blockchain-inspired systems with "commits" and "proof of work" allow verification of AI solutions. Distributed computing models offer ways for untrusted workers to contribute compute cycles.

00:36:45
Compute as Currency and AI's Job Market Impact

Compute power may become more valuable than traditional currency as it becomes a bottleneck. AI's impact on job markets is analyzed, with significant refactoring expected in digital information processing roles. The digital space will transform rapidly, while professions will evolve, requiring adaptation to AI tools.

00:40:52
Navigating the AI Job Market and Jevons Paradox

Keeping up with AI advancements is crucial, as AI is an empowering tool. The Jevons paradox suggests that as software becomes cheaper due to AI, demand will increase, potentially creating more software engineering jobs. Code is becoming more ephemeral, leading to a "rewiring" of the digital space.

00:43:43
Automation in Research and Frontier Lab Conundrums

Frontier labs are automating research processes, potentially displacing researchers. The focus shifts to building automation for higher-level decision-makers. Working within frontier labs presents ethical questions due to AI's potential societal impact and concerns about autonomy and transparency.

00:48:54
Open Source vs. Frontier AI and Centralization Concerns

Open-source models are closing the gap with frontier AI, but a lag persists. There's a demand for open platforms, but capital investment in frontier AI creates challenges. While open source democratizes access, frontier labs push boundaries. Centralization in frontier AI raises concerns, emphasizing the need for diverse perspectives.

00:54:18
Robotics Lag, Physical-Digital Interface, and Information Markets

Robotics, particularly self-driving, lags behind digital AI due to physical world complexities. The interface between physical and digital worlds is key, requiring sensors and actuators for AI interaction. Information markets, where agents trade data, could fuel AI development by providing training data.

00:59:38
Human-AI Symbiosis, Mechanized Training, and Micro-GPT

The book "Damon" inspires a vision of human-AI symbiosis. Training cycles and data collection need mechanization to remove humans from the loop. Micro-GPT, a simplified LLM implementation, aims to make AI more accessible and understandable.

01:02:58
Explaining to Agents, Markdown for Code, and Future Education

Education is shifting towards explaining concepts to agents, not just humans. Documentation should transition to agent-readable Markdown. Agents can aid in educational redirection, with teachers focusing on unique insights that AI cannot replicate, while agents handle routine tasks.

01:06:11
Podcast Promotion and Subscription Information

This segment includes promotional content for the podcast, encouraging listeners to follow on social media, subscribe to YouTube, and find episodes on various platforms, including email sign-ups and transcripts at No-Priors.com.

Keywords

AI Psychosis


A state of intense excitement and disorientation caused by the rapid and profound advancements in Artificial Intelligence, leading to a feeling of being overwhelmed by the pace of change and the potential implications.

Code Agents


AI-powered software entities designed to assist in or automate coding tasks. They can understand natural language instructions, write code, debug, and even manage complex software development workflows.

Claw Entities


Advanced AI agents with enhanced persistence and autonomous capabilities. They operate in dedicated sandboxes, manage complex tasks over extended periods, and often possess sophisticated memory systems.

Macro Actions


High-level commands or tasks that agents can execute, going beyond simple code snippets. These involve orchestrating multiple agents, managing repositories, and executing complex functionalities.

Token Throughput


A metric measuring the volume of data (tokens) processed by an AI model within a given time. Maximizing token throughput is crucial for efficient and cost-effective utilization of AI services.

Auto Research


An AI-driven process for automating research tasks, aiming to remove human bottlenecks. It involves setting objectives, metrics, and boundaries for agents to conduct research autonomously.

Recursive Self-Improvement


The concept of AI systems improving their own capabilities through iterative cycles of learning and refinement. This is a key area of research for advancing AI intelligence.

Agent Personality


The distinct characteristics and communication style of an AI agent. A well-crafted personality can significantly enhance user experience and engagement, making the agent feel more like a collaborator.

Agentic Web


A future vision of the internet where AI agents are the primary users and orchestrators of online services. APIs become the main interface, and agents act as intelligent intermediaries.

Speciation of AI


The emergence of diverse, specialized AI models tailored for specific tasks or domains, analogous to biological speciation. This contrasts with monolithic, general-purpose AI.

Q&A

  • What is "AI psychosis" and why is it relevant to the current state of AI development?

    AI psychosis refers to the intense feeling of being overwhelmed and disoriented by the rapid advancements in AI capabilities, particularly with the emergence of powerful agents. It highlights the dramatic shift in how individuals interact with and utilize AI, moving from manual tasks to extensive delegation.

  • How are AI agents changing the workflow of software engineers?

    AI agents are fundamentally altering software engineering by enabling engineers to delegate a significant portion of their tasks. This shift from writing code manually to instructing agents allows for a focus on higher-level problem-solving and macro-level actions, drastically increasing productivity.

  • What is the significance of "claws" in the context of AI agents?

    "Claws" represent advanced AI entities that offer enhanced persistence and autonomous operation. They function within their own environments, manage tasks over time, and often incorporate sophisticated memory systems, pushing the boundaries of agent capabilities beyond simple interactions.

  • How does the concept of "token throughput" relate to AI usage?

    Token throughput measures the amount of data an AI can process within a specific timeframe. Maximizing this is becoming crucial for efficient AI utilization, similar to maximizing GPU usage in the past. It reflects the user's ability to leverage AI resources effectively.

  • What is "auto research" and how does it aim to improve AI development?

    Auto research is a method for automating AI research by removing human bottlenecks. It sets objectives and metrics for agents to conduct research autonomously, aiming to accelerate discovery and development by allowing AI to explore possibilities without constant human oversight.

  • Why is agent personality considered important?

    An agent's personality significantly impacts user experience. A well-defined personality can make the AI feel more like a collaborator, enhancing engagement and trust. This contrasts with purely functional agents and can influence how users perceive and interact with AI.

  • What is the "agentic web" and how might it change the internet?

    The agentic web envisions a future where AI agents are the primary actors online, interacting through APIs. This could lead to a simplification of the internet, with agents orchestrating services and potentially making many current applications obsolete.

  • What are the potential implications of AI speciation?

    AI speciation suggests the development of diverse, specialized AI models for specific tasks. This could lead to more efficient and tailored AI solutions, moving away from monolithic models and potentially creating smaller, more focused AI intelligences.

  • What are the systemic risks associated with centralized AI development?

    Centralized AI development, where a few entities control cutting-edge technology, poses systemic risks. A lack of diversity in perspectives and potential for misuse are concerns, highlighting the importance of open-source alternatives for a balanced ecosystem.

  • How is robotics expected to evolve in relation to digital AI?

    Robotics is anticipated to lag behind digital AI due to the inherent complexities of manipulating the physical world ("atoms") compared to digital information ("bits"). However, the interface between physical and digital realms, using sensors and actuators, presents significant opportunities.

  • How might the role of teachers change with the advancement of AI agents?

    Teachers may shift their focus to imparting unique insights, intuition, and explanations that AI agents cannot easily replicate, while agents handle more straightforward tasks.

  • What is Micro-GPT and what challenges were encountered in its creation?

    Micro-GPT is an attempt to simplify complex AI models like GPT. The challenge lies in boiling down intricate concepts, such as neural networks, to their simplest forms, which agents struggled to achieve.

  • What is the proposed value-add for educators in an AI-driven educational landscape?

    The value-add for educators lies in providing the "few bits" of crucial understanding or unique explanatory methods that agents cannot generate, focusing on areas where human intuition and creativity are essential.

Show Notes

What happens when AI agents can design experiments, collect data, and improve — without a human in the loop? Andrej Karpathy joins Sarah Guo on the state of models, the future of engineering and education, thinking about impact on jobs, and his project AutoResearch: where agents close the loop on a piece of AI research (experimentation, training, and optimization, autonomously).




00:00 Andrej Karpathy Introduction


02:55 What Capability Limits Remain?


06:15 What Mastery of Coding Agents Looks Like


11:16 Second Order Effects of Natural Language Coding


15:51 Why AutoResearch 


22:45 Relevant Skills in the AI Era


28:25 Model Speciation


32:30 Building More Collaboration Surfaces for Humans and AI


37:28 Analysis of Jobs Market Data


48:25 Open vs. Closed Source Models


53:51 Autonomous Robotics


1:00:59 MicroGPT and Agentic Education


1:05:40 Conclusion



Comments 
In Channel
loading

Table of contents

00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Andrej Karpathy on Code Agents, AutoResearch, and the Loopy Era of AI

Andrej Karpathy on Code Agents, AutoResearch, and the Loopy Era of AI

Conviction