High Agency: The Podcast for AI Builders

34 Episodes

How Graphite's $50M Series B is Transforming AI Code Review

2025-05-20 · 43:15

Merrill Lutsky, co-founder and CEO of Graphite, discusses their evolution from stack diff workflows to Diamond, an AI code review agent that just helped secure their $50M Series B. He shares insights on building reliable AI review systems, why over-generating and pruning comments works better than single responses, and the shift from RAG to agentic code browsing. Merrill offers a provocative vision where developers define requirements and AI agents build the code, potentially eliminating traditional IDE coding. This episode provides valuable perspectives on how AI is fundamentally reshaping software development workflows and engineering roles.

Chapters:
00:00 - Introduction and Graphite overview
01:58 - Evolution from stack diffs to AI review
07:39 - Diamond: The AI code reviewer explained
10:13 - Human vs AI review: Finding the balance
11:44 - Engineering challenges of reliable AI review
17:38 - Over-generate and prune: A winning strategy
24:49 - From RAG to code browser agents
28:12 - The bitter lesson of AI engineering
30:48 - The future of software engineering
37:33 - Is AI over or under-hyped?

The End of Language-Only Models | Amit Jain, Luma AI

2025-05-13 · 40:17

This week Raza is joined by Amit Jain, CEO and co-founder of Luma AI, to explore why the future of artificial intelligence lies beyond language. Amit shares Luma’s bold mission to build world models through multimodal training and why video is the most overlooked and critical data source in AI today.

Chapters:
00:00 - Introduction
03:40 - Competing with Big AI Labs: Language vs. Multimodality
08:09 - Joint Training and Why Current Multimodal Models Fall Short
11:01 - Language is Discrete, the World is Continuous
14:36 - Do These Models Have World Models?
18:18 - Planning, Counterfactuals, and Causal Reasoning in AI
22:08 - Capabilities of Ray 2 and Real-World Use Cases
26:14 - Rethinking Video Length and Creative Workflows
29:18 - Solving Coherence Across Shots and Characters
30:00 - When Will AI Create a Feature-Length Film?
31:27 - What You Can Build with Luma’s API Today
35:49 - Overlooked Ideas and Noise in the AI Industry
38:34 - Why Video is the Missing Link in AI

From 0 to $40M in 5 Months: Bolt.new Story with Eric Simons

2025-04-03 · 41:33

Eric Simons discusses the meteoric rise of Bolt.new, an AI-powered web app builder that went from zero to $40 million ARR in just five months. He shares insights on how they built an AI agent capable of creating full-stack web applications from simple prompts, the challenges of rapid growth, and the future of AI in software development. From nearly shutting down the company to becoming one of the fastest-growing AI products in history, Eric offers valuable lessons for anyone building in the AI space.

Chapters:
00:00 - Introduction and Bolt.new overview
06:05 - The journey from near-shutdown to rapid growth
13:28 - Challenges of explosive growth and scaling
18:50 - Technical deep dive: Building Bolt.new
26:37 - Debugging and improving AI-generated code
32:09 - Future directions and enterprise adoption
34:11 - Advice for building AI applications
37:03 - The concept of "vibe revenue" in AI startups
39:39 - Is AI over or under-hyped?

Humanloop is the LLM evals platform for enterprises. We give you the tools that top teams use to ship and scale AI with confidence. To find out more go to humanloop.com

Saving Pharma Companies Billions with AI | Patrick Leung from Faro Health

2025-03-21 · 48:04

In this episode of High Agency, Patrick Leung from Faro Health explains how they're using AI to revolutionize clinical trial design by both generating regulatory documents and extracting insights from thousands of existing trials. Patrick emphasises the essential collaboration between clinical experts and AI engineers when building reliable systems in healthcare's high-stakes environment.

Chapters:
00:00 - Introduction
04:26 - Clinical trials before: Microsoft Word Documents
08:17 - Document generation using AI
12:26 - What makes clinical trials so expensive
16:26 - Parsing and processing clinical trial data
18:04 - Challenges with traditional evaluation metrics
21:28 - Importance of domain experts in the evaluation process
24:35 - Collaboration between domain experts and engineering
31:26 - Building a graph-based knowledge system
34:27 - Roles and skillsets required
38:06 - Lessons learned building LLM products
40:56 - Discussion on AI capabilities and limitations
46:07 - Is AI overhyped or underhyped

100x Hiring Speed with Superhuman Recruiters | Metaview Co-Founder

2025-03-07 · 53:07

In this episode, Raza is joined by Shahriar Tajbakhsh, the co-founder of Metaview. They discuss how Metaview’s AI scribe automates interview note-taking, how AI agents can surface top candidates from thousands of resumes, and why hiring managers should think of AI as a co-worker, not just a tool. Raza's recommended reading: Creating a LLM-as-a-Judge That Drives Business Results.

Chapters:
00:00 - Introduction
03:32 - How AI Co-Workers Are Transforming Recruiting
06:21 - Inside Metaview: AI Scribe and Workflow Automation
09:11 - Unlocking Hiring Insights with AI-Driven Conversations
11:30 - Balancing AI Innovation and User Adoption
14:05 - Metaview’s Tech Stack and the Role of LLMs
18:29 - How Metaview Generates Superhuman Interview Notes
23:18 - The Challenges of Building Reliable AI Hiring Agents
32:40 - The Future of AI in Hiring: Automating Job Descriptions
40:26 - AI Co-Workers That Work While You Sleep
47:08 - Why Vertical AI Will Win Over General AI Agents
50:24 - The Underrated Power of Graph-Based AI

AI Will Replace Command Lines | Ex-Google Tech Lead and Founder at Warp

2025-02-21 · 47:45

In this episode, Raza Habib chats with Zach Lloyd, CEO and founder of Warp, about how AI is transforming the developer experience. They explore how Warp is reimagining the command line, the power of AI-driven automation, and what the future holds for coding workflows.

Chapters:
00:00 - Introduction
04:06 - Why the terminal needed reinvention
07:11 - AI’s role in Warp’s evolution
08:55 - Key AI features in Warp
12:49 - Balancing safety, reliability, and usability
19:43 - Challenges in AI-Powered development
22:33 - Changing developer behavior with AI
27:24 - Prompt engineering and context optimization
31:05 - Lessons for building AI products
37:50 - The future of AI in software development
46:42 - Underappreciated AI innovations

Google Is Dead: How This 144-GPU Startup Is Building Einstein-Level AI Search | Will Bryk | Exa CEO

2025-02-07 · 38:44

Will Bryk, CEO of Exa, sits down with Raza Habib to reveal why traditional search engines are becoming obsolete and how his startup is building an AI-powered search engine for the future. From constructing a massive GPU cluster to predicting AI will surpass human mathematicians by 2026, Will shares fascinating insights about the technological breakthroughs that will reshape society in the coming months.

Chapters:
00:00 - Introduction
05:13 - Exa as a Tool for LLMs and Neural Search
06:19 - Introducing "Websets" and Its Use Cases
10:16 - Building a Compute Cluster: Why Own vs. Rent?
12:00 - The Bitter Lesson and Scalability in AI
17:11 - Interesting Use Cases for Exa
19:44 - People Search and CRM Opportunities
21:10 - Predictions for AI Progress and Test-Time Compute
27:10 - Implications of AI on Creative Tasks and Society
29:15 - Automation, Jobs, and the Knowledge Economy
33:57 - What Could Stop AI Progress?
36:22 - Advice for AI Builders and Entrepreneurs

$100M raised: How Decagon is building better AI agents | Jesse Zhang

2025-01-22 · 41:45

In this episode, Jesse Zhang joins Raza to discuss building cutting-edge AI agents for customer support. They explore how his early passion for LLMs led to creating a company that’s transforming the way businesses like Rippling, Duolingo, and Webflow interact with customers. Jesse breaks down the challenges of scaling AI systems, the importance of customer feedback, and his predictions for the future of AI.

Chapters:
00:00 - Introduction and Jesse Zhang's Background
01:17 - First Exposure to LLMs and Building Early Projects
04:32 - Decagon’s Rapid Growth and Differentiation in AI
06:37 - Understanding Decagon’s AI Customer Support Product
10:21 - Challenges in Building High-Performance AI Systems
13:14 - Evolution from Simple RAG to Agent Architectures
16:54 - Measuring Accuracy with Evals and Customer Feedback
19:05 - Balancing Customization and Reusability Across Clients
22:35 - Handling Customer Data and Incremental Deployment
25:21 - Restructuring Support Teams for AI Integration
27:03 - Team Composition and the Role of Domain Expertise
29:19 - Advice for New AI Builders: Customer-Driven Development
32:21 - Key Insights on AI Agents and Enterprise Adoption
36:34 - Predictions for AI Advancements in 2025
39:41 - Is AI Overhyped or Underhyped?
41:07 - Closing Remarks and Final Thoughts

How GitHub Copilot Became the First LLM-Powered Developer Tool with Ryan Salva

2025-01-07 · 38:53

On this week's episode, former GitHub Copilot lead Ryan Salva breaks down how AI coding tools became ubiquitous almost overnight. They discuss the critical differences between what novice and expert developers expect from AI, why starting with predictive text was both a blessing and a curse, and how the rapid adoption of AI assistance is reshaping the future of software development.

Chapters:
00:00 - Introduction
01:09 - The Creation of GitHub Copilot
05:39 - From Prototype to Product: Challenges in Scaling
07:37 - How GitHub Copilot Works Behind the Scenes
11:18 - Metrics That Matter: Evaluating AI Success
14:43 - Building Momentum: What It Feels Like to Launch a Hit
17:51 - The Evolution of AI Tools for Developers
21:13 - Evaluations and Testing in AI Development
26:00 - The Role of Automation and the Future of Coding
30:53 - Will Engineers Still Write Code in the Future?
33:16 - Advice for Aspiring AI Builders
36:51 - Is AI Overhyped or Underhyped?
38:17 - Closing Reflections

What Gives an AI Founder Staying Power | James Theuerkauf, CEO of Syrup Tech | Sara Ittelson, Partner at Accel

2024-12-27 · 43:36

In this week's episode, Raza speaks with James Theuerkauf, CEO of Syrup Tech, and Sara Ittelson, Partner at Accel, to explore the challenges and opportunities for entrepreneurs in this transformative era. They discuss building AI-first companies and the lessons learned from scaling in a rapidly evolving space. With practical tips on leveraging data, creating competitive advantages, and sustaining passion for the long haul, this episode offers invaluable guidance for founders in AI.

Chapters:
00:00 - Introduction and Guest Backgrounds
01:27 - Syrup Tech’s Approach to AI in Retail
03:29 - The Role of AI in Demand Forecasting
08:49 - Building Effective AI Systems and Teams
15:30 - How Generative AI is Shaping Businesses
19:18 - Advice for Founders in the AI Era
28:15 - Building an AI-First Company
33:26 - Innovations and Trends in AI
38:47 - Is AI Overhyped or Underhyped?
42:46 - Closing Thoughts and Reflections

How to build great AI products with Vanta Software Developer Noam Rubin

2024-12-18 · 40:57

In this episode, Noam Rubin, a Software Developer at Vanta, reveals how his team uses data-driven strategies to design, test, and improve cutting-edge AI features. Learn how customer insights, rapid prototyping, and iterative development transform raw ideas into tools that make compliance and security easier for businesses everywhere.

Chapters:
00:00 - Introduction
02:47 - The process of building AI products at Vanta
04:51 - The role of customer feedback in product development
06:59 - Integrating AI into security and compliance workflows
08:06 - Using data specifications to guide product development
10:10 - Collaborating with subject matter experts to refine AI models
12:14 - Iterative testing and refining AI features
14:10 - Quality control and ensuring AI accuracy
16:00 - The importance of dogfooding and internal feedback loops
18:23 - Scaling AI features and rolling them out to wider audiences
20:50 - Educating engineers and democratizing AI at Vanta
22:20 - Key lessons learned from building AI products
24:12 - Maintaining AI quality through continuous feedback
26:00 - The future of AI in business and product development

Predictions for AI in 2025 | Ex-OpenAI, Ex-Stripe researcher Stanislav Polu

2024-12-11 · 44:27

In this episode of High Agency, former OpenAI researcher Stan Polu shares his journey from AI research to founding Dust, an enterprise AI platform. Stan offers a contrarian view on the future of AI, suggesting we may be hitting a plateau in model capabilities since GPT-4. He discusses why startups should focus on product-market fit before investing in GPUs, shares practical lessons for building AI products, and predicts increased competition between AI labs and API developers.

Chapters:
00:00 - Introducing Dust: an enterprise AI platform
06:07 - From Stripe to OpenAI: Stan's journey
10:29 - Why research wasn't enough: building Dust
15:10 - Best practices for building an AI product
20:50 - Is prompt engineering here to stay
23:40 - Understanding language models and their limitations
32:56 - Predictions for AI in 2025
39:53 - Measuring progress toward AGI
42:26 - The true value of AI technology

How Replicate is Democratizing AI with Open-Source Resources

2024-11-13 · 36:15

In this episode, we explore how Replicate is breaking down barriers in AI development through its open-source platform. CEO Ben Firshman shares how Replicate enables developers without machine learning expertise to run AI models in the cloud.

Chapters:
00:00 - Introduction
00:29 - Overview of Replicate
03:13 - Replicate's user base
05:45 - Enterprise use cases and lowering the AI barrier
07:45 - The complexity of traditional AI deployment
10:24 - Simplifying AI with Replicate's API
13:50 - ControlNets and the challenges of image models
19:42 - Fragmentation in AI models: images vs. language
25:05 - Customization and multi-model pipelines in production
26:33 - Learning by doing: skills for AI engineers
28:44 - Applying AI in governments
31:12 - Iterative development and co-evolution of AI specs
33:13 - Final reflections on AI hype
35:18 - Conclusion

Humanloop is an Integrated Development Environment for Large Language Models. It enables product teams to develop LLM-based applications that are reliable and scalable. To find out more go to humanloop.com

The Principles for Building Excellent AI Features with Superhuman’s Lorilyn McCue

2024-11-07 · 42:35

How do you build AI tools that actually meet users’ needs? In this episode of High Agency, Raza speaks with Lorilyn McCue, the driving force behind Superhuman’s AI-powered features. Lorilyn lays out the principles that guide her team’s work, from continuous learning to prioritizing user feedback. Learn how Superhuman’s "learning-first" approach allows them to fine-tune features like Ask AI and AI-driven summaries, creating practical solutions for today’s professionals.

Chapters:
00:00 - Introduction
04:20 - Overview of Superhuman
06:50 - Instant Reply and Ask AI
10:00 - Building On-Demand vs. Always-On AI Features
13:45 - Prompt Engineering for Effective Summarization
22:35 - The Importance of Seamless AI Integration in User Workflows
25:10 - Developing Advanced Email Search with Contextual Reasoning
29:45 - Leveraging User Feedback
32:15 - Balancing Customization and Scalability in AI-Generated Emails
36:05 - Approach to Prioritization
39:30 - Real-World Use Cases: The Versatility of Current AI Capabilities
43:15 - Learning and Staying Updated in the Rapidly Evolving AI Field
46:00 - Is AI Overhyped or Underhyped?
49:20 - Final Thoughts and Closing Remarks

Jeff Huber of Chroma: Building the open-source toolkit for AI Engineering

2024-10-24 · 54:59

This week on High Agency, Raza Habib is joined by Chroma founder Jeff Huber. They cover the evolution of vector databases in AI engineering, challenge common assumptions about RAG, and share insights from Chroma's journey, including the team's focus on developer experience and observations about real-world usage patterns. They also get into whether we can expect a super AI any time soon and what is over- and under-hyped in the industry today.

Chapters:
00:00 - Introduction
02:30 - Why vector databases matter for AI
06:00 - Understanding embeddings and similarity search
12:00 - Chroma early days
15:45 - Problems with existing vector database solutions
19:30 - Workload patterns in AI applications
23:40 - Real-world use cases and search applications
27:15 - The problem with RAG terminology
31:45 - Dynamic retrieval and model interactions
35:30 - Email processing and instruction management
39:15 - Context windows vs vector databases
42:30 - Enterprise adoption and production systems
45:45 - The journey from GPT-3 to production AI
48:15 - Internal vs customer-facing applications
51:00 - Advice for AI engineers

How to Create AI Strategy in Enterprises with Peter Gostev from Moonpig

2024-10-16 · 39:54

In this episode of the High Agency podcast, Peter Gostev shares his experiences implementing LLMs at NatWest and Moonpig. He discusses creating an AI strategy, talks about challenges in deploying LLMs in large organizations, and shares thoughts on underappreciated AI developments.

Chapters:
00:00 - Introduction
00:44 - OpenAI dev day reactions
03:47 - Using AI to automate customer service
10:43 - Impact of AI products
13:41 - Who are the users of LLMs
14:47 - Challenges building with AI in a large enterprise
21:22 - AI use cases at Moonpig
24:34 - How to create an AI strategy
28:10 - Underappreciated AI developments

Ex-Coinbase CPO's Next Big Thing: AI Employees | Surojit Chatterjee

2024-10-02 · 44:43

In this episode of High Agency, we're joined by Surojit Chatterjee, former CPO of Coinbase and now CEO of Ema. Surojit unveils his audacious plan to create universal AI employees and revolutionize the Fortune 1000 workforce. Drawing from his career at tech giants like Google and Coinbase, he shares how these experiences fueled his vision for Ema. Surojit dives into the challenges of building AI agents, explores the concept of artificial humans, and predicts how this technology could transform the future of SaaS.

Chapters:
(00:00:00) Introduction and Surojit’s background
(00:03:00) Founding story of Ema (Universal AI Employee)
(00:04:53) How the Universal AI Employee works
(00:08:39) Ema’s data integration and security
(00:12:57) AI employee use cases in enterprises
(00:15:02) Challenges with building AI agents
(00:16:45) Evaluations, hallucinations, customizing models
(00:19:52) Artificial human metaphor
(00:25:42) AI employee vs humans
(00:31:25) Advice for AI builders
(00:37:14) Is AI overhyped or underhyped?
(00:39:28) How the business model of SaaS will change

Why Your AI Product Needs Evals with Hamel Husain and Swyx

2024-09-25 · 01:09:02

Hamel Husain is a seasoned AI consultant and engineer with experience at companies like GitHub, DataRobot, and Airbnb. He is a trailblazer in AI development, known for his innovative work in literate programming and AI-assisted development tools. Shawn Wang (aka Swyx) is the host of the Latent Space podcast, the author of the essay 'Rise of the AI Engineer,' and the founder of the AI Engineer World Fair. In this episode, Hamel and Swyx share their unique insights on building effective AI products, the critical importance of evaluations, and their vision for the future of AI engineering.

Chapters:
00:00 - Introduction and recent AI advancements
06:14 - The critical role of evals in AI product development
15:33 - Common pitfalls in AI product development
26:33 - Literate programming: A new paradigm for AI development
39:58 - Answer AI and innovative approaches to software development
51:56 - Integrating AI with literate programming environments
58:47 - The importance of understanding AI prompts
01:00:37 - Assessing the current state of AI adoption
01:07:10 - Challenges in evaluating AI models

How AI is Changing Product Management with Raz Nussbaum from Gong AI

2024-09-18 · 30:03

Raz Nussbaum is a Senior Product Manager in AI at Gong, the leading AI platform for revenue teams. He is an absolute legend when it comes to building and scaling AI products that genuinely deliver value. In this episode, he opens up about what it takes to build successful AI products in an era where things change at lightning speed.

Chapters:
00:00 - Introduction
01:16 - How LLMs Changed Product Development at Gong AI
08:32 - Including Product Managers in Development Process
13:05 - Testing and Monitoring Pre vs Post-deployment
17:53 - New Challenges in the Face of Generative AI
19:39 - Shipping Fast and Interacting with the Market
23:25 - What's Next For Gong AI
25:13 - The Psychology of Trusting AI
28:19 - Is AI Overhyped or Underhyped?

From Fiction to Reality: Sudowrite's Journey in AI-Assisted Creative Writing

2024-09-11 · 56:43

In this episode, we dive deep into the world of AI-assisted creative writing with James Yu, founder of Sudowrite. James shares the journey of building an AI assistant for novelists, helping writers develop ideas, manage complex storylines, and avoid clichés. James gets into the backlash the company faced when they first released Story Engine and how they're working to build a community of users.

Chapters:
00:00 - Introduction and Background of Sudowrite
02:26 - The Early Days: Concept, Skepticism, and User Adoption
05:20 - Sudowrite's Interface, Features, and User Base
10:23 - Developing and Iterating Features in Sudowrite
17:29 - The Evolution of Story Bible and Writing Assistance
24:27 - Challenges in Maintaining Coherence and AI-Assisted Writing
29:12 - Evaluating AI Features and the Role of Prompt Engineering
33:35 - Handling Tropes, Clichés, and Fine-Tuning for Author Voice
40:43 - The Controversy and Future of AI in Creative Work
51:37 - Predictions for AI in the Next Five Years
