#034: The Year of CUDA Python: NVIDIA GTC 2025 Recap w/ Charles Frye

Update: 2025-03-26

Description

In this episode, we dive into NVIDIA's bold push to make Python a first-class citizen in its GPU ecosystem—what guest Charles Frye calls "The Year of CUDA Python." Charles, a developer advocate at Modal, recaps key takeaways from the 2025 NVIDIA GTC conference, spotlighting the growing centrality of Python across CUDA tooling, including the debut of Python-first libraries like cuTile and a fully reworked Python interface for CUTLASS.

We explore why NVIDIA is embracing Python for performance-critical development, how they’re addressing the challenges of Tensor Core programming, and what this all means for AI builders. Charles also breaks down NVIDIA’s hardware strategy shift—favoring scale-up over scale-out—and covers powerful new profiling tools like NSight Systems and Torch Profiler. Plus, we look at distributed inference innovations like Dynamo and how they intersect with platforms like Modal.

Whether you're GPU-curious or deep into LLM infrastructure, this conversation offers insight into how NVIDIA’s ecosystem is evolving—and why Python is at the center of it all.

Connect with Charles Frye

🧠 X (Twitter): @charles_irl

💻 Try Modal: https://modal.com

[CHAPTERS]

00:00 Start

01:35 The Year of CUDA Python

01:56 NVIDIA's Software Stack Evolution

03:01 Python's Growing Role in GPU Programming

06:11 CUTLASS and Python Integration

08:30 Tensor Cores and CUDA Complexity

12:02 Scaling Up vs. Scaling Out

18:01 AI Factory Concept

20:42 Hopper GPUs and New Generations

23:44 Memory-Bound Challenges in GPU Scaling

24:49 Performance and Tooling Insights

28:36 GPU Debugging Tools: Torch Profiler and NSight Systems

33:43 Dynamo: Distributed Inference for Language Models

39:37 Introducing the Modal Platform

43:10 How to Connect and Get Started with Moda

Comments

In Channel

#037: Build. Vibe. Sell: How Billy Turns Ideas into Apps with Replit | The Manny Bernabe Show #037

2025-04-1648:22

Vibe Coding in VC: Custom Tools and the New Wave of Founders

2025-04-0726:32

Vibe Coding: AI Superpowers for Every Builder with Matt Palmer (Replit)

2025-03-2849:12

#034: The Year of CUDA Python: NVIDIA GTC 2025 Recap w/ Charles Frye

2025-03-2644:52

#033 Jeff Croft: GenAI at Palantir, Enterprise Impact & The Power of Relationships

2024-05-0901:02:43

#032 Ari Kaplan: Moneyball Analytics, New Jobs, AI Automation, and Custom LLMs

2023-12-1958:53

#031 AI for All: How Small and Midsize Manufacturers Can Harness Analytics

2023-04-0701:04:54

#030 Unlocking Business Value with AI at the Edge (IoT) w/ Ben Jacques

2022-09-2953:15

#029 Musk, Thiel, Coups, Hackers, Fraud and the Founding of PayPal w/ Jimmy Soni

2022-09-1455:41

#028 AI in Space: Solving Machine Learning’s Last Mile Challenge w/ Vid Jain

2022-09-0256:08

#027 AI in MedTech: Challenges & Opportunities w/ Adam King

2022-08-2548:47

#026 Selling Digital Transformation w/ Charlotte Fuller

2022-07-1351:26

#025 How VC's Vet AI Investments w/ Michael Bommarito

2022-06-2701:06:58

#024 Why AI Investments Fail to Deliver on Business Value w/ Dorian Smiley

2022-06-1458:02

#022 Building IoT Digital Twins With Palantir Foundry w/ Kai Altstaedt

2022-06-0757:31

#021 Tesla Cyber Rodeo: AI Highlights (FSD, Tesla Bot, RoboTaxi)

2022-06-0710:29

#018 Reaction to Palantir Chief Architect AMA: Data Integration and AI

2022-06-0720:10

#020 How AI Will Revolutionize Healthcare w/ Brandon Cosley

2022-06-0758:47

#019 AI for Manufacturing

2022-06-0746:22

#017 AI is overrated, start with Automation! w/ Doug Shannon

2022-06-0751:11

00:00

#034: The Year of CUDA Python: NVIDIA GTC 2025 Recap w/ Charles Frye

#box-pro-ellipsis-176070110765321{-webkit-line-clamp:2;}#034: The Year of CUDA Python: NVIDIA GTC 2025 Recap w/ Charles Frye

#034: The Year of CUDA Python: NVIDIA GTC 2025 Recap w/ Charles Frye

Manny

#034: The Year of CUDA Python: NVIDIA GTC 2025 Recap w/ Charles Frye