Inside Google’s New AI Stack with Paige Bailey
Description
In this episode of the ODSC AiX Podcast, host Sheamus McGovern reconnects with Paige Bailey, Engineering Lead at Google DeepMind for the Developer Experience team. Paige shares how the Gemini ecosystem has evolved since her last appearance, including the launch of Gemini 2.5 DeepThink, multimodal video generation with Veo 3, real-time music creation with Lyria RT, and groundbreaking advances in agentic and on-device AI systems. The conversation explores the rapid rise of agent-based workflows, AI-powered robotics, and the growing divide between cutting-edge tools and real-world adoption.
Key Topics Covered:
- Gemini 2.5 DeepThink & Reasoning Models
- The model that won gold at the International Mathematical Olympiad (IMO)
- Use cases for DeepThink, Pro, Flash, and FlashLite variants
- Using Gemini Live API for real-world robotics and decision planning
- Role of multimodal inputs (video, audio, text) in enabling embodied AI
- On-Device AI & Ubiquity
- Implications for edge deployment, cost reduction, and accessibility
- Veo 3: Multimodal Video Generation
- Lyria RT: Real-Time Music Generation
- Gemini Live API & Voice Interfaces
- Real-time bidirectional voice, screen understanding, and tool calling
- Rise of voice as the dominant AI interface
- Use of SynthID and digital watermarking to detect deepfakes
- Future of AI-agent orchestration via MCP servers
Memorable Outtakes:
- On the pace of model development: “A 4-billion parameter model on-device now outperforms our best cloud model from six months ago. That’s pretty magical.” — Paige Bailey
- On the role of AI agents in robotics: “You can say, ‘Hey robot, go get me that apple,’ and Gemini will plan the task, route it, and call the right control models.” — Paige Bailey
- On the AI adoption gap: “In the Bay Area, we use AI hourly. But when I talk to developers in the Midwest, they often aren’t using it at all.” — Paige Bailey
References & Resources:
- Paige Bailey
- Dynamic Web Paige: https://webpaige.dev/
- LinkedIn: https://www.linkedin.com/in/dynamicwebpaige
- GitHub: https://github.com/dynamicwebpaige
- Medium: https://medium.com/@dynamicwebpaige
- Previous podcast with Paige: https://podcasters.spotify.com/pod/show/ai-x-podcast/episodes/Googles-AI-Powered-Tools-for-Data-Scientists-Building-the-Automated-Future-of-Data-Science-with-Paige-Bailey-e2p3t6e
- International Mathematical Olympiad (IMO): https://www.imo-official.org
- Model Context Protocol (MCP): https://modelcontextprotocol.io/docs/getting-started/intro
- Gemini 2.5 Deep Think: https://blog.google/products/gemini/gemini-2-5-deep-think/
- Veo 3: https://deepmind.google/technologies/veo/
- Lyria RT & Music AI Sandbox: https://deepmind.google/technologies/lyria/
- SynthID & Deepfake Watermarking: https://deepmind.google/technologies/synthid/
- Gemma Models: https://ai.google.dev/gemma
- Gemini Live API Docs: https://ai.google.dev/gemini-api/docs/live
- Google AI Studio: https://ai.google.dev
Sponsored by:
🔥 ODSC AI West 2025 – The Leading AI Training Conference
Join us in San Francisco from October 28th–30th for expert-led sessions on generative AI, LLMOps, and AI-driven automation.
Use the code podcast for 10% off any ticket.
Learn more: https://odsc.ai