317: I Got 99 Problems, But a Hallucination Ain’t One
Description
Welcome to episode 317 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and an out-of-breath (from outrunning bears) Ryan are back in the studio to bring you another episode of everyone’s favorite cloud and AI news wrap-up. This week we’ve got GPT-5, Oracle’s newly minted AI conference, hallucinations (not the good kind), and even a Cloud Journey follow-up. Let’s get into it!
Titles we almost went with this week:
- Oracle Intelligence: Mission Las Vegas
- AI World: Oracle’s Excellent Adventure
- AI Gets a Reality Check: Amazon’s New Math Teacher for Hallucinating Models
- Jules Verne’s 20,000 Lines Under the C
- GPT-5: The Empire Strikes Back at Computing Costs
- 5⃣Five Alive: OpenAI’s Latest Language Model Drops
- GPT-5 is Alive! (And Ready for Your API Calls)
- From Kanban to Kan’t-Ban: Alienate Your User Base in One Update
- No More Console Hopping: ECS Logs Stay Put
- Following the Paper Trail: ECS Logs Go Live
- The Pull Request Whisperer
- Five’s Company: DigitalOcean Joins the GPT Party
- WireGuard Your Kubernetes: The Mesh-iah Has Arrived
- EKS-tending Your Reach: When Your Nodes Need a VPN Alternative
- Buttercup Blooms: DARPA’s Prize-Winning AI Security Tool Goes Public
- From DARPA to Docker: How Buttercup Brings AI Bug-Hunting to Your Laptop
- Agent 007: License to Query
- Compliance Manager: Because Nobody Dreams of Filling Out Federal Paperwork
- Do Compliance Managers dream of Public Sector sheep?
- Blob’s Your Uncle: Finding Lost Data in the Cloud
- Wassette: Teaching Your AI Assistant to Go Shopping for Tools
- Monitor, Monitor on the Wall, Who’s the Most Secure of All?
- Better Late Than IPv-Never
- VPC Logs: Now with 100% Less Manual Labor
- CloudWatch Catches All the Flows in Your Organization
- The Organization-Wide Net: No VPC Left Behind
- SQS Goes Super Size: Would You Like to Quadruple That?
- One MiB to Rule Them All: SQS’s Payload Growth Spurt
- Microsoft Finally Merges with Its $7.5 Billion Side Piece
- From Hub to Spoke: GitHub Loses Its Independence
- Cloud Run Forest Run: Google’s AI Workshop Marathon
- From Zero to AI Hero: Google’s Production Pipeline Workshop
- The Fast and the Serverless: Cloud Run Drift
A big thanks to this week’s sponsor:
We’re sponsorless! Want to get your brand, company, or service in front of a very enthusiastic group of cloud news seekers? You’ve come to the right place! Send us an email or hit us up on our Slack channel for more info.
General News
01:17 GitHub will be folded into Microsoft proper as CEO steps down – Ars Technica
- GitHub will lose its operational independence and be integrated into Microsoft’s CoreAI organization in 2025, ending its separate CEO structure that has existed since Microsoft’s $7.5 billion acquisition in 2018.
- The reorganization eliminates the CEO position, with GitHub’s leadership team reporting to multiple executives within CoreAI rather than a single leader, potentially impacting decision-making speed and product direction.
- This structural change could affect GitHub’s developer-focused culture and remote-first operations that have distinguished it from Microsoft’s traditional corporate structure.
- The integration into CoreAI suggests Microsoft plans to more tightly couple GitHub with its AI initiatives, potentially accelerating AI-powered development features but raising concerns about platform neutrality.
- Developers and enterprises should monitor how this affects GitHub’s roadmap, pricing, and commitment to open source projects, as tighter Microsoft integration historically has led to significant platform changes.
03:01 Matt – “God knows how long a decision is going to take to get made.”
AI Is Going Great – or How ML Makes Its Money
05:10 Jules, Google’s asynchronous AI coding agent, is out of public beta
- In case you’ve forgotten about it, Jules is Google’s worst-marketed AI coding agent.
- Jules is Google’s AI coding agent that operates asynchronously to handle development tasks.
- It’s now publicly available, after processing 140,000+ code improvements during beta testing with thousands of developers.
- The service runs on Gemini 2.5 Pro’s advanced reasoning capabilities to create coding plans and generate higher-quality code outputs, with new features including GitHub issues integration and multimodal support.
- Google introduced three pricing tiers: free introductory access (which you will blow through almost immediately), Google AI Pro with 5x higher limits for daily coding, and Google AI Ultra with 20x limits for intensive multi-agent workflows at scale.
- Is it just us, or is this the same pricing structure as Claude?
- This represents a shift toward autonomous coding assistants that can work independently on tasks while developers focus on other work, potentially changing how cloud-based development teams operate.
- The asynchronous nature allows Jules to handle time-consuming tasks like bug fixes and code improvements without requiring constant developer oversight, which could significantly impact productivity for cloud development projects.
06:30 Ryan – “I think it’s a perfect example of like where GitHub might go, right? Because this already integrates with GitHub, so you can communicate with the AI in issues or point at certain issues, or use it in comments. And it’s asynchronous, so it’s just running in the background. It’s not a chat or an interactive agent conversation. You’re sort of like giving it directions and sending it off.”
- Were you waiting for the drumroll? Well, no sound effects this week. Sad face.
- GPT-5 introduces a larger model architecture with refined attention mechanisms and multimodal input processing, requiring substantial cloud compute resources for deployment and inference at scale.
- Enhanced contextual comprehension and faster processing speeds enable more efficient API calls and reduced latency for cloud-based AI services, potentially lowering operational costs for businesses.
- Technical improvements in training efficiency could reduce the computational overhead for fine-tuning models on cloud platforms, making custom AI deployments more accessible to smaller organizations.
- Healthcare, education, and creative industries can leverage GPT-5 through cloud APIs for applications like medical documentation, personalized learning systems, and content generation workflows.
- OpenAI’s safety measures and ethical deployment guidelines will likely influence cloud provider policies for hosting and serving large language models, affecting compliance requirements for enterprise users.
- AGI is here, guys! Well, not really. Maybe. Sort of. Getting close? Ryan is excited about it, anyway.
09:38 Introducing GPT-5 for Developers
- GPT-5 represents the next iteration of OpenAI’s language model series, likely offering improved language understanding and generation capabilities that developers can integrate via API endpoints into cloud-based applications.
- The model is expected to outperform GPT-4 on benchmarks, with better context handling, fewer hallucinations, and more accurate responses for enterprise cloud deployments.
- Developer integration features may include new API capabilities, updated SDKs, and code examples for implementing GPT-5 across various cloud platforms and programming languages (see the sketch after this list).
- Pricing and rate limits will be critical factors for businesses evaluating GPT-5 adoption, particularly for high-volume cloud applications requiring scalable AI inference.
- The release could impact cloud computing costs and architecture decisions as organizations determine whether to use OpenAI’s hosted service or explore self-hosting options on their cloud infrastructure.
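For teams weighing the API route, here’s a minimal sketch of what a GPT-5 call could look like using the OpenAI Python SDK. Treat it as an illustration, not a reference implementation: the `gpt-5` model identifier comes from the announcement, and you should confirm exact model names, pricing, and rate limits in OpenAI’s docs before building on it.

```python
# A minimal sketch, assuming the OpenAI Python SDK and an OPENAI_API_KEY
# in the environment; the "gpt-5" model identifier is taken from the
# announcement, but verify the exact name and rate limits in OpenAI's docs.
from openai import OpenAI

client = OpenAI()  # picks up OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-5",
    messages=[
        {"role": "system", "content": "You are a concise cloud architecture assistant."},
        {"role": "user", "content": "Summarize the trade-offs of calling a hosted LLM API versus self-hosting a model."},
    ],
)

# The assistant's reply lives in the first choice's message content.
print(response.choices[0].message.content)
```

One reason the hosted-vs-self-hosted question in the last bullet is mostly a cost and operations decision rather than a code decision: the same SDK can be pointed at a different OpenAI-compatible endpoint via its `base_url` setting, so switching backends typically doesn’t require rewriting application code.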
11:09 Ryan – “I’m kind of afraid of AGI, and I’m putting my head in the sand about