DiscoverThe Cloud Pod319: AWS Cost MCP: Your Billing Data Now Speaks Human
319: AWS Cost MCP: Your Billing Data Now Speaks Human

319: AWS Cost MCP: Your Billing Data Now Speaks Human

Update: 2025-09-03
Share

Description

Welcome to episode 319 of The Cloud Pod, where the forecast is always cloudy! Justin, Matt, and Ryan are in the studio to bring you all the latest in cloud and AI news. AWS Cost MCP makes exploring your finops data as simple as english text. We’ve got a sunnier view for junior devs, a Microsoft open source development, tokens, and it’s even Kubernetes’ birthday – let’s get into it! 


Titles we almost went with this week:



  • From Linux Hater to Open Source Darling: A Microsoft Love Story

  • 20,000 Lines of Code and a Dream: Microsoft’s Open Source Glow-Up

  • Ctrl+Alt+Delete Your Assumptions: Microsoft Goes Full Penguin

  • Token and Esteem: Amazon Bedrock Gets a Counter

  • CSI: Cloud Scene Investigation

  • The Great SQL Migration: How AI Became the Universal Translator

  • Token and Ye Shall Receive: Bedrock’s New Counting Feature

  • The Count of Monte Token: A Bedrock Tale – mk

  • Ctrl+Z for Your Database: Now with Built-in Lag Time

  • IP Freely: GKE Takes the Pain Out of Address Management

  • AWS CEO: AI Can’t Replace Junior Devs Because Someone Has to Fix the AI’s Code

  • Better Late Than Never: RDS PostgreSQL Gets Time Travel

  • The SQL Whisperer: Teaching AI to Speak Database

  • DigitalOcean Goes Full Chatbot: Your Infrastructure Now Speaks Human

  • Musk vs Cook: The App Store Wars Episode AI

  • Firestore Goes Mongo: A Database Love Story

  • GKE Turns 10: Now With More Candles and Less Complexity

  • Prime Day Infrastructure: Now With 87,000 AI Chips and a Robot Army

  • AWS Scales to Quadrillion Requests: Your Black Friday Traffic Looks Cute

  • AWS billing now speaks human, thanks to MCPs

  • The Bastion Holds: Azure’s New Gateway to Kubernetes Kingdoms

  • The Surge Before the Merge: Azure’s New Upgrade Strategy

  • CNI Overlay: Because Your Pods Deserve Their Own ZIP Code


AI Is Going Great – or How ML Makes Money 


00:46 Musk’s xAI sues Apple, OpenAI alleging scheme that harmed X, Grok



  • xAI filed a lawsuit against Apple and OpenAI, alleging anticompetitive practices in AI chatbot distribution, claiming Apple deprioritizes competing AI apps like Grok in the App Store while favoring ChatGPT through direct integration into iOS devices.

  • The lawsuit highlights tensions in AI platform distribution models, where cloud-based AI services depend on mobile app stores for user access, potentially creating gatekeeping concerns for competing generative AI providers.

  • Apple’s partnership with OpenAI to integrate ChatGPT into iPhone, iPad, and Mac products represents a shift toward native AI integration rather than app-based access, which could impact how cloud AI services reach end users.

  • The dispute underscores growing competition in the generative AI market, where multiple players, including xAI’s Grok, OpenAI’s ChatGPT, DeepSeek, and Perplexity, are vying for market position through both cloud APIs and mobile distribution channels.

  • For cloud developers, this case raises questions about AI service distribution strategies and whether direct device integration partnerships will become necessary to compete effectively against app store-based distribution models.


01:55 Justin – “There’s always a potential for conflict of interest when you have a partnership like this, but also the app store – there’s a ton of companies that track downloads and track usage of these things, and I don’t know that they have hard evidence here, other than this is just a way to keep Apple distracted while they make Grok better.” 


04:14 AWS CEO says AI replacing junior staff is ‘dumbest idea’ • The Register



  • AWS CEO Matt Garman argues that using AI to replace junior developers is counterproductive, since they’re the least expensive employees and most engaged with AI tools, warning that eliminating entry-level positions creates a pipeline problem for future senior talent.

  • Garman criticizes the standard metric of measuring AI value by percentage of code written, noting that more lines of code don’t equal better code – and that over 80% of AWS developers already use AI tools for various tasks, including unit tests, documentation, and code writing.

  • The CEO emphasizes that future tech workers need to learn critical thinking and problem-solving skills, rather than narrowly focused technical skills, as rapid technological change means that specific skills may not sustain a 30-year career.

  • This perspective aligns with AWS’s push for their Kiro AI coding assistant while acknowledging that AI should augment rather than replace human developers, particularly as organizations need experienced developers to evaluate and implement AI-generated code properly.

  • Garman’s comments come amid industry concerns about AI’s impact on employment and follow recent issues with AWS’s Q Developer tool, which had security vulnerabilities, highlighting the ongoing need for human oversight in AI development.


05:25 Ryan – “I do really think the industry is using AI wrong, and I think that the layoffs are a sign of that. And it’s really easy to say ‘oh, well our mid to senior developer staff can now do all these junior tasks, so let’s replace them,’ but I don’t think that’s a sustainable model.” 


AWS


11:14 Count Tokens API is now supported for Anthropic’s Claude models now in Amazon Bedrock



  • Amazon Bedrock now offers a Count Tokens API for Claude models, enabling developers to calculate token usage before making inference calls, which helps predict costs and avoid unexpected rate limit issues.

  • This API addresses a common pain point where developers would submit prompts that exceed context windows or trigger throttling, only discovering the issue after the fact and potentially incurring unnecessary costs.

  • The feature enables more efficient prompt engineering by allowing teams to test different prompt variations and measure their token consumption without actually running inference, which is particularly useful for optimizing system prompts and templates.

  • Currently limited to Claude models only, Amazon is prioritizing Anthropic’s integration, while potentially planning similar support for other Bedrock models, such as Titan, or third-party options.

  • For cost-conscious organizations, this pre-flight check capability allows better budget forecasting and helps implement guardrails before expensive model calls, critical as enterprises scale their AI workloads.


12:10 Justin – “Now, I appreciate the idea of allowing better budget forecasting, but budget forecasting does not move with the scale of AI, so there is no way that you’re getting an accurate forecast unless you have very specific prompts that you’re going to reuse a LOT of times.”    


13:39 Announcing the AWS Billing and Cost Management MCP server



  • AWS releases an open-source Model Context Protocol (MCP) server for Billing and Cost Management that enables AI assistants like Claude Desktop, VS Code Copilot, and Q Developer CLI to analyze AWS spending patterns and identify cost optimization opportunities.

  • The MCP server features a dedicated SQL-based calculation engine that handles large volumes of cost data and performs reproducible calculations for period-over-period changes and unit cost metrics, providing more comprehensive functionality than simple API access.

  • This integration enables customers to utilize their preferred AI assistant for FinOps tasks, including historical spending analysis, cost anomaly detection, workload cost estimation, and AWS service pricing queries, all without needing to switch to the AWS console.

  • The server connects securely using standard AWS credentials, with minimal configuration required, and is now available in the AWS Labs
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

319: AWS Cost MCP: Your Billing Data Now Speaks Human

319: AWS Cost MCP: Your Billing Data Now Speaks Human

Justin Brodley, Jonathan Baker, Ryan Lucas and Matt Kohn