Mixture of Experts

Welcome to Mixture of Experts, your weekly deep dive into the ever-evolving landscape of artificial intelligence, bringing you insightful discussions on the latest AI trends, innovations, and their impact on business.

From breakthrough research to practical applications, each episode offers a balanced blend of expertise and analysis. Explore how AI is reshaping industries, driving efficiency, and unlocking new opportunities for growth. Whether you're a seasoned professional seeking to stay ahead of the curve or an enthusiast curious about the future of technology, Mixture of Experts delivers the perfect mix of insights and practical knowledge. Tune in and stay informed as we navigate the dynamic intersection of AI and business.

AI code generation: Wins, fails and the future

What’s the future of AI code generation? This week on Mixture of Experts, host Tim Hwang is joined by Chris Hay, Olivia Buzek and Gabe Goodhart to debrief the biggest AI use-case of 2025: AI-powered software engineering. Claude Opus 4.5 solved a months-long optimization in under an hour but failed spectacularly at simple tasks. The barbell effect is real. Next, who's the architect—you or the model? We discuss agent orchestration, context windows and why tool performance varies wildly. Then, model differentiation: are OpenAI and Anthropic fundamentally different, or does agent architecture matter more? Finally, can open-source compete with closed ecosystems? We explore vertical integration, inference costs and the future of open models. All that and more on this week's Mixture of Experts. 00:00 – Introduction 01:11 – The barbell problem: AI coding wins and fails 03:46 – Claude Code cracks Apple Metal optimization 07:52 – Who's the architect: You or the AI? 11:44 – Model vs agent orchestration 20:44 – The future of unsupervised AI agents 24:30 – Open source vs proprietary tools 33:22 – The inference cost challenge The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts Learn more about AI code generation → https://www.ibm.com/think/topics/ai-code-generation

12-26

35:19

Disney's AI bet: USD 1B OpenAI content deal explained

Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts Why did Disney pay OpenAI a billion dollars to use their characters? This week on Mixture of Experts, host Tim Hwang and experts Marina Danilevsky, Martin Keen and Kush Varshney analyze Disney's three-year OpenAI licensing deal, and what it means for IP owners, content creators and the future of fan-generated AI content. Next, Time Magazine names “Architects of AI” as 2025 Person of the Year—it’s not the first time the person of the year was not a person, but what’s different about this? Then, NVIDIA drops Nemotron 3 open-source models; we explore what makes this model release different. Finally, Anthropic’s Soul Document leaked. We unpack model alignment, philosophy in AI and the future of prompting vs. fine-tuning. 00:37 – Introduction 02:14 – Disney and OpenAI billion-dollar deal 10:35 – Time Magazine's Person of the Year: Architects of AI 15:39 – NVIDIA Nemotron 3 open-source models 24:10 – Claude's Soul Document and model alignment The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 #OpenAI, #DisneyAI, #NVIDIANemotron, #ClaudeAI, #AIModels

12-19

38:31

GPT-5.2 code red & AWS Nova models drop

Should we care about GPT-5.2? This week on Mixture of Experts, we analyze the “code red” release of GPT-5.2 as OpenAI responds to Gemini 3. Are the constant model drops benefitting consumers? Next, Stanford released their Foundation Model Transparency Index, revealing a troubling trend that most labs are becoming less transparent. However, IBM Granite achieved a 95/100 score. Then, our experts discuss what model transparency means for enterprise AI adoption. Finally, we debrief AWS re:Invent’s biggest announcements, including Nova frontier models and Nova Forge. Join host Tim Hwang and panelists Kate Soule, Ambhi Ganesan and Mihai Criveti for our expert insights. 00:00 – Intro 1:02 – GPT-5.2 emergency release 12:21 – Stanford AI Transparency Index: Granite scores 95/100 27:18 – AWS re:Invent: Nova models and enterprise AI The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts #GPT-5.2 #AITransparency #GraniteModels #AWSNova #AIAgents

12-12

41:42

AI model analysis: Mistral 3, DeepSeek-V3.2 & Claude Opus 4.5

Is open source winning the AI race? This week on Mixture of Experts, we analyze three major model releases that dropped in the final weeks of 2025: Mistral 3, DeepSeek-V3.2 and Claude Opus 4.5. Our experts discuss what makes each model unique, from Mistral’s multimodal capabilities to DeepSeek’s reasoning-first approach and Claude’s developer focus. Are there too many good models? Next, a provocative blog post from Theory Ventures argues Gemini 3 proves scaling laws are throwing more compute at the problem. We debate if scaling laws are a universal truth. Finally, Amazon just blocked ChatGPT’s shopping research agent from accessing product data. We discuss the business incentives threatening the agent dream. Join host Tim Hwang and panelists Aaron Baughman, Abraham Daniels and Gabe Goodhart on this week’s Mixture of Experts for more! 00:00 – Intro 02:05 – Model launches: Mistral 3, DeepSeek-V3.2 and Claude Opus 4.5 15:32 – AI scaling laws & Gemini 3 26:23 – Amazon blocking ChatGPT shopping research agent The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 #Mistral3 #DeepSeek #ClaudeOpus #AIscalinglaws #AIagents

12-05

35:41

AI agents in 2025: Why agentic commerce isn't ready for Black Friday yet

Will Black Friday 2025 be the breakout moment for agentic commerce? This week on Mixture of Experts, we're departing from our usual format for a Thanksgiving special focused entirely on AI agents. Host Tim Hwang is joined by Chris Hay, Lauren McHugh Olende and Volkmar Uhlig to debate whether 2025 is truly the "year of agents." Our experts break down why consumer-facing agentic commerce still faces major hurdles. We also explore the massive gap between building agent prototypes and deploying them at scale and discuss why the developer ecosystem needs a "Shopify moment." Then, could language-to-agent interfaces bypass traditional development entirely? From coding agents like Claude Code to cost optimization challenges, this episode covers what needs to happen for agents to move from POC to production. And finally, who's positioned to dominate the space? Plus: Anthropic dropped Claude 4.5 Opus this week. We sit down with Mihai Criveti to discuss initial impressions of the new model. Does this put Anthropic back at the top of the AI game? 00:00 – Introduction 01:12 – Claude 4.5 Opus is here 09:32 – Will Black Friday be big for agentic commerce? 16:11 – Agent experience for consumers vs. enterprise 17:31 – Is 2025 the year of the agent? 23:02 – AI agents in the developer ecosystem 31:16 – Who will win the agent race? The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts Learn more about AI Agents → https://www.ibm.com/think/topics/ai-agents

11-28

41:31

Google’s Gemini 3: AI agents, reasoning and search mode

Does Gemini 3 live up to the hype? This week on Mixture of Experts, we analyze the release of Google’s Gemini 3 model. Next, OpenAI released GDPval, a new benchmark about the impact of AI on the economy. We debate AI automation and the job market. Then, as always, we talk AI agents: today we discuss some great innovations coming out of IBM Research and more. Finally, Anthropic disrupted an AI-led cyberattack. What does this mean for AI agents and malicious actions? Join host Tim Hwang and our AI experts Marina Danilevsky, Merve Unuvar and Gabe Goodhart on this week’s Mixture of Experts to learn more. 00:00 – Introduction 01:09 – Microsoft’s AI infrastructure deal, IBM and UFC AI platform and ChatGPT for Teachers 02:00 – Gemini 3 12:50 – AI agent innovation 24:00 – OpenAI GDPval 37:17 – Anthropic cyberattack The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts Discover CUGA → https://research.ibm.com/blog/cuga-agent-framework Boost your agent toolkit → https://research.ibm.com/blog/altk-agent-toolkit Listen to cybersecurity expert takes on Anthropic's cyberattacks → https://www.ibm.com/think/podcasts/security-intelligence/anthropic-stops-ai-spies-owasp-top-10-rise-small-time-ransomware

11-21

46:43

GPT-5.1 and Kimi K2: What ‘Thinking AI’ really means

Which model is better, GPT-5.1 or Kimi K2 Thinking? This week on Mixture of Experts, we have two new AI model releases: OpenAI’s GPT-5.1 and Moonshot AI’s new open-source reasoning model, Kimi K2 Thinking. We discuss user experience and personalization with AI tools and how open-source AI is changing the AI race. Finally, is Microsoft launching full “agentic users” for enterprise? Our experts discuss AI in enterprise—the risks and considerations for both technology and humans. Join host Tim Hwang and panelists Kaoutar El Maghraoui, Aaron Baughman and Mihai Criveti on this week’s Mixture of Experts, to learn more. 00:00 – Introduction 1:12 – Anthropic’s data center investment, Tesla’s AI chips, Baidu’s new chips and AI cat robots 1:59 – GPT-5.1 12:00 – Kimi K2 Thinking 21:35 – Microsoft agent users The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts

11-14

31:56

1X NEO humanoid robot enters the home

Would you trust a humanoid robot to unload your dishwasher? This week on Mixture of Experts, we revisit 1X NEO, a new humanoid robot, as it finally enters the home, seeking to automate tasks. How far are we from this reality? Next, we turn to Japan’s copyright dispute with Sora 2 and discuss how AI training, synthetic data and IP are all impacting AI models. Finally, are there too many AI alliances? Our experts analyze the OpenAI and AWS partnership and what it means for AI infrastructure and multi-cloud strategies. Join host Tim Hwang and panelists Ambhi Ganesan, Ash Minhas and Sandi Besen on this week’s Mixture of Experts. 00:00 – Intro 01:10 – Perplexity integrates with Snapchat, Coinbase AI agents, Instacart AI assistants and Google AI chips go to space 02:16 – 1X NEO humanoid robot 15:23 – Sora 2 copyright 26:49 – OpenAI and AWS partnership The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120

11-07

36:29

Anthropic’s TPU move and NVIDIA’s Starcloud

It’s spooky week in AI! This week on our Halloween edition of Mixture of Experts, we chat about Anthropic’s new billion-dollar TPU deal with Google Cloud. Plus, NVIDIA announces plans to bring data centers to outer space. Our experts discuss these two different approaches to the future of AI compute. Then, OpenAI shared how they’re strengthening ChatGPT’s responses to sensitive conversations. We talk AI governance and AI safety. Finally, we discuss the new paper, Underwriting Superintelligence; would you insure your AGI? Join host Tim Hwang and panelists Chris Hay, Gabe Goodhart and Kate Soule on this week’s Mixture of Experts. 00:00 – Intro 01:05 – OpenAI goes for profit, NVIDIA’s worth USD 5 Trn, and Amazon smart glasses 02:16 – Anthropic TPU announcement 12:49 – Underwriting Superintelligence 27:54 – ChatGPT sensitive conversations 42:14 – NVIDIA Starcloud The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts #Anthropic #AIchip #NVIDIA #AIinfrastructure #AGI

10-31

47:39

ChatGPT Atlas, OpenAI’s new web browser

OpenAI is back and coming for search. This week on Mixture of Experts, we debrief ChatGPT Atlas, OpenAI’s new web browser and the impacts on search. Then, Andrej Karpathy is back with his pessimistic timeline to AGI. Later, we discuss DeepSeek-OCR. Finally, can your AI have brain rot? Join host Tim Hwang and panelists Aaron Baughman, Abraham Daniels and Martin Keen on this week’s Mixture of Experts to find out. 00:00 – Intro 00:55 – Goldman AI, Groq and IBM, Military AI and Uber 02:05 – ChatGPT Atlas 14:23 – Karpathy’s AGI timeline 23:52 – DeepSeek-OCR 34:30 – AI brain rot The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

10-24

44:48

OpenAI, Oracle & AMD shake up AI

Visit the Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts?utm=podcastsMoEYT What if Nvidia’s biggest advantage isn’t so big anymore? This week on Mixture of Experts, we break down the CAISI report on DeepSeek’s model risks, Reflection AI’s massive USD 2B fundraise for an open frontier lab, Oracle Cloud’s big bet on AMD chips over Nvidia and the wild story of a VC fund replacing analysts with AI agents. 00:00 – Intro 01:14 – Bad bot, Oracle + IBM, Dreamforce, 18+ ChatGPT 02:19 – Oracle bets on AMD chips 18:34 – CAISI DeepSeek report 29:57 – Reflection AI raises USD 2B 41:18 – VC fund replacing analysts with AI agents The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120

10-17

49:42

IBM partners with Anthropic, plus OpenAI drops AgentKit

The major AI players are not slowing down. In episode 76 of Mixture of Experts, host Tim Hwang is joined by Olivia Buzek, Chris Hay and Mihai Criveti. First, we analyze OpenAI’s release of their new AgentKit. Then, IBM announced a partnership with Anthropic to accelerate enterprise-ready AI—we talk AI governance and the future of industry partnerships. Then, Chris Hay gives us his explanation of modular manifolds. Finally, we discuss the potential of AI becoming healthcare experts amid Deena Mousa’s The Algorithm Will See You Now. Will AI replace radiologists? Tune in to Mixture of Experts to find out. 00:00 – Intro 01:09 – OpenAI and AMD, IBM’s Project Bob, chip diamonds and Peloton's AI Trainers 02:20 – AgentKit 13:29 – IBM & Anthropic partnership 22:49 – Modular manifolds 31:52 – AI in radiology The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Read the whitepaper about how IBM and Anthropic are securing enterprise AI architectures → https://ibm.biz/Bdb4xR Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Visit the Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts

10-10

43:48

This week in AI models: Granite 4.0, Claude 4.5, Sora 2

What’s new in AI models? In episode 75 of Mixture of Experts, host Tim Hwang is joined by Kaoutar El Maghraoui, Kate Soule and Kush Varshney to debrief a frenzy of model drops this week. IBM debuts Granite 4.0 hyper-efficient hybrid models, Claude Sonnet 4.5 delivers longer run times and sharper reasoning and Sora 2 takes vibe coding to the next level with vibe video production. Then, we dig into OpenAI’s new feature, Buy in ChatGPT. Finally, it’s Cybersecurity Awareness Month, and we are joined by special guest, Matt Kosinski, host of the Security Intelligence podcast. Can you trust your AI? Tune in to Mixture of Experts to find out. 00:00 – Intro 1:17 – Meta Ads assistant, DoorDash Dot, Microsoft vibe working and Tilly AI Actor 2:23 – Granite 4.0 10:21 – Claude 4.5 17:57 – Sora 2 29:15 – Buy in ChatGPT 37:00 – Introducing Security Intelligence The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Learn more about artificial intelligence → https://www.ibm.com/think/artificial-intelligence Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts Watch the latest episode of the Security Intelligence podcast → https://www.youtube.com/watch?v=mDpUZD1ogEE&list=PLOspHqNVtKABGIbaWP1xYQHbwuXjZwqpH&index=1&t=225s

10-03

42:05

NVIDIA’s USD 100bn investment and Google's AP2

Why did NVIDIA invest USD 100bn into OpenAI? In episode 74 of Mixture of Experts, host Tim Hwang is joined by Sandi Besen, Mihai Criveti and Gabe Goodhart to talk about the AI chipmaker’s next big move. Next, we analyze Tongyi DeepResearch and get into a conversation around open source. Then, Google released a new agent protocol, AP2. After that, will AI take over? We debrief a new book, If Anyone Builds It, Everyone Dies. Finally, Apple shared that the new AirPods will have a translation feature powered by AI. Is Apple back in the game? Tune in to Mixture of Experts to find out. 00:00 – Intro 01:17 – OpenAI's data centers, Alibaba and NVIDIA partnership, IBM Granite Docling and Meta's dating AI assistant 02:23 – Tongyi DeepResearch 13:12 – Google's AP2 23:31 – "If Anyone Builds It, Everyone Dies" 33:50 – AirPods translation feature 43:49 – NVIDIA invests USD 100bn in OpenAI The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Learn more about artificial intelligence → https://www.ibm.com/think/artificial-intelligence Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts

09-26

52:41

Anthropic Economic Index, Virtual Agent Economies, AlterEgo and How People Use ChatGPT

What are the most common uses of ChatGPT? In episode 73 of Mixture of Experts, host Tim Hwang is joined by Lauren McHugh, Martin Keen and Aaron Baughman to talk about a new report, How People Use ChatGPT. Next, Anthropic released an updated version of their economic index. Then, another paper, this one coming out of DeepMind on agent economies. How likely is this? Finally, how practical are AI wearables and what does a future with them look like? All that and more on today’s Mixture of Experts. 00:00 – Intro 1:10 – News: Alphabet Inc. $3 Trillion Market Cap, AI could boost trade value and the animal internet 2:04 – How People Use ChatGPT 15:47 – Anthropic Economic Index 25:50 – Virtual Agent Economies 35:36 – AlterEgo The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Learn more about artificial intelligence → https://www.ibm.com/think/artificial-intelligence Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts

09-19

46:55

Why language models hallucinate, revisiting Amodei’s code prediction and AI in the job market

Do we need more AI hallucinations? In episode 72 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Chris Hay and Skyler Speakman to talk about OpenAI’s paper, Why language models hallucinate. Next, in March 2025, Anthropic’s CEO, Dario Amodei, predicted that AI will be writing 90% of code for software developers. How is that turning out? Then, is AI making the job market hell? Finally, you can now run an LLM on a circuit board the size of a business card. Where does that take us? All that and more on today’s Mixture of Experts. 00:00 – Intro 1:18 – Oracle, data center construction, iPhone 17 and tech saint 3:03 – Why language models hallucinate 15:59 – Dario Amodei’s prediction 22:45 – Job market is hell 29:57 – LLM on a business card The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

09-12

42:23

Google Antitrust, Anthropic's $183B leap and are we in the AI winter?

Are browsers the right entry point for AI tools? In episode 71 of Mixture of Experts, guest host Bryan Casey is joined by Gabe Goodhart, Kaoutar El Maghraoui and Mihai Criveti to talk about the verdict in the Google antitrust case and what it means for agentic AI. Next, as Anthropic raised $13 billion in a recent funding round, bringing its valuation to $183 billion, we discuss investment in AI startups. Finally, the discourse on GPT-5 and AI model innovation created “AI winter.” What does this mean for the future of AI innovation? All that and more on today’s Mixture of Experts. 00:00 – Intro 1:38 – Safer GPT, IBM and AMD, Amazon Lens Live and AI and Starbucks 2:59 – Google antitrust 27:08 – Anthropic and AI valuation 41:42 – AI winter The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Learn more about artificial intelligence → https://www.ibm.com/think/artificial-intelligence Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts

09-05

52:18

Monster prompt, OpenAI’s business play, nano-banana and US Open experimentations

Would you trust a 100-page prompt to do your taxes? In episode 70 of Mixture of Experts, host Tim Hwang is joined by Aaron Baughman, Chris Hay and Lauren McHugh to talk about the 100-page prompt KPMG used to build their agentic TaxBot. Next, we debrief OpenAI’s hints that it may sell infrastructure in the future. Then, image generation goes bananas with the latest AI image model from Gemini: nano-banana. Finally, Aaron Baughman demonstrates three new features for the US Open website, powered by generative AI models built with IBM watsonx: Match Chat, Key Points, and Live Likelihood to Win. All that and more on today’s 70th episode of Mixture of Experts. 00:00 – Intro 3:05 – KPMG’s monster prompt 16:37 – OpenAI’s infra for sale? 25:10 – Gemini's nano-banana 35:11 – US Open experimentations The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Learn more about artificial intelligence → https://www.ibm.com/think/artificial-intelligence Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts US Open powered by IBM watsonx → https://www.ibm.com/sports/usopen

08-29

44:03

Gen AI pilots fail, GPT-5's hidden prompt revealed, reasoning model flaws and Claude closing chats

Is enterprise AI in danger? In episode 69 of Mixture of Experts, host Tim Hwang is joined by Marina Danilevsky, Nathalie Baracaldo and Sandi Besen to debrief MIT’s report on gen AI pilots. Next, GPT-5 has a hidden system prompt? Then, we revisit the conversation about chain of thought (CoT) reasoning with our researchers. Are large reasoning models not thinking straight? Finally, Anthropic announced Claude will close down "distressing” conversations and we debate AI welfare. All that and more on today’s episode of Mixture of Experts. 00:00 – Intro 1:13 – US Open, Meta restructuring Superintelligence lab and Robot Olympics 3:11 – Gen AI pilots fail 11:09 – GPT-5's hidden prompt revealed 22:47 – Reasoning model flaws 33:55 – Claude closing chats The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe to the Think newsletter → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Learn more about artificial intelligence → https://www.ibm.com/think/artificial-intelligence Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts

08-22

45:01

Perplexity’s bid for Chrome, Grok Imagine and GPT-5 check-in

Would you sell Chrome for USD 34.5 billion? In episode 68 of Mixture of Experts, host Tim Hwang is joined by Abraham Daniels, Sophie Kuijt and Shobhit Varshney for another packed week in AI. First, AI startup Perplexity puts out a bid for Google Chrome at over double their valuation. Why? Next, xAI released Grok Imagine and claims it will be the next Vine. Our experts analyze the future of AI video generation. Finally, one week after the GPT-5 release, skeptics are saying it did not live up to the hype. Is AI development plateauing? All that and more on Mixture of Experts! 00:00 – Intro 01:17 – MoE News: NVIDIA H20s, Apple AI devices, AI people pleasers and Google DeepMind’s bioacoustics model 02:40 – Perplexity’s USD 34.5B bid 12:10 – Grok Imagine 24:23 – GPT-5 check-in The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe to the Think Newsletter → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Learn more about artificial intelligence → https://www.ibm.com/think/artificial-intelligence Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts

08-15

40:44
