DiscoverMixture of Experts
Mixture of Experts
Claim Ownership

Mixture of Experts

Author: IBM

Subscribed: 31Played: 559
Share

Description

Welcome to Mixture of Experts, your weekly deep dive into the ever-evolving landscape of artificial intelligence—bringing you insightful discussions on the latest AI trends, innovations, and their impact on business.


From breakthrough research to practical applications, each episode offers a balanced blend of expertise and analysis. Explore how AI is reshaping industries, driving efficiency, and unlocking new opportunities for growth. Whether you're a seasoned professional seeking to stay ahead of the curve or an enthusiast curious about the future of technology, Mixture of Experts delivers the perfect mix of insights and practical knowledge. Tune in and stay informed as we navigate the dynamic intersection of AI and business.

59 Episodes
Reverse
How long until Anthropic drops Claude 5.0? On today’s bonus episode of Mixture of Experts, guest host Bryan Casey is joined by Chris Hay, Marina Danilevsky and Shobhit Varshney to analyze the newly released Claude 4.0 family: Opus 4 and Sonnet 4. What do we know about the model architecture vs. What is speculation? In this special episode we talk Anthropic, OpenAI, Google and the rest of the competition in the AI race! Who will win? Tune-in to this bonus Mixture of Experts for more!  00:01 – Intro 00:32 – Claude 4.0  12:48 – OpenAI and Jony Ives 22:27 – Anthropic full-stack   The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
Should you pay for Google’s AI Ultra subscription plan? In episode 56 of Mixture of Experts, host Tim Hwang is joined by Abraham Daniels, Gabe Goodhart and Marina Danilevsky to debrief the announcements from Google I/O 2025.  Next, RedHat dropped llm-d, a Kubernetes-native distributed inference serving stack; what is it and why does it matter? Then, we analyze Microsoft’s NLWeb: is everything becoming conversational? Finally, Stack Overflow has been on a decline. Is AI to blame? Find out more on this week’s Mixture of Experts! 00:01 – Intro 00:52 --Google I/O 2025 announcements 11:36 -- Stack Overflow 22:04 -- llm-d 30:08 -- NLWeb  The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
Can Mistral make Europe a global AI contender? In episode 55 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Volkmar Uhlig and Kaoutar El Maghraoui to discuss the drop of Mistral Medium 3. Next, we analyze the AI chip sales both NVIDIA and AMD made to Saudi Arabia. Then, with IBM’s new ITBench and OpenAI’s HealthBench, we dive deeper into benchmarks for AI evaluation. Tune in to this week’s Mixture of Experts for more! 00:01 – Intro 00:47 -- Mistral Medium 3 12:26 -- AI chips to Saudi Arabia 21:21 -- AI evaluation benchmarks 31:47 -- Amazon's AI-generated pause ads  The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
Has AI hallucination gotten out of control? In episode 54 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Skyler Speakman and Kaoutar El Maghraoui to analyze reasoning models and rising hallucinations. Next, as IBM Think 2025 wraps, the experts unpack the biggest highlights from IBM’s biggest show of the year: new AI agents, Ferraris and ... penguins? Then, OpenAI is making moves with its acquisition of Windsurf. What does this mean? Tune in to this week’s Mixture of Experts for more! 00:01 – Intro 01:12 – IBM Think 2025 09:27 – Reasoning models and hallucinations 19:23 – OpenAI Windsurf acquisition   The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
We are celebrating MoE podcast’s one year anniversary! In episode 53 of Mixture of Experts, host Tim Hwang is joined by the O.G. panel of experts from our pilot—Chris Hay, Shobhit Varshney and Kush Varshney. This week, we cover some exciting announcements at LlamaCon. Then, we discuss some new Chinese AI models from Qwen3 to the rumored DeepSeek-R2. Next, J.P. Morgan’s CISO, Patrick Opet, released “An open letter to our third-party suppliers,” covering the need for AI security. Are we doomed? Finally, we look back at some of the topics we discussed in episode 1—the Rabbit AI device, GPT-2 chatbot, Apple Intelligence—after all that, who was the first person to say “agents” on the podcast?  Tune in to find out, on today’s one-year celebration of Mixture of Experts. 00:00 -- Intro00:38 -- LlamaCon10:34 -- Qwen3 and DeepSeek-R223:23 -- J.P. Morgan’s open letter 39:45 -- One year of MoEThe opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
Is OpenAI going to enter the social media game? In episode 52 of Mixture of Experts host, Tim Hwang is joined by Gabe Goodhart, Kate Soule and Marina Danilevsky. First, Sam Altman is rumored to be testing an internal prototype social network; why is this a potential next move for the AI giant? Next, for our paper of the week, we analyze Anthropic’s study on chain-of-thought reasoning, “Reasoning Models Don’t Always Say What They Think.” Then, AI scraping puts a strain on Wikimedia; what’s the impact of this? Finally, China held a humanoid robot half-marathon, where humans raced alongside robot competitors. Who wins this AI race? All that and more on today’s Mixture of Experts. 00:41 -- OpenAI social network 10:02 -- Anthropic’s reasoning study 20:56 -- AI bots strain Wikimedia 31:33 -- Humanoid half-marathon  The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
OpenAI just dropped o3 and o4-mini! In episode 51 of Mixture of Experts host, Tim Hwang is joined by Chris Hay, Vyoma Gajjar and special guest John Willis, Owner of Botchagalupe Technologies. Today, we analyze Sam Altman’s new AI models, o3 and o4-mini. Next, Google announced that by Q3 you can run Gemini on-prem; what does this mean for enterprise AI adoption? Then, John is on the show today to take us through AI evaluation tools and why we need them. Finally, NVIDIA is planning to move AI chip manufacturing to the U.S. Can they pull this off? All that and more on today’s Mixture of Experts. 00:01 – Intro 00:56 – OpenAI o3 and o4 mini 14:57 – Google Gemini on-prem 23:43 – AI evaluation tools 34:59 – NVIDIA's U.S. chip manufacturing   The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
IBM z17 is here! In episode 50 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Shobhit Varshney and Hillery Hunter to debrief the launch of a new mainframe with robust AI infrastructure. Next, Meta dropped Llama 4 over the weekend;, how's it going? Then, Shobhit is recording live from Google Cloud Next in Las Vegas, along with Gemini 2.5 Pro. What are some of the most exciting announcements? Finally, the Pew Research Center shows perception of AI, how does this impact the industry? All that and more on today’s 50th Mixture of Experts. 00:01 -- Intro 00:55 -- IBM z17 11:42 -- Llama 4 25:02 -- Google Cloud Next 2025 34:29 -- Pew's research on perception of AI The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Explore the new features of IBM z17: https://www.ibm.com/products/z17 Read the Pew Research: https://www.pewresearch.org/internet/2025/04/03/how-the-us-public-and-ai-experts-view-artificial-intelligence/  Subscribe for AI updates: https://ibm.biz/Think_newsletter Visit Mixture of Experts podcast page to learn more AI content: https://www.ibm.com/think/podcasts/mixture-of-experts 
Will OpenAI be fully open source by 2027? In episode 49 of Mixture of Experts, host Tim Hwang is joined by Aaron Baughman, Ash Minhas and Chris Hay to analyze Sam Altman’s latest move towards open source. Next, we explore Anthropic's mechanistic interpretability results and the progress the AI research community is making. Then, can Apple catch up? We analyze the latest critiques on Apple Intelligence. Finally, Amazon enters the chat with AI agents. How does this elevate the competition? All that and more on today’s Mixture of Experts.00:01 -- Introduction00:48 -- OpenAI goes open  11:36 -- Anthropic interpretability results 24:55 -- Daring Fireball on Apple Intelligence 34:22 -- Amazon’s AI agentsThe opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.Subscribe for AI updates: https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120Learn more about artificial intelligence → https://www.ibm.com/think/artificial-intelligenceVisit Mixture of Experts podcast page to learn more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts
What’s the best open-source model? In episode 48 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Kush Varshney and Skyler Speakman to explore the future of open-source AI models. First, we chat about the release of DeepSeek-V3-0324. Then, more announcements coming out of Google including Gemini Canvas and Gemini 2.5. Next, Extropic has entered the chat with a thermodynamic chip. Finally, AI image generation is on the rise as OpenAI released GPT-4o image generation. All that, and more on today’s Mixture of Experts. 00:01 – Intro 00:42– DeepSeek-V3-0324 09:48 – Gemini 2.5 and Canvas 21:27– Extropic’s thermodynamic chip 30:20 – OpenAI image generation The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.
What’s the most exciting announcement coming out of NVIDIA GTC? In episode 47 of Mixture of Experts, host Tim Hwang is joined by Nathalie Baracaldo, Kaoutar El Maghraoui and Vyoma Gajjar. First, we dive into the latest announcements from NVIDIA GTC, including the Groot N1 model for humanoid robotics. Next, Baidu released some new AI reasoning models, and they’re not open source? Then, for our paper of the week we discuss the flaws of Chain-of-Thought reasoning. Finally, Gemini Flash 2.0 has released image generation models for developer experimentation., Iis Google catching up on the AI game? Tune -in to today’s Mixture of Experts to find out!  00:01 – Intro  01:27– NVIDIA GTC 14:18– New Baidu AI models 21:19– Chain-of-Thought reasoning 32:18 – Gemini image generation  The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
Is Manus a second DeepSeek moment? In episode 46 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Kaoutar El Maghraoui and Vyoma Gajjar to talk Manus! Next, the rise of vibe coding—what started as a joke has now become a thing? Then, we dive deep into the future of scaling laws. Finally, Perplexity is teaming up with Deutsche Telekom to release an AI phone—what’s the motivation here? Tune-in to today’s Mixture of Experts to find out more! 00:01 – Intro 00:37 -- Manus 14:09 – Vibe coding 30:13 – Scaling laws 39:07 – Perplexity's AI phone  The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
When can we expect quantum to reach consumer devices? In episode 45 of Mixture of Experts, host Tim Hwang is joined by special guest, Blake Johnson, to debrief the quantum noise in the news. Blake helps us understand the intersection between quantum and AI and how far we are from this technology. Then, veteran experts Chris Hay and Volkmar Uhlig hash out some other news in AI this week. We cover Anthropic’s Model Context Protocol, CoreWeave filing for an IPO and Sesame AI’s new voice companion. All that and more on today’s Mixture of Experts! 00:01 – Intro  01:06 – Quantum leap 20:08 -- Model Context Protocol 28:24 -- CoreWeave IPO 40:12 -- Sesame AI voice companion The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
Is pre-training dead? In this bonus episode of Mixture of Experts, guest host Bryan Casey is joined by Kate Soule and Chris Hay. On Thursday, Sam Altman dropped GPT-4.5 just after we wrapped our weekly recording. We got a few of our veteran experts on the podcast to analyze OpenAI’s largest and “best” chat model yet. What’s the hype? Tune-in to this bonus episode to find out! 00:01 – Intro  00:25 – GPT-4.5 The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
Granite 3.2 is officially here! In episode 44 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Maya Murad and Kaoutar El Maghraoui to debrief a few big AI announcements. Last week we covered small vision-language models (VLMs), and this week Granite 3.2 dropped with  new VLMs, enhanced reasoning capabilities, and more! Kate takes us under the hood to understand the new features and how they were created. Next, Anthropic dropped a new intelligence model, Claude 3.7 Sonnet, and a new agentic coding tool, Claude Code. Why did Anthropic release these separately? Then, as we cannot have an episode without covering agents, Maya takes us through the new BeeAI agents! Finally, can fine tuning on a malicious task lead to much broader misalignment? Our experts analyze a new paper released on ‘Emergent misalignment.’ All that and more on this week's episode! 00:01 – Intro  00:41 – Claude 3.7 Sonnet 11:58 – BeeAI agents  20:11– Granite 3.2 29:23 – Emergent misalignment The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
What is all the hype around Deep Research? In episode 43 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Volkmar Uhlig and Shobhit Varshney. This week, we discuss reasoning model features coming out of companies like OpenAI’s Deep Research, Google Gemini, Perplexity, xAI’s Grok-3 and more! Next, OpenAI is rumored to release an inference chip, but how likely is this to be a success in the AI chip game? Then, we analyze the capabilities of small vision-language models (VLMs). Finally, a startup, Firecrawl, released a job posting in search of an AI agent. Is this the future for AI tools in the workforce? Tune-in to today’s Mixture of Experts to find out. 00:01 – Intro 00:35 – Deep Research 11:58 – OpenAI inference chip 22:17 – Small VLMs 32:31 – AI agent job posting The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.
Live from Paris, Tim Hwang is at the AI Action Summit 2025. In episode 42 of Mixture of Experts, we welcome Anastasia Stasenko, CEO and Co-Founder of pleias along with our veteran experts Marina Danilevsky and Chris Hay. Last week, we touched on some potential conversations at the Paris AI Summit, this week we recap what actually happened. Is AI safety improving Globally? Next, for our paper of the week, we breakdown s1: Simple test-time scaling. Then, Sam Altman is back with another blog, “Three Observations,” what do our experts have to say? Finally, what can we learn from Anthropic’s Economic Index? All that and more on today’s Mixture of Experts. 00:01 – Intro 00:42 – Paris AI Summit 11:10 – s1: Simple test-time scaling 19:32 – Sam Altman’s “Three Observations” 30:41 – Anthropic’s Economic Index The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Resources:Read the paper about s1: Simple test-time scaling: https://arxiv.org/abs/2501.19393Read Sam Altman's "Three Observations": https://blog.samaltman.com/three-observationsRead Anthropic's Economic Index: https://www.anthropic.com/economic-indexRead more about AGI: https://www.ibm.com/think/topics/artificial-general-intelligence
What does Sam Altman have up his sleeve? In episode 41 of Mixture of Experts, join host Tim Hwang along with experts Nathalie Baracaldo, Marina Danilevsky and Chris Hay. Last week, we covered all things DeepSeek, and this week OpenAI has some new releases to share. Today, the experts dissect deep research and o3-mini. Next, our host Tim Hwang is travelling to AI Action Summit, he asks our experts what we can expect coming out of the event. Then, we talk about Anthropic’s Constitutional Classifiers. Finally, Microsoft is creating a unit to study AI’s impact, what does this mean? Find out all this and more on Mixture of Experts. 00:01 – intro 00:41 – Open AI deep research and o3-mini 13:51 – AI Action Summit 20:17 – Anthropic’s Constitutional Classifiers 28:54 – Microsoft AI Impact team The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updatesLearn more about artificial intelligenceDeepSeek's reasoning AI shows power of small models, efficiently trainedVisit Mixture of Experts podcast page to learn more AI content
Let’s bust some early myths about DeepSeek. In episode 40 of Mixture of Experts, join host Tim Hwang along with experts Aaron Baughman, Chris Hay and Kate Soule. Last week, we covered the release of DeepSeek-R1; now that the entire world is up to speed, let’s separate the facts from the hype. Next, what is model distillation and why does it matter for competition in AI? Finally, Sam Altman among other tech CEOs shared his response to DeepSeek. Will R1 radically change the open-source strategy of other tech giants? Find out all this and more on Mixture of Experts. 00:01 – Intro 00:41 – DeepSeek facts vs hype 21:00 – Model distillation 31:21 – Open source and OpenAI The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.
What does the future hold for DeepSeek? In episode 39 of Mixture of Experts, join host Tim Hwang along with experts Abraham Daniels, Kaoutar El Maghraoui and Skyler Speakman to discuss the release of DeepSeek-R1. Next, Mistral indicates going IPO. Then, FrontierMath’s new benchmark is particularly difficult, the experts debrief. Finally, IDC released a report on code assistants, what do we need to know about generalist and specialized coding assistants? Tune-in to this week’s episode to find out. 00:01 – Intro  01:08 – DeepSeek-R1 14:08 – Mistral indicates IPO 20:54 – FrontierMath controversy 30:04 -- IDC code assistants report The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
loading
Comments