DiscoverCloud Foundry WeeklyOptimizing Qwen3 CPU ONLY inference on Tanzu Platform: Cloud Foundry Weekly: Ep 55
Optimizing Qwen3 CPU ONLY inference on Tanzu Platform: Cloud Foundry Weekly: Ep 55

Optimizing Qwen3 CPU ONLY inference on Tanzu Platform: Cloud Foundry Weekly: Ep 55

Update: 2025-05-21
Share

Description

Hot off the presses in model releases - we will explore the Qwen3-30b-a3b MoE model running on the Tanzu Platform. Early testing shows it performs exceptionally well on somewhat older enterprise-grade server CPUs (aka Cascade Lake). This show will provide some insights on how enterprises can use their existing server infrastructure to start their intelligent application modernization efforts.

Comments 
loading
In Channel
loading
00:00
00:00
1.0x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Optimizing Qwen3 CPU ONLY inference on Tanzu Platform: Cloud Foundry Weekly: Ep 55

Optimizing Qwen3 CPU ONLY inference on Tanzu Platform: Cloud Foundry Weekly: Ep 55

Nick Kuhn and Nicky Pike