DiscoverAWS re:Think PodcastEpisode 20: AI Accelerators in the Cloud
Episode 20: AI Accelerators in the Cloud

Episode 20: AI Accelerators in the Cloud

Update: 2024-02-01
Share

Description

In this episode we meet with Matthew McClean, a Sr Manager from AWS's Annapurna Team to talk about Accelerators - the chips that make AI possible. We cover different Accelerators - GPUs, Trainium, Inferentia, Graviton and more.


AWS Hosts: Nolan Chen & Malini Chatterjee

Email Your Feedback: rethinkpodcast@amazon.com


Links for the Show:

AWS re:Invent 2023 - Behind-the-scenes look at generative AI infrastructure at Amazon (CMP206)

https://www.youtube.com/watch?v=fDk09hms8s8


AWS re:Invent 2023 - Behind-the-scenes look at generative AI infrastructure at Amazon (CMP206)

https://www.youtube.com/watch?v=VPvguzeWlbU


Welcome to AWS Neuron

https://awsdocs-neuron.readthedocs-hosted.com/en/latest/index.html


Optimum Neuron

https://huggingface.co/docs/optimum-neuron/index


Comments 
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Episode 20: AI Accelerators in the Cloud

Episode 20: AI Accelerators in the Cloud