Episode 20: AI Accelerators in the Cloud
Description
In this episode we meet with Matthew McClean, a Sr. Manager on AWS's Annapurna team, to talk about accelerators, the chips that make AI possible. We cover different accelerators: GPUs, Trainium, Inferentia, Graviton, and more.
AWS Hosts: Nolan Chen & Malini Chatterjee
Email Your Feedback: rethinkpodcast@amazon.com
Links for the Show:
AWS re:Invent 2023 - Behind-the-scenes look at generative AI infrastructure at Amazon (CMP206)
https://www.youtube.com/watch?v=fDk09hms8s8
AWS re:Invent 2023 - Behind-the-scenes look at generative AI infrastructure at Amazon (CMP206)
https://www.youtube.com/watch?v=VPvguzeWlbU
Welcome to AWS Neuron
https://awsdocs-neuron.readthedocs-hosted.com/en/latest/index.html
Optimum Neuron
https://huggingface.co/docs/optimum-neuron/index