DiscoverThe InfoQ PodcastMeryem Arik on LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack
Meryem Arik on LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack

Meryem Arik on LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack

Update: 2024-06-10
Share

Description

In this podcast, Meryem Arik, Co-founder/CEO at TitanML, discusses the innovations in Generative AI and Large Language Model (LLM) technologies including current state of large language models, LLM Deployment, state-of-the-art Retrieval Augmented Generation (RAG) apps, and inference architecture stack for LLM applications.

Read a transcript of this interview: https://bit.ly/3X5ZVPu

Subscribe to the Software Architects’ Newsletter for your monthly guide to the essential news and experience from industry peers on emerging patterns and technologies:
www.infoq.com/software-architects-newsletter

Upcoming Events:

InfoQ Dev Summit Boston (June 24-25, 2024)
Actionable insights on today’s critical dev priorities.
devsummit.infoq.com/conference/boston2024

InfoQ Dev Summit Munich (Sept 26-27, 2024)
Practical learnings from senior software practitioners navigating Generative AI, security, modern web applications, and more.
devsummit.infoq.com/conference/munich2024

QCon San Francisco (November 18-22, 2024)
Get practical inspiration and best practices on emerging software trends directly from senior software developers at early adopter companies.
qconsf.com/

QCon London (April 7-9, 2025)
Discover new ideas and insights from senior practitioners driving change and innovation in software development.
qconlondon.com/

The InfoQ Podcasts:
Weekly inspiration to drive innovation and build great teams from senior software leaders. Listen to all our podcasts and read interview transcripts:

- The InfoQ Podcast www.infoq.com/podcasts/
- Engineering Culture Podcast by InfoQ www.infoq.com/podcasts/#engineering_culture
- Generally AI

Follow InfoQ:
- Mastodon: techhub.social/@infoq
- Twitter: twitter.com/InfoQ
- LinkedIn: www.linkedin.com/company/infoq
- Facebook: bit.ly/2jmlyG8
- Instagram: @infoqdotcom
- Youtube: www.youtube.com/infoq

Write for InfoQ:
Learn and share the changes and innovations in professional software development.
- Join a community of experts.
- Increase your visibility.
- Grow your career.
www.infoq.com/write-for-infoq
Comments 
loading
In Channel
loading
00:00
00:00
1.0x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Meryem Arik on LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack

Meryem Arik on LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack

InfoQ