Economical way of serving vector search workloads with Simon Eskildsen, CEO Turbopuffer

Update: 2025-09-19

Description

The Turbopuffer search engine powers products such as Cursor, Notion, Linear, Superhuman, and Readwise.

This episode on YouTube: https://youtu.be/I8Ztqajighg

Medium: https://dmitry-kan.medium.com/vector-podcast-simon-eskildsen-turbopuffer-69e456da8df3

Dev: https://dev.to/vectorpodcast/vector-podcast-simon-eskildsen-turbopuffer-cfa

If you are on the Lucene / OpenSearch stack, you can move to a managed service by signing up here: https://console.aiven.io/signup?utm_source=youtube&utm_medium=&&utm_content=vectorpodcast

Time codes:

00:00 Intro

00:15 Napkin Problem 4: Throughput of Redis

01:35 Episode intro

02:45 Simon's background, including implementation of Turbopuffer

09:23 How Cursor became an early client

11:25 How to test pre-launch

14:38 Why a new vector DB deserves to exist

20:39 Latency aspect

26:27 Implementation language for Turbopuffer

28:11 Impact of LLM coding tools on programmer craft

30:02 Engineer-to-CEO transition

35:10 Architecture of Turbopuffer

43:25 Disk vs S3 latency, NVMe disks, DRAM

48:27 Multitenancy

50:29 Recall@N benchmarking

59:38 Filtered ANN and Big-ANN Benchmarks

1:00:54 What users care about more (than Recall@N benchmarking)

1:01:28 Spicy question about benchmarking in competition

1:06:01 Interesting challenges ahead to tackle

1:10:13 Simon's announcement

Show notes:

- Turbopuffer in Cursor: https://www.youtube.com/watch?v=oFfVt3S51T4&t=5223s

  Transcript: https://lexfridman.com/cursor-team-transcript

- Turbopuffer: https://turbopuffer.com/

- Napkin Math: https://sirupsen.com/napkin

- Follow Simon on X: https://x.com/Sirupsen

- Not All Vector Databases Are Made Equal: https://towardsdatascience.com/milvus-pinecone-vespa-weaviate-vald-gsi-what-unites-these-buzz-words-and-what-makes-each-9c65a3bd0696/


In Channel

Trey Grainger - Wormhole Vectors

2025-11-07 01:19:17

Economical way of serving vector search workloads with Simon Eskildsen, CEO Turbopuffer

2025-09-19 01:15:26

Adding ML layer to Search: Hybrid Search Optimizer with Daniel Wrigley and Eric Pugh

2025-03-21 01:03:09

Vector Databases: The Rise, Fall and Future - by NotebookLM

2025-03-02 19:52

Code search, Copilot, LLM prompting with empathy and Artifacts with John Berryman

2025-02-10 01:07:24

Debunking myths of vector search and LLMs with Leo Boytsov

2025-01-17 01:07:54

Berlin Buzzwords 2024 - Alessandro Benedetti - LLMs in Solr

2024-11-07 38:04

Berlin Buzzwords 2024 - Sonam Pankaj - EmbedAnything

2024-09-19 23:00

Berlin Buzzwords 2024 - Doug Turnbull - Learning in Public

2024-07-18 27:29

Eric Pugh - Measuring Search Quality with Quepid

2024-06-26 47:37

Sid Probstein, part II - Bring AI to company data with SWIRL

2024-05-15 38:15

Louis Brandy - SQL meets Vector Search at Rockset

2024-05-01 52:50

Saurabh Rai - Growing Resume Matcher

2024-04-12 26:15

Sid Probstein - Creator of SWIRL - Search in siloed data with LLMs

2023-07-22 01:32:23

Atita Arora - Search Relevance Consultant - Revolutionizing E-commerce with Vector Search

2023-05-17 01:32:20

Connor Shorten - Research Scientist, Weaviate - ChatGPT, LLMs, Form vs Meaning

2023-03-11 01:33:11

Evgeniya Sukhodolskaya - Data Advocate, Toloka - Data at the core of all the cool ML

2023-01-28 01:26:45

Yaniv Vaknin - Director of Product, Searchium - Hardware accelerated vector search

2022-12-21 01:13:31

Doug Turnbull - Staff Relevance Engineer, Shopify - Search as a constant experimentation cycle

2022-10-01 01:33:20

Malte Pietsch - CTO, Deepset - Passion in NLP and bridging the academia-industry gap with Haystack

2022-08-30 01:26:10
