Time for some (extreme) distillation with Thomas van Dongen - founder of the Minish Lab

Update: 2025-01-15

Description

Word embeddings might feel like they are a little out of fashion. After all, we have attention mechanisms and transformer models now, right? Well, it turns out that if you apply distillation the right way, you can actually get highly performant word embeddings out of a transformer. It's the technique behind the model2vec project from the Minish Lab, and in this episode we talk to the founder to learn more about it.
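To give a feel for the general idea discussed in the episode, here is a toy numpy sketch: take a teacher model's per-token vectors, reduce them with PCA, and store them in a static lookup table so that inference is just lookups plus mean pooling. This is only an illustration of the concept, not the actual model2vec implementation; the vocabulary, dimensions, and random "teacher" vectors are all made up for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "teacher" output: one 768-dim vector per vocabulary token,
# standing in for a transformer's token representations.
vocab = ["the", "cat", "sat", "on", "mat"]
teacher_vectors = rng.normal(size=(len(vocab), 768))

# Distill: reduce each token vector with PCA (via SVD on the
# centered matrix) and keep the result as a static lookup table.
centered = teacher_vectors - teacher_vectors.mean(axis=0)
_, _, vt = np.linalg.svd(centered, full_matrices=False)
dims = 4  # tiny target dimensionality for this toy example
static_table = centered @ vt[:dims].T  # shape: (vocab_size, dims)
lookup = dict(zip(vocab, static_table))

# Inference no longer needs the transformer: look up each token's
# static vector and average them into a sentence embedding.
def embed(sentence: str) -> np.ndarray:
    tokens = [t for t in sentence.split() if t in lookup]
    return np.mean([lookup[t] for t in tokens], axis=0)

print(embed("the cat sat").shape)  # (4,)
```

The appeal is that the expensive forward pass happens once, at distillation time; after that, embedding a sentence is a dictionary lookup and a mean.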

We have a Discord these days, feel free to discuss the podcast with us there! https://discord.probabl.ai

This podcast is part of the open efforts over at probabl.

To learn more, you can check out our website or reach out to us on social media.

Website: https://probabl.ai/

Bluesky: https://bsky.app/profile/probabl.bsky.social

LinkedIn: https://www.linkedin.com/company/probabl

Twitter: https://x.com/probabl_ai

#probabl
