Finding the label errors with Cleanlab with Curtis Northcutt - 006

Update: 2020-01-23
Description

In this episode, I talk with Curtis Northcutt about cleanlab, his Python package for finding label errors in datasets. Cleanlab computes cross-validated predicted probabilities, the confident joint, and other statistics used to estimate uncertainty in dataset labels. It then ranks the labels by their probability of being wrong, so the likely errors are easy to find.
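
To make the idea concrete, here is a minimal pure-Python sketch of the confident-joint step and the ranking described above. It is an illustration, not cleanlab's actual API: the function names (`class_thresholds`, `confident_joint`, `rank_label_errors`) and the toy data are hypothetical, and it assumes the predicted probabilities were already obtained out-of-sample (for example via cross-validation).

```python
from collections import defaultdict


def class_thresholds(labels, probs):
    """Per-class threshold: the mean predicted probability of class k
    over the examples whose given label is k."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for y, p in zip(labels, probs):
        sums[y] += p[y]
        counts[y] += 1
    return {k: sums[k] / counts[k] for k in counts}


def confident_joint(labels, probs, thresholds):
    """Count examples by (given label, confidently predicted label).

    An example is confidently counted toward class k when its predicted
    probability for k reaches k's threshold; if several classes qualify,
    the most probable one wins. Examples with no qualifying class are skipped.
    """
    n_classes = len(probs[0])
    joint = [[0] * n_classes for _ in range(n_classes)]
    guesses = []
    for y, p in zip(labels, probs):
        candidates = [k for k in range(n_classes)
                      if k in thresholds and p[k] >= thresholds[k]]
        if not candidates:
            guesses.append(None)
            continue
        guess = max(candidates, key=lambda k: p[k])
        joint[y][guess] += 1
        guesses.append(guess)
    return joint, guesses


def rank_label_errors(labels, probs):
    """Return indices of suspected label errors, most suspicious first.

    Suspects are the off-diagonal entries of the confident joint, ordered
    by the predicted probability of their given label (lowest first).
    """
    thresholds = class_thresholds(labels, probs)
    _, guesses = confident_joint(labels, probs, thresholds)
    suspects = [i for i, (y, g) in enumerate(zip(labels, guesses))
                if g is not None and g != y]
    return sorted(suspects, key=lambda i: probs[i][labels[i]])


# Hypothetical toy data: six examples, two classes.
# Examples 2 and 5 carry labels that disagree with confident predictions.
labels = [0, 0, 0, 1, 1, 1]
probs = [
    [0.90, 0.10],
    [0.80, 0.20],
    [0.10, 0.90],
    [0.20, 0.80],
    [0.10, 0.90],
    [0.85, 0.15],
]
print(rank_label_errors(labels, probs))  # prints [2, 5]
```

For real datasets, the cleanlab package linked below is the place to look; this sketch only shows the shape of the computation.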

Curtis' website:
https://www.curtisnorthcutt.com/

Curtis on LinkedIn:
https://www.linkedin.com/in/cgnorthcutt/

Cleanlab on GitHub:
https://github.com/cgnorthcutt/cleanlab

Cleanlab's blog:
https://l7.curtisnorthcutt.com/cleanlab-python-package

White Papers:
https://arxiv.org/abs/1911.00068
https://arxiv.org/abs/1705.01936


Music by Curtis (PomDP the PhD rapper):
https://soundcloud.com/thephdrapper/bars-on-bars
https://soundcloud.com/thephdrapper/crown
https://soundcloud.com/thephdrapper/dub-dub
https://open.spotify.com/album/2Fjg3zF8PGEg9WWNoeyx3X

---General Info---

Podcast's Website: http://machinelearningcafe.org/
Host's LinkedIn: https://www.linkedin.com/in/miklostoth/
Email of the host: miklos@machinelearningcafe.org

---Copyright Info---
Music is from https://filmmusic.io. Intro, first part: by Miklos Toth, with some free GarageBand loops. :) Intro, second part: "Aces High" by Kevin MacLeod. Outro: "Bars on Bars" by Curtis Northcutt (played with his explicit permission).
