#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Update: 2021-08-02

Description

Podcast: Lex Fridman Podcast (LS 77 · TOP 0.01%)
Episode: #206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision
Pub date: 2021-07-31

Ishan Misra is a research scientist at FAIR working on self-supervised visual learning. Please support this podcast by checking out our sponsors:

– Onnit: https://lexfridman.com/onnit to get up to 10% off

– The Information: https://theinformation.com/lex to get 75% off first month

– Grammarly: https://grammarly.com/lex to get 20% off premium

– Athletic Greens: https://athleticgreens.com/lex and use code LEX to get 1 month of fish oil

EPISODE LINKS:

Ishan’s twitter: https://twitter.com/imisra_

Ishan’s website: https://imisra.github.io

Ishan’s FAIR page: https://ai.facebook.com/people/ishan-misra/

PODCAST INFO:

Podcast website: https://lexfridman.com/podcast

Apple Podcasts: https://apple.co/2lwqZIr

Spotify: https://spoti.fi/2nEwCF8

RSS: https://lexfridman.com/feed/podcast/

YouTube Full Episodes: https://youtube.com/lexfridman

YouTube Clips: https://youtube.com/lexclips

SUPPORT & CONNECT:

– Check out the sponsors above, it’s the best way to support this podcast

– Support on Patreon: https://www.patreon.com/lexfridman

– Twitter: https://twitter.com/lexfridman

– Instagram: https://www.instagram.com/lexfridman

– LinkedIn: https://www.linkedin.com/in/lexfridman

– Facebook: https://www.facebook.com/lexfridman

– Medium: https://medium.com/@lexfridman

OUTLINE:

Here are the timestamps for the episode. On some podcast players, you should be able to click a timestamp to jump to that time.

(00:00 ) – Introduction

(07:49 ) – Self-supervised learning

(16:24 ) – Self-supervised learning is the dark matter of intelligence

(20:17 ) – Categorization

(28:50 ) – Is computer vision still really hard?

(32:35 ) – Understanding Language

(42:14 ) – Harder to solve: vision or language

(48:59 ) – Contrastive learning & energy-based models

(52:59 ) – Data augmentation

(57:19 ) – Fixed audio spike by lowering sound with pen tool

(1:05:33 ) – Real data vs. augmented data

(1:09:16 ) – Non-contrastive, energy-based self-supervised learning methods

(1:12:54 ) – Unsupervised learning (SwAV)

(1:15:37 ) – Self-supervised Pretraining (SEER)

(1:20:44 ) – Self-supervised learning (SSL) architectures

(1:26:43 ) – VISSL: PyTorch-based SSL library

(1:29:38 ) – Multi-modal

(1:37:06 ) – Active learning

(1:42:45 ) – Autonomous driving

(1:54:12 ) – Limits of deep learning

(1:58:19 ) – Difference between learning and reasoning

(2:03:26 ) – Building super-human AI

(2:11:14 ) – Most beautiful idea in self-supervised learning

(2:15:02 ) – Simulation for training AI

(2:18:27 ) – Video games replacing reality

(2:19:40 ) – How to write a good research paper

(2:24:08 ) – Best programming language for beginners

(2:25:01 ) – PyTorch vs TensorFlow

(2:28:26 ) – Advice for getting into machine learning

(2:30:31 ) – Advice for young people

(2:32:58 ) – Meaning of life
