DiscoverTic-Tac-Toe the Hard WayGive that model a treat! : Reinforcement learning explained
Give that model a treat! : Reinforcement learning explained

Give that model a treat! : Reinforcement learning explained

Update: 2020-07-22
Share

Description

Switching gears, we focus on how Yannick’s been training his model using reinforcement learning.  He explains the differences from David’s supervised learning approach. We find out how his system performs against a player that makes random tic-tac-toe moves.

Resources: 

Deep Learning for JavaScript book

Playing Atari with Deep Reinforcement Learning

Two Minute Papers episode on Atari DQN

For more information about the show, check out pair.withgoogle.com/thehardway/.


You can reach out to the hosts on Twitter: @dweinberger and @tafsiri


Comments 
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

Give that model a treat! : Reinforcement learning explained

Give that model a treat! : Reinforcement learning explained

People + AI Research