Reinforcement learning #RB7

Update: 2020-02-26

Description

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model.
SAHSS+19.
https://arxiv.org/abs/1911.08265

Find out more on the Robustly Beneficial Wiki:
https://robustlybeneficial.org/wiki/index.php?title=Reinforcement_learning

Next week's paper is:
A Roadmap for Robust End-to-End Alignment. LN Hoang 18.
https://arxiv.org/abs/1809.01036

Comments

In Channel

The Social Dilemma #RB23

2020-10-0555:26

The Complexity of Agreement #RB22

2020-09-2728:07

Computable philosophy #RB21

2020-08-0251:57

The online competition between pro- and anti-vaccination views #RB20

2020-07-2525:52

Stanford Encyclopaedia of Philosophy Entry on Ethics of Artificial Intelligence - #RB19

2020-07-0838:37

Does increasing diversity reduce polarization? #RB18

2020-06-2018:20

The Philosophical Aspects of Computing and Complexity #RB17

2020-06-1101:04:06

Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims (BAWB+2020) #RB16

2020-06-0348:07

AI vs COVID19 #RB15

2020-04-2701:02:08

Privacy-Preserving Contact Tracing #RB14

2020-04-2438:44

Security and Privacy in Machine Learning #RB13

2020-04-1836:44

The Mathematical Ethics of Clinical Trials #RB12

2020-04-0337:07

AI Safety via Debates #RB11

2020-03-2929:44

Misinformation on Social Media #RB10

2020-03-2244:28

User-driven ethics #RB9

2020-03-1351:16

A roadmap towards robustly beneficial AIs #RB8

2020-03-0550:33

Reinforcement learning #RB7

2020-02-2646:46

Can autonomous weapons be safe? #RB6

2020-02-2137:13

Preference learning from comparisons #RB5

2020-02-1439:07

Can We Study Long Term Effects? #RB4

2020-02-0735:32

00:00

Reinforcement learning #RB7

Robustly Beneficial Podcast

#box-pro-ellipsis-176163156707950{-webkit-line-clamp:2;}Reinforcement learning #RB7

Reinforcement learning #RB7

Robustly Beneficial Podcast

Reinforcement learning #RB7