RSS

OpenAI an­nounces GPT-3

gwern
29 May 2020 1:49 UTC
29 points
1 comment1 min readLW link
(arxiv.org)

[AN #101]: Why we should rigor­ously mea­sure and fore­cast AI progress

rohinmshah
27 May 2020 17:20 UTC
15 points
0 comments10 min readLW link
(mailchi.mp)

AI Safety Dis­cus­sion Days

Linda Linsefors
27 May 2020 16:54 UTC
9 points
0 comments3 min readLW link

How can In­ter­pretabil­ity help Align­ment?

23 May 2020 16:16 UTC
31 points
3 comments9 min readLW link

AGIs as populations

ricraz
22 May 2020 20:36 UTC
20 points
22 comments4 min readLW link

Com­par­ing re­ward learn­ing/​re­ward tam­per­ing formalisms

Stuart_Armstrong
21 May 2020 12:03 UTC
9 points
0 comments3 min readLW link

[AN #100]: What might go wrong if you learn a re­ward func­tion while acting

rohinmshah
20 May 2020 17:30 UTC
33 points
2 comments12 min readLW link
(mailchi.mp)

Prob­a­bil­ities, weights, sums: pretty much the same for re­ward functions

Stuart_Armstrong
20 May 2020 15:19 UTC
11 points
1 comment2 min readLW link

Learn­ing and ma­nipu­lat­ing learning

Stuart_Armstrong
19 May 2020 13:02 UTC
38 points
4 comments10 min readLW link

Point­ing to a Flower

johnswentworth
18 May 2020 18:54 UTC
51 points
18 comments9 min readLW link

The Mechanis­tic and Nor­ma­tive Struc­ture of Agency

G Gordon Worley III
18 May 2020 16:03 UTC
14 points
4 comments1 min readLW link
(philpapers.org)

Re­ward func­tions and up­dat­ing as­sump­tions can hide a mul­ti­tude of sins

Stuart_Armstrong
18 May 2020 15:18 UTC
16 points
2 comments9 min readLW link

Why you should min­i­max in two-player zero-sum games

Nisan
17 May 2020 20:48 UTC
17 points
1 comment1 min readLW link

Multi-agent safety

ricraz
16 May 2020 1:59 UTC
19 points
7 comments5 min readLW link

Con­jec­ture Workshop

johnswentworth
15 May 2020 22:41 UTC
34 points
2 comments2 min readLW link

How should AIs up­date a prior over hu­man prefer­ences?

Stuart_Armstrong
15 May 2020 13:14 UTC
17 points
9 comments2 min readLW link

[AN #99]: Dou­bling times for the effi­ciency of AI algorithms

rohinmshah
13 May 2020 17:20 UTC
30 points
0 comments10 min readLW link
(mailchi.mp)

Book re­port: The­ory of Games and Eco­nomic Be­hav­ior (von Neu­mann & Mor­gen­stern)

Nisan
11 May 2020 9:47 UTC
37 points
4 comments6 min readLW link

Cor­rigi­bil­ity as out­side view

TurnTrout
8 May 2020 21:56 UTC
31 points
11 comments4 min readLW link

Speci­fi­ca­tion gam­ing: the flip side of AI ingenuity

6 May 2020 23:51 UTC
41 points
3 comments6 min readLW link