Stuart_Armstrong

Karma: 18,023

Technical model refinement formalism

Stuart_Armstrong, Aug 27, 2020, 11:54 AM
19 points
0 comments, 6 min read

Model splintering: moving from one imperfect model to another

Stuart_Armstrong, Aug 27, 2020, 11:53 AM
79 points
10 comments, 33 min read

Learning human preferences: black-box, white-box, and structured white-box access

Stuart_Armstrong, Aug 24, 2020, 11:42 AM
26 points
9 comments, 6 min read

AI safety as featherless bipeds *with broad flat nails*

Stuart_Armstrong, Aug 19, 2020, 10:22 AM
38 points
1 comment, 1 min read

Learning human preferences: optimistic and pessimistic scenarios

Stuart_Armstrong, Aug 18, 2020, 1:05 PM
27 points
6 comments, 6 min read

Strong implication of preference uncertainty

Stuart_Armstrong, Aug 12, 2020, 7:02 PM
19 points
3 comments, 2 min read

“Go west, young man!”—Preferences in (imperfect) maps

Stuart_Armstrong, Jul 31, 2020, 7:50 AM
25 points
10 comments, 3 min read

Learning Values in Practice

Stuart_Armstrong, Jul 20, 2020, 6:38 PM
24 points
0 comments, 5 min read

The Goldbach conjecture is probably correct; so was Fermat’s last theorem

Stuart_Armstrong, Jul 14, 2020, 7:30 PM
82 points
28 comments, 4 min read

Why is the impact penalty time-inconsistent?

Stuart_Armstrong, Jul 9, 2020, 5:26 PM
16 points
1 comment, 2 min read

Dynamic inconsistency of the inaction and initial state baseline

Stuart_Armstrong, Jul 7, 2020, 12:02 PM
30 points
8 comments, 2 min read

Models, myths, dreams, and Cheshire cat grins

Stuart_Armstrong, Jun 24, 2020, 10:50 AM
21 points
7 comments, 2 min read

Results of $1,000 Oracle contest!

Stuart_Armstrong, Jun 17, 2020, 5:44 PM
60 points
2 comments, 1 min read

Comparing reward learning/reward tampering formalisms

Stuart_Armstrong, May 21, 2020, 12:03 PM
9 points
3 comments, 3 min read

Probabilities, weights, sums: pretty much the same for reward functions

Stuart_Armstrong, May 20, 2020, 3:19 PM
11 points
1 comment, 2 min read

Learning and manipulating learning

Stuart_Armstrong, May 19, 2020, 1:02 PM
39 points
5 comments, 10 min read

Reward functions and updating assumptions can hide a multitude of sins

Stuart_Armstrong, May 18, 2020, 3:18 PM
16 points
2 comments, 9 min read

How should AIs update a prior over human preferences?

Stuart_Armstrong, May 15, 2020, 1:14 PM
17 points
9 comments, 2 min read

Distinguishing logistic curves

Stuart_Armstrong, May 15, 2020, 11:38 AM
24 points
0 comments, 7 min read

Distinguishing logistic curves: visual

Stuart_Armstrong, May 15, 2020, 10:33 AM
17 points
0 comments, 1 min read