RSS

Stuart_Armstrong(Stuart Armstrong)

Karma: 22,745

Com­par­ing re­ward learn­ing/​re­ward tam­per­ing formalisms

Stuart_Armstrong
21 May 2020 12:03 UTC
9 points
0 comments3 min readLW link

Prob­a­bil­ities, weights, sums: pretty much the same for re­ward functions

Stuart_Armstrong
20 May 2020 15:19 UTC
11 points
1 comment2 min readLW link

Learn­ing and ma­nipu­lat­ing learning

Stuart_Armstrong
19 May 2020 13:02 UTC
38 points
4 comments10 min readLW link

Re­ward func­tions and up­dat­ing as­sump­tions can hide a mul­ti­tude of sins

Stuart_Armstrong
18 May 2020 15:18 UTC
16 points
2 comments9 min readLW link

How should AIs up­date a prior over hu­man prefer­ences?

Stuart_Armstrong
15 May 2020 13:14 UTC
17 points
9 comments2 min readLW link

Dist­in­guish­ing lo­gis­tic curves

Stuart_Armstrong
15 May 2020 11:38 UTC
23 points
0 comments7 min readLW link

Dist­in­guish­ing lo­gis­tic curves: visual

Stuart_Armstrong
15 May 2020 10:33 UTC
9 points
0 comments1 min readLW link

Kurzweil’s pre­dic­tions’ in­di­vi­d­ual scores

Stuart_Armstrong
7 May 2020 17:10 UTC
17 points
0 comments1 min readLW link