RSS

Stuart_Armstrong(Stuart Armstrong)

Karma: 15,522

Hu­man pri­ors, fea­tures and mod­els, lan­guages, and Sol­monoff induction

Stuart_Armstrong10 May 2021 10:55 UTC
12 points
2 comments4 min readLW link

An­throp­ics: differ­ent prob­a­bil­ities, differ­ent questions

Stuart_Armstrong6 May 2021 13:14 UTC
19 points
1 comment15 min readLW link

Con­sis­ten­cies as (meta-)preferences

Stuart_Armstrong3 May 2021 15:10 UTC
15 points
0 comments3 min readLW link

Why un­rig­gable *al­most* im­plies uninfluenceable

Stuart_Armstrong9 Apr 2021 17:07 UTC
11 points
0 comments4 min readLW link

A pos­si­ble prefer­ence algorithm

Stuart_Armstrong8 Apr 2021 18:25 UTC
22 points
0 comments4 min readLW link

If you don’t de­sign for ex­trap­o­la­tion, you’ll ex­trap­o­late poorly—pos­si­bly fatally

Stuart_Armstrong8 Apr 2021 18:10 UTC
17 points
0 comments4 min readLW link

Which coun­ter­fac­tu­als should an AI fol­low?

Stuart_Armstrong7 Apr 2021 16:47 UTC
19 points
5 comments7 min readLW link

Toy model of prefer­ence, bias, and ex­tra information

Stuart_Armstrong24 Mar 2021 10:14 UTC
9 points
0 comments4 min readLW link

Prefer­ences and bi­ases, the in­for­ma­tion argument

Stuart_Armstrong23 Mar 2021 12:44 UTC
14 points
5 comments1 min readLW link

Why sig­moids are so hard to predict

Stuart_Armstrong18 Mar 2021 18:21 UTC
41 points
6 comments5 min readLW link

Con­nect­ing the good reg­u­la­tor the­o­rem with se­man­tics and sym­bol grounding

Stuart_Armstrong4 Mar 2021 14:35 UTC
11 points
0 comments2 min readLW link

Carte­sian frames as gen­er­al­ised models

Stuart_Armstrong16 Feb 2021 16:09 UTC
20 points
0 comments5 min readLW link

Gen­er­al­ised mod­els as a category

Stuart_Armstrong16 Feb 2021 16:08 UTC
12 points
5 comments4 min readLW link

Coun­ter­fac­tual con­trol incentives

Stuart_Armstrong21 Jan 2021 16:54 UTC
20 points
10 comments9 min readLW link

Short sum­mary of mAIry’s room

Stuart_Armstrong18 Jan 2021 18:11 UTC
26 points
2 comments4 min readLW link

Syn­tax, se­man­tics, and sym­bol ground­ing, simplified

Stuart_Armstrong23 Nov 2020 16:12 UTC
25 points
4 comments9 min readLW link

The ethics of AI for the Rout­ledge En­cy­clo­pe­dia of Philosophy

Stuart_Armstrong18 Nov 2020 17:55 UTC
45 points
8 comments1 min readLW link

Ex­tor­tion beats brinks­man­ship, but the au­di­ence matters

Stuart_Armstrong16 Nov 2020 21:13 UTC
27 points
15 comments4 min readLW link

Hu­mans are stun­ningly ra­tio­nal and stun­ningly irrational

Stuart_Armstrong23 Oct 2020 14:13 UTC
21 points
4 comments2 min readLW link

Knowl­edge, ma­nipu­la­tion, and free will

Stuart_Armstrong13 Oct 2020 17:47 UTC
32 points
15 comments3 min readLW link