RSS

jbkjr

Karma: 288

In­te­grat­ing Three Models of (Hu­man) Cognition

jbkjr23 Nov 2021 1:06 UTC
31 points
4 comments32 min readLW link

Grokking the In­ten­tional Stance

jbkjr31 Aug 2021 15:49 UTC
43 points
22 comments20 min readLW link1 review

Dis­cus­sion: Ob­jec­tive Ro­bust­ness and In­ner Align­ment Terminology

23 Jun 2021 23:25 UTC
73 points
7 comments9 min readLW link

Em­piri­cal Ob­ser­va­tions of Ob­jec­tive Ro­bust­ness Failures

23 Jun 2021 23:23 UTC
63 points
5 comments9 min readLW link

[Question] Old post/​writ­ing on op­ti­miza­tion dae­mons?

jbkjr15 Apr 2021 18:00 UTC
2 points
2 comments1 min readLW link

Map­ping the Con­cep­tual Ter­ri­tory in AI Ex­is­ten­tial Safety and Alignment

jbkjr12 Feb 2021 7:55 UTC
15 points
0 comments26 min readLW link