RSS

beren(Beren Millidge)

Karma: 971

Em­pa­thy as a nat­u­ral con­se­quence of learnt re­ward models

beren4 Feb 2023 15:35 UTC
30 points
12 comments13 min readLW link

AGI will have learnt util­ity functions

beren25 Jan 2023 19:42 UTC
28 points
2 comments13 min readLW link

Gra­di­ent hack­ing is ex­tremely difficult

beren24 Jan 2023 15:45 UTC
141 points
18 comments5 min readLW link