robertzk

Karma: 465

The rationalist’s checklist

robertzk16 Dec 2011 16:21 UTC

44 points

8 comments1 min readLW link

Ask LW: ω-self-aware systems

robertzk16 Dec 2012 22:18 UTC

0 points

10 comments1 min readLW link

Emily Brontë on: Psychology Required for Serious™ AGI Safety Research

robertzk14 Sep 2022 14:47 UTC

2 points

0 comments1 min readLW link

Getting up to Speed on the Speed Prior in 2022

robertzk28 Dec 2022 7:49 UTC

36 points

5 comments65 min readLW link

Training Process Transparency through Gradient Interpretability: Early experiments on toy language models

robertzk and evhub

21 Jul 2023 14:52 UTC

56 points

1 comment1 min readLW link

We Inspected Every Head In GPT-2 Small using SAEs So You Don’t Have To

robertzk, Connor Kissane, Arthur Conmy and Neel Nanda

6 Mar 2024 5:03 UTC

56 points

0 comments12 min readLW link