Thomas Kwa

Karma: 1,298

Student at Caltech. Currently trying to get an AI safety inside view.

Failure modes in a shard theory alignment plan

Thomas Kwa · 27 Sep 2022 22:34 UTC
23 points
2 comments · 7 min read · LW link