RSS

LawrenceC(Lawrence Chan)

Karma: 5,069

I do AI Alignment research. Currently independent, but previously at: METR, Redwood, UC Berkeley, Good Judgment Project.

I’m also a part-time fund manager for the LTFF.

Obligatory research billboard website: https://​​chanlawrence.me/​​

[Question] What progress have we made on au­to­mated au­dit­ing?

LawrenceC6 Jul 2024 1:49 UTC
37 points
1 comment1 min readLW link

Com­pact Proofs of Model Perfor­mance via Mechanis­tic Interpretability

24 Jun 2024 19:27 UTC
92 points
3 comments8 min readLW link
(arxiv.org)