davidad

Karma: 2,045

Programme Director at the UK Advanced Research + Invention Agency, focusing on safe transformative AI; formerly at Protocol Labs, FHI/Oxford, Harvard Biophysics, and MIT Mathematics and Computation.

A list of core AI safety problems and how I hope to solve them

davidad, 26 Aug 2023 15:12 UTC
157 points
23 comments, 5 min read, LW link

You can still fetch the coffee today if you’re dead tomorrow

davidad, 9 Dec 2022 14:06 UTC
84 points
19 comments, 5 min read, LW link

Compute Thresholds: proposed rules to mitigate risk of a “lab leak” accident during AI training runs

davidad, 22 Jul 2023 18:09 UTC
80 points
2 comments, 2 min read, LW link

An Open Agency Architecture for Safe Transformative AI

davidad, 20 Dec 2022 13:04 UTC
79 points
22 comments, 4 min read, LW link

Why I Moved from AI to Neuroscience, or: Uploading Worms

davidad, 13 Apr 2012 7:10 UTC
67 points
58 comments, 1 min read, LW link

AI Neorealism: a threat model & success criterion for existential safety

davidad, 15 Dec 2022 13:42 UTC
64 points
1 comment, 3 min read, LW link

Reframing inner alignment

davidad, 11 Dec 2022 13:53 UTC
53 points
13 comments, 4 min read, LW link

Side-channels: input versus output

davidad, 12 Dec 2022 12:32 UTC
44 points
16 comments, 2 min read, LW link

The Promise and Peril of Finite Sets

davidad, 10 Dec 2021 12:29 UTC
42 points
4 comments, 6 min read, LW link

Cryptoepistemology

davidad, 24 Feb 2022 20:34 UTC
30 points
3 comments, 2 min read, LW link