RSS

JustinShovelain

Karma: 539

I am the co founder of and researcher at the quantitative long term strategy organization Convergence (see here for our growing list of publications). Over the last fourteen years I have worked with MIRI, CFAR, EA Global, and Founders Fund, and done work in EA strategy, fundraising, networking, teaching, cognitive enhancement, and AI safety research. I have a MS degree in computer science and BS degrees in computer science, mathematics, and physics.

In­for­ma­tion-The­o­retic Box­ing of Superintelligences

30 Nov 2023 14:31 UTC
30 points
0 comments7 min readLW link

The risk-re­ward trade­off of in­ter­pretabil­ity research

5 Jul 2023 17:05 UTC
15 points
1 comment6 min readLW link

Align­ing AI by op­ti­miz­ing for “wis­dom”

27 Jun 2023 15:20 UTC
22 points
7 comments12 min readLW link

Im­prov­ing the safety of AI evals

17 May 2023 22:24 UTC
13 points
7 comments7 min readLW link

Keep hu­mans in the loop

19 Apr 2023 15:34 UTC
22 points
1 comment10 min readLW link

Up­dat­ing Utility Functions

9 May 2022 9:44 UTC
37 points
6 comments8 min readLW link

Good­hart’s Law Causal Diagrams

11 Apr 2022 13:52 UTC
32 points
5 comments6 min readLW link

How Money Fails to Track Value

JustinShovelain2 Apr 2022 12:32 UTC
17 points
0 comments5 min readLW link

Eval­u­at­ing ex­per­tise: a clear box model

JustinShovelain15 Oct 2020 14:18 UTC
36 points
3 comments5 min readLW link

Good and bad ways to think about down­side risks

11 Jun 2020 1:38 UTC
18 points
12 comments11 min readLW link

COVID-19: An op­por­tu­nity to help by mod­el­ling test­ing and trac­ing to in­form the UK government

JustinShovelain17 Apr 2020 17:21 UTC
14 points
2 comments2 min readLW link

[Question] Test­ing and con­tact trac­ing im­pact as­sess­ment model?

JustinShovelain9 Apr 2020 17:42 UTC
6 points
3 comments1 min readLW link

COVID-19: List of ideas to re­duce the di­rect harm from the virus, with an em­pha­sis on un­usual ideas

JustinShovelain9 Apr 2020 11:33 UTC
30 points
12 comments7 min readLW link

Memetic down­side risks: How ideas can evolve and cause harm

25 Feb 2020 19:47 UTC
21 points
3 comments15 min readLW link

In­for­ma­tion haz­ards: Why you should care and what you can do

23 Feb 2020 20:47 UTC
18 points
4 comments15 min readLW link

Map­ping down­side risks and in­for­ma­tion hazards

20 Feb 2020 14:46 UTC
22 points
0 comments9 min readLW link

Us­ing vec­tor fields to vi­su­al­ise prefer­ences and make them consistent

28 Jan 2020 19:44 UTC
41 points
32 comments11 min readLW link

AI al­ign­ment con­cepts: philo­soph­i­cal break­ers, stop­pers, and distorters

JustinShovelain24 Jan 2020 19:23 UTC
20 points
3 comments3 min readLW link

Safety reg­u­la­tors: A tool for miti­gat­ing tech­nolog­i­cal risk

JustinShovelain21 Jan 2020 13:07 UTC
13 points
4 comments4 min readLW link

FAI Re­search Con­straints and AGI Side Effects

JustinShovelain3 Jun 2015 19:25 UTC
27 points
59 comments7 min readLW link