RSS

JustinShovelain

Karma: 624

I am the co founder of and researcher at the quantitative long term strategy organization Convergence (see here for our growing list of publications). Over the last sixteen years I have worked with MIRI, CFAR, EA Global, and Founders Fund, and done work in EA strategy, fundraising, networking, teaching, cognitive enhancement, and AI safety research. I have a MS degree in computer science and BS degrees in computer science, mathematics, and physics.

Counter-con­sid­er­a­tions on AI arms races

15 May 2025 14:54 UTC
23 points
0 comments18 min readLW link

Good­hart Ty­pol­ogy via Struc­ture, Func­tion, and Ran­dom­ness Distributions

25 Mar 2025 16:01 UTC
35 points
1 comment15 min readLW link

Bounded AI might be viable

6 Mar 2025 12:55 UTC
24 points
4 comments20 min readLW link

In­for­ma­tion-The­o­retic Box­ing of Superintelligences

30 Nov 2023 14:31 UTC
31 points
0 comments7 min readLW link

The risk-re­ward trade­off of in­ter­pretabil­ity research

5 Jul 2023 17:05 UTC
16 points
1 comment6 min readLW link

Align­ing AI by op­ti­miz­ing for “wis­dom”

27 Jun 2023 15:20 UTC
28 points
8 comments12 min readLW link

Im­prov­ing the safety of AI evals

17 May 2023 22:24 UTC
13 points
7 comments7 min readLW link

Keep hu­mans in the loop

19 Apr 2023 15:34 UTC
23 points
1 comment10 min readLW link

Up­dat­ing Utility Functions

9 May 2022 9:44 UTC
42 points
6 comments8 min readLW link

Good­hart’s Law Causal Diagrams

11 Apr 2022 13:52 UTC
35 points
6 comments6 min readLW link

How Money Fails to Track Value

JustinShovelain2 Apr 2022 12:32 UTC
17 points
0 comments5 min readLW link

Eval­u­at­ing ex­per­tise: a clear box model

JustinShovelain15 Oct 2020 14:18 UTC
37 points
3 comments5 min readLW link

Good and bad ways to think about down­side risks

11 Jun 2020 1:38 UTC
19 points
12 comments11 min readLW link

COVID-19: An op­por­tu­nity to help by mod­el­ling test­ing and trac­ing to in­form the UK government

JustinShovelain17 Apr 2020 17:21 UTC
14 points
2 comments2 min readLW link

[Question] Test­ing and con­tact trac­ing im­pact as­sess­ment model?

JustinShovelain9 Apr 2020 17:42 UTC
6 points
3 comments1 min readLW link

COVID-19: List of ideas to re­duce the di­rect harm from the virus, with an em­pha­sis on un­usual ideas

JustinShovelain9 Apr 2020 11:33 UTC
30 points
12 comments7 min readLW link

Memetic down­side risks: How ideas can evolve and cause harm

25 Feb 2020 19:47 UTC
27 points
3 comments15 min readLW link

In­for­ma­tion haz­ards: Why you should care and what you can do

23 Feb 2020 20:47 UTC
18 points
4 comments15 min readLW link

Map­ping down­side risks and in­for­ma­tion hazards

20 Feb 2020 14:46 UTC
23 points
0 comments9 min readLW link

Us­ing vec­tor fields to vi­su­al­ise prefer­ences and make them consistent

28 Jan 2020 19:44 UTC
42 points
32 comments11 min readLW link