JustinShovelain

Karma: 539

I am the co founder of and researcher at the quantitative long term strategy organization Convergence (see here for our growing list of publications). Over the last fourteen years I have worked with MIRI, CFAR, EA Global, and Founders Fund, and done work in EA strategy, fundraising, networking, teaching, cognitive enhancement, and AI safety research. I have a MS degree in computer science and BS degrees in computer science, mathematics, and physics.

Coffee: When it helps, when it hurts

JustinShovelain10 Mar 2010 6:14 UTC

52 points

109 comments1 min readLW link

Updating Utility Functions

JustinShovelain and Joar Skalse

9 May 2022 9:44 UTC

37 points

6 comments8 min readLW link

Evaluating expertise: a clear box model

JustinShovelain15 Oct 2020 14:18 UTC

36 points

3 comments5 min readLW link

Goodhart’s Law Causal Diagrams

JustinShovelain and Jeremy Gillen

11 Apr 2022 13:52 UTC

32 points

5 comments6 min readLW link

Information-Theoretic Boxing of Superintelligences

JustinShovelain and Elliot_Mckernon

30 Nov 2023 14:31 UTC

30 points

0 comments7 min readLW link

Sequential Organization of Thinking: “Six Thinking Hats”

JustinShovelain18 Mar 2010 5:22 UTC

30 points

14 comments3 min readLW link

COVID-19: List of ideas to reduce the direct harm from the virus, with an emphasis on unusual ideas

JustinShovelain9 Apr 2020 11:33 UTC

30 points

12 comments7 min readLW link

FAI Research Constraints and AGI Side Effects

JustinShovelain3 Jun 2015 19:25 UTC

27 points

59 comments7 min readLW link

Causes of disagreements

JustinShovelain16 Jul 2009 21:51 UTC

27 points

20 comments4 min readLW link

Keep humans in the loop

JustinShovelain and Elliot_Mckernon

19 Apr 2023 15:34 UTC

22 points

1 comment10 min readLW link

Aligning AI by optimizing for “wisdom”

JustinShovelain and Elliot_Mckernon

27 Jun 2023 15:20 UTC

22 points

7 comments12 min readLW link

AI alignment concepts: philosophical breakers, stoppers, and distorters

JustinShovelain24 Jan 2020 19:23 UTC

20 points

3 comments3 min readLW link

How Money Fails to Track Value

JustinShovelain2 Apr 2022 12:32 UTC

17 points

0 comments5 min readLW link

The risk-reward tradeoff of interpretability research

JustinShovelain and Elliot_Mckernon

5 Jul 2023 17:05 UTC

15 points

1 comment6 min readLW link

COVID-19: An opportunity to help by modelling testing and tracing to inform the UK government

JustinShovelain17 Apr 2020 17:21 UTC

14 points

2 comments2 min readLW link

Improving the safety of AI evals

JustinShovelain and Elliot_Mckernon

17 May 2023 22:24 UTC

13 points

7 comments7 min readLW link

Safety regulators: A tool for mitigating technological risk

JustinShovelain21 Jan 2020 13:07 UTC

13 points

4 comments4 min readLW link

Intuitive supergoal uncertainty

JustinShovelain4 Dec 2009 5:21 UTC

11 points

27 comments5 min readLW link

Meetup: Bay Area: Sunday, March 7th, 7pm

JustinShovelain2 Mar 2010 21:18 UTC

8 points

44 comments1 min readLW link

Minneapolis Meetup: Saturday May 14, 3:00PM

JustinShovelain13 May 2011 21:14 UTC

8 points

5 comments1 min readLW link