JustinShovelain

Karma: 539

I am the co-founder of and a researcher at the quantitative long-term strategy organization Convergence (see here for our growing list of publications). Over the last fourteen years I have worked with MIRI, CFAR, EA Global, and Founders Fund, and have done work in EA strategy, fundraising, networking, teaching, cognitive enhancement, and AI safety research. I have an MS degree in computer science and BS degrees in computer science, mathematics, and physics.

Information-Theoretic Boxing of Superintelligences

JustinShovelain and Elliot_Mckernon

30 Nov 2023 14:31 UTC

30 points

0 comments · 7 min read · LW link

The risk-reward tradeoff of interpretability research

JustinShovelain and Elliot_Mckernon

5 Jul 2023 17:05 UTC

15 points

1 comment · 6 min read · LW link

Aligning AI by optimizing for “wisdom”

JustinShovelain and Elliot_Mckernon

27 Jun 2023 15:20 UTC

22 points

7 comments · 12 min read · LW link

Improving the safety of AI evals

JustinShovelain and Elliot_Mckernon

17 May 2023 22:24 UTC

13 points

7 comments · 7 min read · LW link

Keep humans in the loop

JustinShovelain and Elliot_Mckernon

19 Apr 2023 15:34 UTC

22 points

1 comment · 10 min read · LW link

Updating Utility Functions

JustinShovelain and Joar Skalse

9 May 2022 9:44 UTC

37 points

6 comments · 8 min read · LW link

Goodhart’s Law Causal Diagrams

JustinShovelain and Jeremy Gillen

11 Apr 2022 13:52 UTC

32 points

5 comments · 6 min read · LW link

How Money Fails to Track Value

JustinShovelain

2 Apr 2022 12:32 UTC

17 points

0 comments · 5 min read · LW link

Evaluating expertise: a clear box model

JustinShovelain

15 Oct 2020 14:18 UTC

36 points

3 comments · 5 min read · LW link

Good and bad ways to think about downside risks

MichaelA and JustinShovelain

11 Jun 2020 1:38 UTC

18 points

12 comments · 11 min read · LW link

COVID-19: An opportunity to help by modelling testing and tracing to inform the UK government

JustinShovelain

17 Apr 2020 17:21 UTC

14 points

2 comments · 2 min read · LW link

[Question] Testing and contact tracing impact assessment model?

JustinShovelain

9 Apr 2020 17:42 UTC

6 points

3 comments · 1 min read · LW link

COVID-19: List of ideas to reduce the direct harm from the virus, with an emphasis on unusual ideas

JustinShovelain

9 Apr 2020 11:33 UTC

30 points

12 comments · 7 min read · LW link

Memetic downside risks: How ideas can evolve and cause harm

MichaelA, JustinShovelain and algekalipso

25 Feb 2020 19:47 UTC

21 points

3 comments · 15 min read · LW link

Information hazards: Why you should care and what you can do

MichaelA, JustinShovelain, David_Kristoffersson and algekalipso

23 Feb 2020 20:47 UTC

18 points

4 comments · 15 min read · LW link

Mapping downside risks and information hazards

MichaelA, JustinShovelain and David_Kristoffersson

20 Feb 2020 14:46 UTC

22 points

0 comments · 9 min read · LW link

Using vector fields to visualise preferences and make them consistent

MichaelA and JustinShovelain

28 Jan 2020 19:44 UTC

41 points

32 comments · 11 min read · LW link

AI alignment concepts: philosophical breakers, stoppers, and distorters

JustinShovelain

24 Jan 2020 19:23 UTC

20 points

3 comments · 3 min read · LW link

Safety regulators: A tool for mitigating technological risk

JustinShovelain

21 Jan 2020 13:07 UTC

13 points

4 comments · 4 min read · LW link

FAI Research Constraints and AGI Side Effects

JustinShovelain

3 Jun 2015 19:25 UTC

27 points

59 comments · 7 min read · LW link