All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 282930

Petrov Day Retrospective: 2022

Ruby28 Sep 2022 22:16 UTC

108 points

41 comments4 min readLW link

Estimating the Current and Future Number of AI Safety Researchers

Stephen McAleese28 Sep 2022 21:11 UTC

50 points

15 comments9 min readLW link

(forum.effectivealtruism.org)

Progress links and tweets, 2022-09-28

jasoncrawford28 Sep 2022 20:26 UTC

13 points

1 comment1 min readLW link

(rootsofprogress.org)

EA & LW Forums Weekly Summary (19 − 25 Sep 22′)

Zoe Williams28 Sep 2022 20:18 UTC

16 points

2 comments19 min readLW link

LOVE in a simbox is all you need

jacob_cannell28 Sep 2022 18:25 UTC

66 points

73 comments44 min readLW link 1 review

A Library and Tutorial for Factored Cognition with Language Models

stuhlmueller, justin_dan and goodgravy

28 Sep 2022 18:15 UTC

47 points

0 comments1 min readLW link

Reward IS the Optimization Target

Carn28 Sep 2022 17:59 UTC

−2 points

3 comments5 min readLW link

AI Safety Endgame Stories

Ivan Vendrov28 Sep 2022 16:58 UTC

31 points

11 comments11 min readLW link

Will Values and Competition Decouple?

interstice28 Sep 2022 16:27 UTC

19 points

11 comments17 min readLW link

Georgism in Space

harsimony28 Sep 2022 16:05 UTC

42 points

12 comments4 min readLW link

(harsimony.wordpress.com)

QAPR 3: interpretability-guided training of neural nets

Quintin Pope28 Sep 2022 16:02 UTC

58 points

2 comments10 min readLW link

Strange Loops—Self-Reference from Number Theory to AI

ojorgensen28 Sep 2022 14:10 UTC

20 points

6 comments18 min readLW link

Why I think strong general AI is coming soon

porby28 Sep 2022 5:40 UTC

344 points

141 comments34 min readLW link 1 review

About Q Home

Q Home28 Sep 2022 4:56 UTC

15 points

4 comments1 min readLW link

[Linkpost] “Intensity and frequency of extreme novel epidemics” by Mariani et al. (2021)

T43128 Sep 2022 3:31 UTC

10 points

0 comments2 min readLW link

(pubmed.ncbi.nlm.nih.gov)

Threat-Resistant Bargaining Megapost: Introducing the ROSE Value

Diffractor28 Sep 2022 1:20 UTC

168 points

21 comments53 min readLW link 2 reviews

7 traps that (we think) new alignment researchers often fall into

Orpheus16 and Thomas Larsen

27 Sep 2022 23:13 UTC

180 points

10 comments4 min readLW link

Failure modes in a shard theory alignment plan

Thomas Kwa27 Sep 2022 22:34 UTC

26 points

2 comments7 min readLW link

[Question] Is a PhD necessary to contribute meaningfully to a field?

TrudosKudos27 Sep 2022 21:27 UTC

4 points

7 comments1 min readLW link

Why we’re not founding a human-data-for-alignment org

L Rudolf L and Matt Putz

27 Sep 2022 20:14 UTC

88 points

6 comments29 min readLW link

(forum.effectivealtruism.org)

A Poorly Planned Loft Bed

jefftk27 Sep 2022 17:50 UTC

9 points

2 comments1 min readLW link

(www.jefftk.com)

Wise Crowd & Democratic Spirit

Hristo Zaykov27 Sep 2022 17:45 UTC

1 point

0 comments2 min readLW link

(www.hristo.blog)

Soft skills for meetups

mingyuan27 Sep 2022 17:26 UTC

51 points

3 comments5 min readLW link

[Question] Enriching Youtube content recommendations

Martín Soto27 Sep 2022 16:54 UTC

9 points

4 comments1 min readLW link

The Onion Test for Personal and Institutional Honesty

chanamessinger and Andrew_Critch

27 Sep 2022 15:26 UTC

173 points

32 comments3 min readLW link 3 reviews

Book review: “The Heart of the Brain: The Hypothalamus and Its Hormones”

Steven Byrnes27 Sep 2022 13:20 UTC

66 points

3 comments18 min readLW link

My Thoughts on the ML Safety Course

zeshen27 Sep 2022 13:15 UTC

50 points

3 comments17 min readLW link

Summary of ML Safety Course

zeshen27 Sep 2022 13:05 UTC

7 points

0 comments6 min readLW link

Probabilistic reasoning for description and experience

Q Home27 Sep 2022 10:57 UTC

0 points

0 comments26 min readLW link

A Prince, a Pauper, Power, Panama

Alok Singh27 Sep 2022 7:10 UTC

10 points

0 comments1 min readLW link

(alok.github.io)

Double Asteroid Redirection Test succeeds

sanxiyn27 Sep 2022 6:37 UTC

19 points

5 comments1 min readLW link

(twitter.com)

[Question] How would I know if a PhD is the right career path?

Bob Guran27 Sep 2022 5:49 UTC

4 points

4 comments1 min readLW link

Review of Examine.com’s vitamin write-ups

Elizabeth and Martin Bernstorff

26 Sep 2022 23:40 UTC

60 points

1 comment5 min readLW link

(acesounderglass.com)

D&D.Sci September 2022 Evaluation and Ruleset

abstractapplic26 Sep 2022 22:19 UTC

30 points

5 comments3 min readLW link

[MLSN #5]: Prize Compilation

Dan H26 Sep 2022 21:55 UTC

15 points

1 comment2 min readLW link

Loss of Alignment is not the High-Order Bit for AI Risk

yieldthought26 Sep 2022 21:16 UTC

14 points

18 comments2 min readLW link

Inverse Scaling Prize: Round 1 Winners

Ethan Perez and Ian McKenzie

26 Sep 2022 19:57 UTC

93 points

16 comments4 min readLW link

(irmckenzie.co.uk)

[Question] Does the existence of shared human values imply alignment is “easy”?

Morpheus26 Sep 2022 18:01 UTC

7 points

15 comments1 min readLW link

Meetup: Madison, WI (Oct 8)

svfritz26 Sep 2022 17:55 UTC

1 point

0 comments1 min readLW link

Ambiguity in Prediction Market Resolution is Harmful

aphyer26 Sep 2022 16:22 UTC

69 points

17 comments5 min readLW link

Framery Phone Booth CO2 Accumulation

jefftk26 Sep 2022 16:10 UTC

25 points

0 comments1 min readLW link

(www.jefftk.com)

[Question] How can I remove the launch button from my LW home page?

sudo26 Sep 2022 15:15 UTC

8 points

4 comments1 min readLW link

Brief Notes on Transformers

Adam Jermyn26 Sep 2022 14:46 UTC

48 points

3 comments2 min readLW link

You are Underestimating The Likelihood That Convergent Instrumental Subgoals Lead to Aligned AGI

Mark Neyer26 Sep 2022 14:22 UTC

3 points

6 comments3 min readLW link

Climate-contingent Finance, and A Generalized Mechanism for X-Risk Reduction Financing

John Nay26 Sep 2022 13:23 UTC

0 points

2 comments26 min readLW link

Self-Control Secrets of the Puritan Masters

David Hugh-Jones26 Sep 2022 9:04 UTC

68 points

3 comments5 min readLW link

(wyclif.substack.com)

How I buy things when Lightcone wants them fast

Bird Concept26 Sep 2022 5:02 UTC

240 points

21 comments8 min readLW link

Oren’s Field Guide of Bad AGI Outcomes

Eris Discordia26 Sep 2022 4:06 UTC

0 points

0 comments1 min readLW link

On Generality

Eris Discordia26 Sep 2022 4:06 UTC

2 points

0 comments5 min readLW link

Planning a Loft Bed

jefftk26 Sep 2022 0:10 UTC

15 points

15 comments2 min readLW link

(www.jefftk.com)