All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 2930

Distribution Shifts and The Importance of AI Safety

Leon Lang29 Sep 2022 22:38 UTC

17 points

2 comments9 min readLW link

Clarifying the Agent-Like Structure Problem

johnswentworth29 Sep 2022 21:28 UTC

64 points

19 comments6 min readLW link

Where I currently disagree with Ryan Greenblatt’s version of the ELK approach

So8res29 Sep 2022 21:18 UTC

65 points

7 comments5 min readLW link

It matters when the first sharp left turn happens

Adam Jermyn29 Sep 2022 20:12 UTC

45 points

9 comments4 min readLW link

Covid 9/29/22: The Jones Act Waver

Zvi29 Sep 2022 18:20 UTC

47 points

10 comments24 min readLW link

(thezvi.wordpress.com)

High-Impact Psychology (HIPsy): Piloting a Global Network

Inga G.29 Sep 2022 18:16 UTC

8 points

0 comments5 min readLW link

Unit Test Everything

DirectedEvolution29 Sep 2022 18:12 UTC

30 points

0 comments8 min readLW link

Builder/Breaker for Deconfusion

abramdemski29 Sep 2022 17:36 UTC

73 points

9 comments9 min readLW link

[Question] Resources to find/register the rationalists that specialize in a given topic?

tailcalled29 Sep 2022 17:20 UTC

25 points

8 comments1 min readLW link

Make-A-Video by Meta AI

P.29 Sep 2022 17:07 UTC

9 points

4 comments1 min readLW link

(makeavideo.studio)

FDT is not directly comparable to CDT and EDT

SMK29 Sep 2022 14:42 UTC

53 points

8 comments11 min readLW link

[Link] “Improper Nouns” by siderea

Kenny29 Sep 2022 13:28 UTC

17 points

3 comments1 min readLW link

(siderea.dreamwidth.org)

Open application to become an AI safety project mentor

Charbel-Raphaël29 Sep 2022 11:27 UTC

10 points

0 comments1 min readLW link

(docs.google.com)

[Question] Should reasonably healthy people still take Paxlovid?

Sameerishere29 Sep 2022 3:41 UTC

15 points

2 comments1 min readLW link

Reflection on a Consulting Workshop

myutin29 Sep 2022 3:04 UTC

12 points

1 comment3 min readLW link

Better Construction Cost Estimates?

jefftk29 Sep 2022 2:30 UTC

12 points

4 comments2 min readLW link

(www.jefftk.com)

Petrov Day Retrospective: 2022

Ruby28 Sep 2022 22:16 UTC

108 points

41 comments4 min readLW link

Estimating the Current and Future Number of AI Safety Researchers

Stephen McAleese28 Sep 2022 21:11 UTC

50 points

15 comments9 min readLW link

(forum.effectivealtruism.org)

Progress links and tweets, 2022-09-28

jasoncrawford28 Sep 2022 20:26 UTC

13 points

1 comment1 min readLW link

(rootsofprogress.org)

EA & LW Forums Weekly Summary (19 − 25 Sep 22′)

Zoe Williams28 Sep 2022 20:18 UTC

16 points

2 comments19 min readLW link

LOVE in a simbox is all you need

jacob_cannell28 Sep 2022 18:25 UTC

66 points

73 comments44 min readLW link 1 review

A Library and Tutorial for Factored Cognition with Language Models

stuhlmueller, justin_dan and goodgravy

28 Sep 2022 18:15 UTC

47 points

0 comments1 min readLW link

Reward IS the Optimization Target

Carn28 Sep 2022 17:59 UTC

−2 points

3 comments5 min readLW link

AI Safety Endgame Stories

Ivan Vendrov28 Sep 2022 16:58 UTC

31 points

11 comments11 min readLW link

Will Values and Competition Decouple?

interstice28 Sep 2022 16:27 UTC

19 points

11 comments17 min readLW link

Georgism in Space

harsimony28 Sep 2022 16:05 UTC

42 points

12 comments4 min readLW link

(harsimony.wordpress.com)

QAPR 3: interpretability-guided training of neural nets

Quintin Pope28 Sep 2022 16:02 UTC

58 points

2 comments10 min readLW link

Strange Loops—Self-Reference from Number Theory to AI

ojorgensen28 Sep 2022 14:10 UTC

20 points

6 comments18 min readLW link

Why I think strong general AI is coming soon

porby28 Sep 2022 5:40 UTC

344 points

141 comments34 min readLW link 1 review

About Q Home

Q Home28 Sep 2022 4:56 UTC

15 points

4 comments1 min readLW link

[Linkpost] “Intensity and frequency of extreme novel epidemics” by Mariani et al. (2021)

T43128 Sep 2022 3:31 UTC

10 points

0 comments2 min readLW link

(pubmed.ncbi.nlm.nih.gov)

Threat-Resistant Bargaining Megapost: Introducing the ROSE Value

Diffractor28 Sep 2022 1:20 UTC

168 points

21 comments53 min readLW link 2 reviews

7 traps that (we think) new alignment researchers often fall into

Orpheus16 and Thomas Larsen

27 Sep 2022 23:13 UTC

180 points

10 comments4 min readLW link

Failure modes in a shard theory alignment plan

Thomas Kwa27 Sep 2022 22:34 UTC

26 points

2 comments7 min readLW link

[Question] Is a PhD necessary to contribute meaningfully to a field?

TrudosKudos27 Sep 2022 21:27 UTC

4 points

7 comments1 min readLW link

Why we’re not founding a human-data-for-alignment org

L Rudolf L and Matt Putz

27 Sep 2022 20:14 UTC

88 points

6 comments29 min readLW link

(forum.effectivealtruism.org)

A Poorly Planned Loft Bed

jefftk27 Sep 2022 17:50 UTC

9 points

2 comments1 min readLW link

(www.jefftk.com)

Wise Crowd & Democratic Spirit

Hristo Zaykov27 Sep 2022 17:45 UTC

1 point

0 comments2 min readLW link

(www.hristo.blog)

Soft skills for meetups

mingyuan27 Sep 2022 17:26 UTC

51 points

3 comments5 min readLW link

[Question] Enriching Youtube content recommendations

Martín Soto27 Sep 2022 16:54 UTC

9 points

4 comments1 min readLW link

The Onion Test for Personal and Institutional Honesty

chanamessinger and Andrew_Critch

27 Sep 2022 15:26 UTC

173 points

32 comments3 min readLW link 3 reviews

Book review: “The Heart of the Brain: The Hypothalamus and Its Hormones”

Steven Byrnes27 Sep 2022 13:20 UTC

66 points

3 comments18 min readLW link

My Thoughts on the ML Safety Course

zeshen27 Sep 2022 13:15 UTC

50 points

3 comments17 min readLW link

Summary of ML Safety Course

zeshen27 Sep 2022 13:05 UTC

7 points

0 comments6 min readLW link

Probabilistic reasoning for description and experience

Q Home27 Sep 2022 10:57 UTC

0 points

0 comments26 min readLW link

A Prince, a Pauper, Power, Panama

Alok Singh27 Sep 2022 7:10 UTC

10 points

0 comments1 min readLW link

(alok.github.io)

Double Asteroid Redirection Test succeeds

sanxiyn27 Sep 2022 6:37 UTC

19 points

5 comments1 min readLW link

(twitter.com)

[Question] How would I know if a PhD is the right career path?

Bob Guran27 Sep 2022 5:49 UTC

4 points

4 comments1 min readLW link

Review of Examine.com’s vitamin write-ups

Elizabeth and Martin Bernstorff

26 Sep 2022 23:40 UTC

60 points

1 comment5 min readLW link

(acesounderglass.com)

D&D.Sci September 2022 Evaluation and Ruleset

abstractapplic26 Sep 2022 22:19 UTC

30 points

5 comments3 min readLW link