Convergence Towards World-Models: A Gears-Level Model

Thane Ruthenis Aug 4, 2022, 11:31 PM

38 points

1 comment 13 min read LW link

Cambist Booking

Screwtape Aug 4, 2022, 10:40 PM

20 points

3 comments 4 min read LW link

Calibration Trivia

Screwtape Aug 4, 2022, 10:31 PM

12 points

9 comments 4 min read LW link

Monthly Shorts 7/22

Celer Aug 4, 2022, 10:30 PM

5 points

0 comments 3 min read LW link

(keller.substack.com)

The Pragmascope Idea

johnswentworth Aug 4, 2022, 9:52 PM

59 points

20 comments 3 min read LW link

Running a Basic Meetup

Screwtape Aug 4, 2022, 9:49 PM

21 points

1 comment 2 min read LW link

Fiber arts, mysterious dodecahedrons, and waiting on “Eureka!”

eukaryote Aug 4, 2022, 8:37 PM

125 points

15 comments 9 min read LW link 1 review

(eukaryotewritesblog.com)

[Question] Would “Manhattan Project” style be beneficial or deleterious for AI Alignment?

Valentin2026 Aug 4, 2022, 7:12 PM

5 points

1 comment 1 min read LW link

[Question] AI alignment: Would a lazy self-preservation instinct be sufficient?

BrainFrog Aug 4, 2022, 5:53 PM

−1 points

4 comments 1 min read LW link

Socratic Ducking, OODA Loops, Frame-by-Frame Debugging

CFAR!Duncan Aug 4, 2022, 5:44 PM

26 points

1 comment 5 min read LW link

What do ML researchers think about AI in 2022?

KatjaGrace Aug 4, 2022, 3:40 PM

221 points

33 comments 3 min read LW link

(aiimpacts.org)

Interpretability isn’t Free

Joel Burget Aug 4, 2022, 3:02 PM

12 points

1 comment 2 min read LW link

Covid 8/4/22: Rebound

Zvi Aug 4, 2022, 11:20 AM

36 points

0 comments 11 min read LW link

(thezvi.wordpress.com)

High Reliability Orgs, and AI Companies

Raemon Aug 4, 2022, 5:45 AM

86 points

7 comments 12 min read LW link 1 review

Surprised by ELK report’s counterexample to Debate, IDA

Evan R. Murphy Aug 4, 2022, 2:12 AM

18 points

0 comments 5 min read LW link

Clapping Lower

jefftk Aug 4, 2022, 2:10 AM

38 points

7 comments 1 min read LW link

(www.jefftk.com)

[Question] How do I know if my first post should be a post, or a question?

Nathan1123 Aug 4, 2022, 1:46 AM

3 points

4 comments 1 min read LW link

Three pillars for avoiding AGI catastrophe: Technical alignment, deployment decisions, and coordination

LintzA Aug 3, 2022, 11:15 PM

24 points

0 comments 11 min read LW link

Precursor checking for deceptive alignment

evhub Aug 3, 2022, 10:56 PM

24 points

0 comments 14 min read LW link

Transformer language models are doing something more general

Numendil Aug 3, 2022, 9:13 PM

53 points

6 comments 2 min read LW link

[Question] Some doubts about Non Superintelligent AIs

aditya malik Aug 3, 2022, 7:55 PM

0 points

4 comments 1 min read LW link

Announcing Squiggle: Early Access

ozziegooen Aug 3, 2022, 7:48 PM

51 points

7 comments 7 min read LW link

(forum.effectivealtruism.org)

Survey: What (de)motivates you about AI risk?

Daniel_Friedrich Aug 3, 2022, 7:17 PM

1 point

0 comments 1 min read LW link

(forms.gle)

Externalized reasoning oversight: a research direction for language model alignment

tamera Aug 3, 2022, 12:03 PM

136 points

23 comments 6 min read LW link

Open & Welcome Thread—Aug/Sep 2022

Thomas Aug 3, 2022, 10:22 AM

9 points

32 comments 1 min read LW link

[Question] How does one recognize information and differentiate it from noise?

M. Y. Zuo Aug 3, 2022, 3:57 AM

4 points

29 comments 1 min read LW link

Law-Following AI 4: Don’t Rely on Vicarious Liability

Cullen Aug 2, 2022, 11:26 PM

5 points

2 comments 3 min read LW link

Two-year update on my personal AI timelines

Ajeya Cotra Aug 2, 2022, 11:07 PM

293 points

60 comments 16 min read LW link

What are the Red Flags for Neural Network Suffering? - Seeds of Science call for reviewers

rogersbacon Aug 2, 2022, 10:37 PM

24 points

6 comments 1 min read LW link

Againstness

CFAR!Duncan Aug 2, 2022, 7:29 PM

50 points

8 comments 9 min read LW link

(Summary) Sequence Highlights—Thinking Better on Purpose

qazzquimby Aug 2, 2022, 5:45 PM

33 points

3 comments 11 min read LW link

Progress links and tweets, 2022-08-02

jasoncrawford Aug 2, 2022, 5:03 PM

9 points

0 comments 1 min read LW link

(rootsofprogress.org)

[Question] I want to donate some money (not much, just what I can afford) to AGI Alignment research, to whatever organization has the best chance of making sure that AGI goes well and doesn’t kill us all. What are my best options, where can I make the most difference per dollar?

lumenwrites Aug 2, 2022, 12:08 PM

15 points

9 comments 1 min read LW link

Thinking without priors?

Q Home Aug 2, 2022, 9:17 AM

7 points

0 comments 9 min read LW link

[Question] Would quantum immortality mean subjective immortality?

n0ah Aug 2, 2022, 4:54 AM

2 points

10 comments 1 min read LW link

Turbocharging

CFAR!Duncan Aug 2, 2022, 12:01 AM

52 points

5 comments 9 min read LW link

Letter from leading Soviet Academicians to party and government leaders of the Soviet Union regarding signs of decline and structural problems of the economic-political system (1970)

M. Y. Zuo Aug 1, 2022, 10:35 PM

20 points

10 comments 16 min read LW link

Technical AI Alignment Study Group

Eric K Aug 1, 2022, 6:33 PM

5 points

0 comments 1 min read LW link

[Question] Is there any writing about prompt engineering for humans?

Alex Hollow Aug 1, 2022, 12:52 PM

18 points

8 comments 1 min read LW link

Meditation course claims 65% enlightenment rate: my review

KatWoods Aug 1, 2022, 11:25 AM

111 points

35 comments 14 min read LW link

[Question] Which intro-to-AI-risk text would you recommend to...

Sherrinford Aug 1, 2022, 9:36 AM

12 points

1 comment 1 min read LW link

Polaris, Five-Second Versions, and Thought Lengths

CFAR!Duncan Aug 1, 2022, 7:14 AM

50 points

12 comments 8 min read LW link

A Word is Worth 1,000 Pictures

Kully Aug 1, 2022, 4:08 AM

1 point

0 comments 2 min read LW link

On akrasia: starting at the bottom

seecrow Aug 1, 2022, 4:08 AM

37 points

2 comments 3 min read LW link

[Question] How likely do you think worse-than-extinction type fates to be?

span1 Aug 1, 2022, 4:08 AM

3 points

3 comments 1 min read LW link

Abstraction sacrifices causal clarity

Marv K Jul 31, 2022, 7:24 PM

2 points

0 comments 3 min read LW link

Time-logging programs and/or spreadsheets (2022)

mikbp Jul 31, 2022, 6:18 PM

3 points

3 comments 1 min read LW link

Conservatism is a rational response to epistemic uncertainty

contrarianbrit Jul 31, 2022, 6:04 PM

2 points

11 comments 9 min read LW link

(thomasprosser.substack.com)

South Bay ACX/LW Meetup

IS Jul 31, 2022, 3:30 PM

2 points

0 comments 1 min read LW link

Perverse Independence Incentives

jefftk Jul 31, 2022, 2:40 PM

61 points

3 comments 1 min read LW link

(www.jefftk.com)