All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb Mar Apr May Jun Jul Aug Sep Oct NovDec

All 1 2 345 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

A Qualitative Case for LTFF: Filling Critical Ecosystem Gaps

Linch3 Dec 2024 21:57 UTC

64 points

2 comments9 min readLW link

Deep Causal Transcoding: A Framework for Mechanistically Eliciting Latent Behaviors in Language Models

Andrew Mack and TurnTrout

3 Dec 2024 21:19 UTC

109 points

8 comments41 min readLW link

“Alignment at Large”: Bending the Arc of History Towards Life-Affirming Futures

welfvh3 Dec 2024 21:17 UTC

6 points

0 comments4 min readLW link

Roots of Progress is hiring an event manager

jasoncrawford3 Dec 2024 20:46 UTC

10 points

0 comments7 min readLW link

(rootsofprogress.notion.site)

Do simulacra dream of digital sheep?

EuanMcLean3 Dec 2024 20:25 UTC

16 points

36 comments10 min readLW link

Orca communication project—seeking feedback (and collaborators)

Towards_Keeperhood3 Dec 2024 17:29 UTC

38 points

16 comments2 min readLW link

Book a Time to Chat about Interp Research

Logan Riggs3 Dec 2024 17:27 UTC

47 points

3 comments1 min readLW link

Balsa Research 2024 Update

Zvi3 Dec 2024 12:30 UTC

21 points

0 comments5 min readLW link

(thezvi.wordpress.com)

First Solo Bus Ride

jefftk3 Dec 2024 12:20 UTC

28 points

1 comment1 min readLW link

(www.jefftk.com)

How to make evals for the AISI evals bounty

TheManxLoiner3 Dec 2024 10:44 UTC

9 points

0 comments5 min readLW link

Should there be just one western AGI project?

rosehadshar and Tom Davidson

3 Dec 2024 10:11 UTC

78 points

75 comments15 min readLW link

(www.forethought.org)

Cognitive Biases Contributing to AI X-risk — a deleted excerpt from my 2018 ARCHES draft

Andrew_Critch3 Dec 2024 9:29 UTC

48 points

2 comments5 min readLW link

[Question] What is your opinion of Dr. Angelo Dilullo(meditation)?

Suh_Prance_Alot3 Dec 2024 5:54 UTC

0 points

2 comments1 min readLW link

MIRI’s 2024 End-of-Year Update

Rob Bensinger3 Dec 2024 4:33 UTC

99 points

2 comments4 min readLW link

Linkpost: Rat Traps by Sheon Han in Asterisk Mag

Chris_Leong3 Dec 2024 3:22 UTC

12 points

7 comments1 min readLW link

(asteriskmag.com)

[Question] Who are the worthwhile non-European pre-Industrial thinkers?

Lorec3 Dec 2024 1:45 UTC

12 points

4 comments1 min readLW link

A Paradox of Simulated Suffering

arusarda2 Dec 2024 23:44 UTC

−3 points

3 comments1 min readLW link

Levels of Thought: from Points to Fields

HNX2 Dec 2024 20:25 UTC

4 points

2 comments23 min readLW link

From Code to Managing: Why Being a ‘Force Multiplier’ Matters to Me More Than Being a Coding Wizard

cloak2 Dec 2024 20:10 UTC

−3 points

0 comments1 min readLW link

(www.reddit.com)

A case for donating to AI risk reduction (including if you work in AI)

tlevin2 Dec 2024 19:05 UTC

60 points

2 comments3 min readLW link

Fertility Roundup #4

Zvi2 Dec 2024 14:30 UTC

35 points

16 comments49 min readLW link

(thezvi.wordpress.com)

Conjecture: A Roadmap for Cognitive Software and A Humanist Future of AI

Connor Leahy and Gabriel Alfour

2 Dec 2024 13:28 UTC

50 points

10 comments29 min readLW link

(www.conjecture.dev)

2024 Unofficial LessWrong Census/Survey

Screwtape2 Dec 2024 5:30 UTC

103 points

51 comments1 min readLW link 2 reviews

Drexler’s Nanotech Software

PeterMcCluskey2 Dec 2024 4:55 UTC

67 points

9 comments4 min readLW link

(bayesianinvestor.com)

Sorry for the downtime, looks like we got DDosd

habryka2 Dec 2024 4:14 UTC

112 points

13 comments1 min readLW link

[Question] Is malice a real emotion?

landscape_kiwi1 Dec 2024 23:47 UTC

6 points

5 comments1 min readLW link

Teaching My Younger Self to Program: A case study of how I’d pass on my skill at self-learning

Shoshannah Tekofsky1 Dec 2024 21:05 UTC

26 points

1 comment7 min readLW link

(thinkfeelplay.substack.com)

[Question] Which Biases are most important to Overcome?

abstractapplic1 Dec 2024 15:40 UTC

35 points

26 comments1 min readLW link

Commenting Patterns by Platform

jefftk1 Dec 2024 11:50 UTC

12 points

0 comments1 min readLW link

(www.jefftk.com)

[Letter] Chinese Quickstart

lsusr1 Dec 2024 6:38 UTC

33 points

3 comments5 min readLW link

AXRP Episode 39 - Evan Hubinger on Model Organisms of Misalignment

DanielFilan1 Dec 2024 6:00 UTC

41 points

0 comments67 min readLW link

Magnitudes: Let’s Comprehend the Incomprehensible!

joec1 Dec 2024 3:08 UTC

23 points

10 comments3 min readLW link

[Question] Why does ChatGPT throw an error when outputting “David Mayer”?

Archimedes1 Dec 2024 0:11 UTC

6 points

9 comments1 min readLW link

Introducing the Anthropic Fellows Program

Miranda Zhang and Ethan Perez

30 Nov 2024 23:47 UTC

27 points

0 comments4 min readLW link

(alignment.anthropic.com)

The Shape of Heaven

edgecase6430 Nov 2024 23:38 UTC

16 points

1 comment5 min readLW link

AI Training Opt-Outs Reinforce Global Power Asymmetries

kushagra30 Nov 2024 22:08 UTC

3 points

0 comments6 min readLW link

Visual demonstration of Optimizer’s curse

Roman Malov30 Nov 2024 19:34 UTC

26 points

3 comments7 min readLW link

CAIDP Statement on Lethal Autonomous Weapons Systems

Heramb30 Nov 2024 18:16 UTC

−1 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

Launching Applications for the Global AI Safety Fellowship 2025!

Aditya_SK30 Nov 2024 14:02 UTC

11 points

5 comments1 min readLW link

Exporting Facebook Comments, Again

jefftk30 Nov 2024 12:40 UTC

10 points

6 comments1 min readLW link

(www.jefftk.com)

Mathematical Futurology: From Pseudoscience to Rigorous Framework

Wenitte Apiou30 Nov 2024 3:27 UTC

−1 points

1 comment2 min readLW link

(The) Lightcone is nothing without its people: LW + Lighthaven’s big fundraiser

habryka30 Nov 2024 2:55 UTC

612 points

273 comments42 min readLW link

Sexual Selection as a Mesa-Optimizer

Lorec29 Nov 2024 23:34 UTC

3 points

0 comments37 min readLW link

INTELLECT-1 Release: The First Globally Trained 10B Parameter Model

Matrice Jacobine29 Nov 2024 23:05 UTC

16 points

1 comment1 min readLW link

(www.primeintellect.ai)

You should consider applying to PhDs (soon!)

bilalchughtai29 Nov 2024 20:33 UTC

115 points

19 comments6 min readLW link

Understanding Emergence in Large Language Models

egek9229 Nov 2024 19:42 UTC

3 points

1 comment2 min readLW link

I’m a rationalist but....

ninney29 Nov 2024 19:41 UTC

−19 points

0 comments1 min readLW link

The ‘Road Not Taken’ in the Multiverse

Jonah Wilberg29 Nov 2024 19:01 UTC

2 points

0 comments7 min readLW link

(art) Optimism

KvmanThinking29 Nov 2024 16:21 UTC

−7 points

0 comments1 min readLW link

The Big Nonprofits Post

Zvi29 Nov 2024 16:10 UTC

120 points

10 comments45 min readLW link

(thezvi.wordpress.com)