Archive
LW/ACX Social Meetup · Stefan · Mar 12, 2025, 11:13 PM · 2 points · 0 comments · 1 min read
I grade every NBA basketball game I watch based on enjoyability · proshowersinger · Mar 12, 2025, 9:46 PM · 24 points · 2 comments · 4 min read
Kairos is hiring a Head of Operations/Founding Generalist · agucova · Mar 12, 2025, 8:58 PM · 6 points · 0 comments
USAID Outlook: A Metaculus Forecasting Series · ChristianWilliams · Mar 12, 2025, 8:34 PM · 9 points · 0 comments · (www.metaculus.com)
What is instrumental convergence? · Vishakha and Algon · Mar 12, 2025, 8:28 PM · 2 points · 0 comments · 2 min read · (aisafety.info)
Revising Stages-Oversight Reveals Greater Situational Awareness in LLMs · Sanyu Rajakumar · Mar 12, 2025, 5:56 PM · 16 points · 0 comments · 13 min read
Why Obedient AI May Be the Real Catastrophe · G~ · Mar 12, 2025, 5:50 PM · 5 points · 2 comments · 3 min read
Your Communication Preferences Aren’t Law · Jonathan Moregård · Mar 12, 2025, 5:20 PM · 25 points · 4 comments · 1 min read · (honestliving.substack.com)
Reflections on Neuralese · Alice Blair · Mar 12, 2025, 4:29 PM · 28 points · 0 comments · 5 min read
Field tests of semi-rationality in Brazilian military training · P. João · Mar 12, 2025, 4:14 PM · 31 points · 0 comments · 2 min read
Many life-saving drugs fail for lack of funding. But there’s a solution: desperate rich people · Mvolz · Mar 12, 2025, 3:24 PM · 17 points · 0 comments · 1 min read · (www.theguardian.com)
The Most Forbidden Technique · Zvi · Mar 12, 2025, 1:20 PM · 143 points · 9 comments · 17 min read · (thezvi.wordpress.com)
You don’t actually need a physical multiverse to explain anthropic fine-tuning. · Fraser · Mar 12, 2025, 7:33 AM · 7 points · 8 comments · 3 min read · (frvser.com)
AI Can’t Write Good Fiction · JustisMills · Mar 12, 2025, 6:11 AM · 38 points · 24 comments · 7 min read · (justismills.substack.com)
Existing UDTs test the limits of Bayesianism (and consistency) · Cole Wyeth · Mar 12, 2025, 4:09 AM · 28 points · 21 comments · 7 min read
(Anti)Aging 101 · George3d6 · Mar 12, 2025, 3:59 AM · 5 points · 2 comments · 3 min read · (cerebralab.com)
The Grapes of Hardness · adamShimi · Mar 11, 2025, 9:01 PM · 8 points · 0 comments · 5 min read · (formethods.substack.com)
Don’t over-update on FrontierMath results · David Matolcsi · Mar 11, 2025, 8:44 PM · 51 points · 7 comments · 9 min read
Response to Scott Alexander on Imprisonment · Zvi · Mar 11, 2025, 8:40 PM · 40 points · 4 comments · 9 min read · (thezvi.wordpress.com)
Paths and waystations in AI safety · Joe Carlsmith · Mar 11, 2025, 6:52 PM · 41 points · 1 comment · 11 min read · (joecarlsmith.substack.com)
Meridian Cambridge Visiting Researcher Programme: Turn AI safety ideas into funded projects in one week! · Meridian Cambridge · Mar 11, 2025, 5:46 PM · 13 points · 0 comments · 2 min read
Elon Musk May Be Transitioning to Bipolar Type I · Cyborg25 · Mar 11, 2025, 5:45 PM · 83 points · 22 comments · 4 min read
Scaling AI Regulation: Realistically, what Can (and Can’t) Be Regulated? · Katalina Hernandez · Mar 11, 2025, 4:51 PM · 3 points · 1 comment · 3 min read
How Language Models Understand Nullability · Anish Tondwalkar and Alex Sanchez-Stern · Mar 11, 2025, 3:57 PM · 5 points · 0 comments · 2 min read · (dmodel.ai)
Forethought: a new AI macrostrategy group · Max Dalton, Tom Davidson, wdmacaskill and AmritSidhu-Brar · Mar 11, 2025, 3:39 PM · 18 points · 0 comments · 3 min read
Preparing for the Intelligence Explosion · fin and wdmacaskill · Mar 11, 2025, 3:38 PM · 78 points · 17 comments · 1 min read · (www.forethought.org)
stop solving problems that have already been solved · dhruvmethi · Mar 11, 2025, 3:30 PM · 10 points · 3 comments · 8 min read
AI Control May Increase Existential Risk · Jan_Kulveit · Mar 11, 2025, 2:30 PM · 98 points · 13 comments · 1 min read
When is it Better to Train on the Alignment Proxy? · dil-leik-og · Mar 11, 2025, 1:35 PM · 14 points · 0 comments · 9 min read
A different take on the Musk v OpenAI preliminary injunction order · TFD · Mar 11, 2025, 12:46 PM · 8 points · 0 comments · 20 min read · (www.thefloatingdroid.com)
Do reasoning models use their scratchpad like we do? Evidence from distilling paraphrases · Fabien Roger · Mar 11, 2025, 11:52 AM · 121 points · 23 comments · 11 min read · (alignment.anthropic.com)
A Hogwarts Guide to Citizenship · WillPetillo · Mar 11, 2025, 5:50 AM · 7 points · 1 comment · 3 min read
Cognitive Reframing—How to Overcome Negative Thought Patterns and Behaviors · Declan Molony · Mar 11, 2025, 4:56 AM · 11 points · 0 comments · 4 min read
Trojan Sky · Richard_Ngo · Mar 11, 2025, 3:14 AM · 245 points · 39 comments · 12 min read · (www.narrativeark.xyz)
OpenAI: Detecting misbehavior in frontier reasoning models · Daniel Kokotajlo · Mar 11, 2025, 2:17 AM · 183 points · 26 comments · 4 min read · (openai.com)
HPMOR Anniversary Parties: Coordination, Resources, and Discussion · Screwtape · Mar 11, 2025, 1:30 AM · 52 points · 6 comments · 7 min read
Positional kernels of attention heads · Alex Gibson · Mar 10, 2025, 11:17 PM · 9 points · 0 comments · 12 min read
Progress links and short notes, 2025-03-10 · jasoncrawford · Mar 10, 2025, 8:27 PM · 8 points · 0 comments · 4 min read · (newsletter.rootsofprogress.org)
The Manus Marketing Madness · Zvi · Mar 10, 2025, 8:10 PM · 54 points · 0 comments · 24 min read · (thezvi.wordpress.com)
You can just play · aswath krishnan · Mar 10, 2025, 8:00 PM · −5 points · 0 comments · 2 min read
How to Use Prompt Engineering to Rewire Your Brain · aswath krishnan · Mar 10, 2025, 8:00 PM · 1 point · 0 comments · 5 min read · (www.aswathkrishnan.com)
When Independent Optimization Is Worse Than Randomness · Chaotic rationalist · Mar 10, 2025, 7:46 PM · −4 points · 0 comments · 2 min read
Stress exists only where the Mind makes it · Noahh · Mar 10, 2025, 7:44 PM · 5 points · 2 comments · 4 min read
Counterargument to Godel’s Modal Ontological Argument · Wynn · Mar 10, 2025, 7:38 PM · −1 points · 0 comments · 4 min read
[Question] How much do frontier LLMs code and browse while in training? · Joe Rogero · Mar 10, 2025, 7:34 PM · 7 points · 0 comments · 1 min read
Observations on self-supervised Learning for vision · Dinkar Juyal · Mar 10, 2025, 7:31 PM · 3 points · 0 comments · 5 min read
Introducing 11 New AI Safety Organizations—Catalyze’s Winter 24/25 London Incubation Program Cohort · Alexandra Bos · Mar 10, 2025, 7:26 PM · 70 points · 0 comments
The Jackpot Jinx (or why “Superintelligence Strategy” is wrong) · E.G. Blee-Goldman · Mar 10, 2025, 7:18 PM · 13 points · 0 comments · 5 min read
Effective AI Outreach | A Data Driven Approach · NoahCWilson · Mar 10, 2025, 7:18 PM · 1 point · 0 comments · 15 min read
Emergent AI Society. Tasks, Scarcity, Talks · Andrey Seryakov · Mar 10, 2025, 7:18 PM · 1 point · 0 comments · 5 min read