Can Reasoning Models Avoid the Most Forbidden Technique?

Brendan Long 17 May 2025 23:26 UTC

9 points

9 comments 3 min read LW link

(www.brendanlong.com)

What OpenAI Told California’s Attorney General

garrison 17 May 2025 23:14 UTC

108 points

3 comments 8 min read LW link

(www.obsolete.pub)

Multipolar AI is Underrated

Allison Duettmann 17 May 2025 22:03 UTC

20 points

2 comments 16 min read LW link

[Question] Will we survive if AI solves engineering before deception?

Knight Lee 17 May 2025 19:22 UTC

21 points

14 comments 1 min read LW link

Seven ways to Improve the Internal Model Principle

Alfred Harwood 17 May 2025 16:38 UTC

15 points

0 comments 13 min read LW link

D&D.Sci: The Choosing Ones

abstractapplic 17 May 2025 15:26 UTC

49 points

17 comments 1 min read LW link

The absent-minded variations

dr_s 17 May 2025 6:57 UTC

24 points

13 comments 9 min read LW link

Book Review: The Art of Happiness

Screwtape 17 May 2025 4:56 UTC

37 points

23 comments 11 min read LW link

Management is the Near Future

jefftk 17 May 2025 2:50 UTC

61 points

10 comments 2 min read LW link

(www.jefftk.com)

Proof Section to an Introduction to Reinforcement Learning for Understanding Infra-Bayesianism

Brittany Gelb 17 May 2025 2:36 UTC

4 points

0 comments 9 min read LW link

An Introduction to Reinforcement Learning for Understanding Infra-Bayesianism

Brittany Gelb 17 May 2025 2:34 UTC

30 points

9 comments 20 min read LW link

Memory Decoding Journal Club: “Synaptic architecture of a memory engram in the mouse hippocampus.”

Devin Ward 16 May 2025 23:55 UTC

3 points

0 comments 1 min read LW link

Social anxiety isn’t about being liked

Chris Lakin 16 May 2025 22:26 UTC

151 points

21 comments 2 min read LW link

(chrislakin.blog)

Events: Debate & Fiction Project

abramdemski 16 May 2025 21:51 UTC

39 points

1 comment 1 min read LW link

How Fast Can Algorithms Advance Capabilities? | Epoch Gradient Update

henryj 16 May 2025 21:38 UTC

43 points

10 comments 6 min read LW link

(epoch.ai)

P-Values Know When You’re Cheating

Eggs 16 May 2025 20:34 UTC

21 points

2 comments 2 min read LW link

Minds are magic

k64 16 May 2025 19:10 UTC

2 points

1 comment 2 min read LW link

US-China trade talks should pave way for AI safety treaty [SCMP crosspost]

otto.barten 16 May 2025 16:55 UTC

10 points

0 comments 3 min read LW link

Direct Realism is probably false

TerriLeaf 16 May 2025 16:36 UTC

−3 points

19 comments 3 min read LW link

Regarding South Africa

Zvi 16 May 2025 16:10 UTC

71 points

5 comments 11 min read LW link

(thezvi.wordpress.com)

Notes on Consciousness

CSDD 16 May 2025 14:17 UTC

3 points

3 comments 1 min read LW link

reflecting on criticism

Vadim Golub 16 May 2025 11:59 UTC

4 points

5 comments 10 min read LW link

Generating the Funniest Joke with RL (according to GPT-4.1)

agg 16 May 2025 5:09 UTC

106 points

22 comments 4 min read LW link

Interpretable Fine Tuning Research Update and Working Prototype

Matthew Khoriaty 16 May 2025 3:44 UTC

14 points

0 comments 4 min read LW link

It Is Untenable That Near-Future AI Scenario Models Like “AI 2027” Don’t Include Open Source AI

Andrew Dickson 16 May 2025 2:20 UTC

37 points

19 comments 5 min read LW link

Apply to Visiting Fellows at Constellation, due June 13

Ella Markianos 16 May 2025 2:20 UTC

1 point

0 comments 2 min read LW link

Paranoid Debating

DresdenHeart 16 May 2025 2:20 UTC

1 point

0 comments 1 min read LW link

Bay Area Summer Solstice

VivaLaPanda and Andrew Keenan Richardson

16 May 2025 0:20 UTC

20 points

0 comments 1 min read LW link

Staying in a Capsule Hotel

jefftk 16 May 2025 0:20 UTC

25 points

2 comments 1 min read LW link

(www.jefftk.com)

Researching Synthetic Consciousness: sound appealing?

Brad Dunn 15 May 2025 22:29 UTC

10 points

1 comment 1 min read LW link

Starting Over: What to tell Sarah, at the edge of professional oblivion.

Brad Dunn 15 May 2025 21:34 UTC

11 points

1 comment 20 min read LW link

Tax-Optimized Risk in Portfolio Allocation

Brendan Long 15 May 2025 18:53 UTC

6 points

0 comments 1 min read LW link

(www.brendanlong.com)

AI Safety Thursdays: Understanding The Self-Other Overlap Approach

Juliana Eberschlag 15 May 2025 18:41 UTC

2 points

0 comments 1 min read LW link

Some skepticism about skepticism about efficacy of pausing AI

extinction-bounties 15 May 2025 18:15 UTC

5 points

1 comment 2 min read LW link

time is event based

thiccythot 15 May 2025 18:07 UTC

65 points

1 comment 4 min read LW link

Consider Others’ Cost Tolerances

nomagicpill 15 May 2025 17:43 UTC

24 points

2 comments 4 min read LW link

(nomagicpill.github.io)

Problems with instruction-following as an alignment target

Seth Herd 15 May 2025 15:41 UTC

56 points

14 comments 10 min read LW link

AI #116: If Anyone Builds It, Everyone Dies

Zvi 15 May 2025 15:10 UTC

47 points

5 comments 42 min read LW link

(thezvi.wordpress.com)

Counter-considerations on AI arms races

Mateusz Bagiński and JustinShovelain

15 May 2025 14:54 UTC

24 points

0 comments 18 min read LW link

AlphaEvolve

mannatvjain 15 May 2025 14:14 UTC

29 points

0 comments 5 min read LW link

(deepmind.google)

From Comments on Accountability Sinks

Martin Sustrik 15 May 2025 10:20 UTC

15 points

2 comments 7 min read LW link

(250bpm.substack.com)

What Does It Mean to “Write Like You Talk”?

Arjun Panickssery 15 May 2025 9:49 UTC

69 points

8 comments 5 min read LW link

(arjunpanickssery.substack.com)

What if Agent-4 breaks out?

Alvin Ånestrand 15 May 2025 9:15 UTC

12 points

0 comments 6 min read LW link

Memory Decoding Journal Club: Synaptic architecture of a memory engram in the mouse hippocampus

Devin Ward 15 May 2025 4:14 UTC

1 point

0 comments 1 min read LW link

[Question] Why OpenAI projects only $174B of revenue by 2030?

becausecurious 15 May 2025 2:50 UTC

28 points

6 comments 1 min read LW link

Elastomeric Fitting Session

jefftk 15 May 2025 1:50 UTC

15 points

4 comments 2 min read LW link

(www.jefftk.com)

Re SMTM: negative feedback on negative feedback

Steven Byrnes 14 May 2025 19:50 UTC

60 points

1 comment 22 min read LW link

Curate your space

Logan Kieller 14 May 2025 19:35 UTC

23 points

0 comments 3 min read LW link

(agenticconjectures.substack.com)

Eliezer and I wrote a book: If Anyone Builds It, Everyone Dies

So8res 14 May 2025 19:00 UTC

655 points

144 comments 2 min read LW link

Notes on Life

CSDD 14 May 2025 18:46 UTC

−1 points

0 comments 5 min read LW link