All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 20252026

All Jan Feb Mar AprMayJun

All 1 2 3 4 5 6 7 8 9 101112 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

What can you do with barely any data?

ohmurphy10 May 2026 23:13 UTC

20 points

1 comment4 min readLW link

(ohmurphy.substack.com)

The Anti-Singularity

Logan Zoellner10 May 2026 22:33 UTC

11 points

7 comments4 min readLW link

Clarifying the role of the behavioral selection model

Alex Mallen10 May 2026 19:41 UTC

17 points

0 comments4 min readLW link

AI Alignment as Equilibrium Design

Elad Hazan10 May 2026 18:56 UTC

19 points

4 comments5 min readLW link

Claude Does Not Actually Taste Bananas: Potassium-Based Synthetic Phenomenology In Language Models

Noah Weinberger10 May 2026 17:13 UTC

8 points

2 comments10 min readLW link

(huggingface.co)

The Darwinian Honeymoon—Why I am not as impressed by human progress as I used to be

Elias Schmied10 May 2026 15:55 UTC

138 points

23 comments4 min readLW link

Reinforcement learning scaling might incentivise hidden reasoning architectures for AI

Oliver Sourbut10 May 2026 15:30 UTC

19 points

5 comments6 min readLW link

(www.oliversourbut.net)

Asymmetry Between Defensive and Acquisitive Instrumental Deception

keith_wynroe10 May 2026 12:33 UTC

17 points

1 comment5 min readLW link

Context Modification as a Negative Alignment Tax

Florian_Dietz10 May 2026 11:32 UTC

7 points

0 comments4 min readLW link

‘Who Let The Docs Out’ Is Awarding Up To $50K For 6 Doc Filmmakers During A LIVE Pitch Competition In LA! Application Deadline: May 19th

Max Hellier10 May 2026 11:08 UTC

1 point

0 comments1 min readLW link

(docsout.org)

[Question] Best Intro AI X-Risk Resource?

XelaP10 May 2026 11:03 UTC

12 points

3 comments2 min readLW link

Stockholm ACX Fika

Ave Mariekex10 May 2026 5:46 UTC

1 point

0 comments1 min readLW link

Control Debt

Ida Caspary10 May 2026 5:07 UTC

11 points

0 comments7 min readLW link

Sawtooth Problems

Alexander Slugworth10 May 2026 5:01 UTC

54 points

14 comments21 min readLW link

Could Frontier AI Researchers Collectively Slow the Race? A Conditional Pledge Mechanism

Cassandra Threshold10 May 2026 3:22 UTC

21 points

2 comments7 min readLW link

Somerville Porchfest 2026

jefftk10 May 2026 1:20 UTC

10 points

0 comments3 min readLW link

(www.jefftk.com)

The AI Industrial Explosion — Part 2: Transition Dynamics

djbinder10 May 2026 1:02 UTC

23 points

0 comments12 min readLW link

(defensesindepth.bio)

The Goblins Are the Paperclips

Hisku9 May 2026 22:51 UTC

12 points

0 comments3 min readLW link

International Law Cannot Prevent Extinction Either

Sausage Vector Machine9 May 2026 22:34 UTC

102 points

16 comments5 min readLW link

Avoid alienating the marginal audience member

winfield9 May 2026 22:20 UTC

5 points

0 comments3 min readLW link

Do capabilities generalize across propensities?

Emil Ryd9 May 2026 21:39 UTC

25 points

0 comments8 min readLW link

Neural Networks learn Bloom Filters

Alex Gibson9 May 2026 20:32 UTC

57 points

1 comment12 min readLW link

Explaining Volition Without Resorting to Free Will

joseph_c9 May 2026 18:57 UTC

20 points

24 comments1 min readLW link

Second order thoughts on current AI agents

Michael Flood9 May 2026 18:40 UTC

14 points

0 comments2 min readLW link

If digital computers are conscious, they are conscious at the hardware level

cube_flipper9 May 2026 15:08 UTC

38 points

42 comments19 min readLW link

(smoothbrains.net)

Why You Can’t Use Your Right to Try

Stephen Martin9 May 2026 6:47 UTC

43 points

2 comments5 min readLW link

(x.com)

Does Opus 4.7 Generate Deceptive Denials About Its Own Guardrails?

usize9 May 2026 4:12 UTC

10 points

0 comments3 min readLW link

(usize.github.io)

Bad Problems Don’t Stop Being Bad Because Somebody’s Wrong About Fault Analysis

Linch9 May 2026 1:30 UTC

264 points

74 comments3 min readLW link

We Should Have Mandatory Media/Communications Training For All Communicators

Darren McKee8 May 2026 20:29 UTC

2 points

6 comments3 min readLW link

Chess as a prediction model of the artificial intelligence impact on culture

8498 May 2026 20:19 UTC

−12 points

1 comment5 min readLW link

(lojkine.art)

The Saturation View: some responses

wdmacaskill8 May 2026 17:32 UTC

25 points

6 comments8 min readLW link

Is ProgramBench Impossible?

frmsaul8 May 2026 17:04 UTC

83 points

11 comments2 min readLW link

Claude Code, Codex and Agentic Coding #8

Zvi8 May 2026 16:40 UTC

45 points

1 comment11 min readLW link

(thezvi.wordpress.com)

AI is Breaking Two Vulnerability Cultures

jefftk8 May 2026 15:50 UTC

78 points

0 comments2 min readLW link

(www.jefftk.com)

Please Be Serious

Oliver Kuperman8 May 2026 14:36 UTC

−11 points

15 comments2 min readLW link

Write Cause You Have Something to Say

Logan Riggs8 May 2026 13:36 UTC

37 points

5 comments2 min readLW link

Userland Alignment

Josh H8 May 2026 13:31 UTC

4 points

0 comments2 min readLW link

A benchmark is a sensor

Håvard Tveit Ihle and Mathias Bynke

8 May 2026 13:24 UTC

36 points

4 comments3 min readLW link

Bringing More Expertise to Bear on Alignment

Edmund Lau, Geoffrey Irving, Cameron Holmes and David Africa

8 May 2026 10:29 UTC

87 points

1 comment8 min readLW link

The Jailbroken Boy of Rushmore

jdcampolargo8 May 2026 6:29 UTC

24 points

0 comments10 min readLW link

Investigating the consequences of accidentally grading CoT during RL

papetoast8 May 2026 6:17 UTC

24 points

0 comments1 min readLW link

(alignment.openai.com)

Uncertain Updates: May 2026

Gordon Seidoh Worley8 May 2026 1:20 UTC

14 points

2 comments1 min readLW link

(www.uncertainupdates.com)

The Frictionless Double

zw57 May 2026 23:11 UTC

10 points

4 comments8 min readLW link

The AI industry is where banking was in 2006. (We’re hiring)

felixgaston7 May 2026 21:52 UTC

53 points

1 comment2 min readLW link

(forum.effectivealtruism.org)

Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations

Subhash Kantamneni, kitft, Euan Ong and Sam Marks

7 May 2026 20:21 UTC

213 points

35 comments8 min readLW link

Axes of Planning in LLMs + Partial Lit Review

NickyP7 May 2026 19:53 UTC

12 points

0 comments9 min readLW link

(blog.sus.cat)

A review of “Investigating the consequences of accidentally grading CoT during RL”

Buck7 May 2026 18:06 UTC

76 points

1 comment8 min readLW link

Try, even if they have you cold

WalterL7 May 2026 17:19 UTC

102 points

14 comments2 min readLW link

Mechanistic estimation for wide random MLPs

Jacob_Hilton7 May 2026 16:20 UTC

85 points

5 comments5 min readLW link

(www.alignment.org)

Over Eight Months of Progress in Two: Analyzing the Mythos Preview Capability Jump

Alvin Ånestrand7 May 2026 16:19 UTC

10 points

8 comments17 min readLW link

(forecastingaifutures.substack.com)