All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 20252026

All Jan Feb Mar AprMayJun

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 192021 22 23 24 25 26 27 28 29 30 31

Conclave 1492

Vaniver19 May 2026 23:44 UTC

72 points

7 comments1 min readLW link

Childhood And Education #19: Letting Kids Be Kids #2

Zvi19 May 2026 22:20 UTC

21 points

1 comment12 min readLW link

(thezvi.wordpress.com)

Implications Of Predicting The Next Token

jdp19 May 2026 22:17 UTC

108 points

6 comments31 min readLW link

(minihf.com)

Which goals actually motivate deceptive alignment?

Cleo Nardo and Alex Mallen

19 May 2026 21:53 UTC

25 points

0 comments10 min readLW link

Housing Roundup #15: The War Against Renters

Zvi19 May 2026 21:40 UTC

19 points

1 comment14 min readLW link

(thezvi.wordpress.com)

Leaving DCA to the North on Foot

jefftk19 May 2026 20:30 UTC

19 points

0 comments1 min readLW link

(www.jefftk.com)

A Visual Guide to Natural Latents

Alfred Harwood19 May 2026 19:10 UTC

55 points

0 comments18 min readLW link

Humans are not automatically strategic — “inner work” edition

Chris Lakin19 May 2026 18:37 UTC

36 points

0 comments1 min readLW link

[Webinar]: How close is AI to taking my job? (And what the benchmarks aren’t telling us)

Schizoid Rentoid19 May 2026 17:43 UTC

2 points

0 comments1 min readLW link

We Need to Get Serious about Uplift Studies

frmsaul and Eye You

19 May 2026 17:21 UTC

23 points

0 comments5 min readLW link

Brain Structure and IQ: How Myelin Elevates Intelligence

Shiva's Right Foot19 May 2026 14:13 UTC

57 points

7 comments12 min readLW link

Sealing Conditional Misalignment in Inoculation Prompting with Consistency Training

David Africa, Sukrati_Gautam and Neil Shah

19 May 2026 13:55 UTC

44 points

7 comments6 min readLW link

Let’s have more partial insiders.

Cleo Nardo19 May 2026 7:24 UTC

15 points

0 comments2 min readLW link

Roadmap through AI safety programs for early-career technical researchers

Mikhail Mironov19 May 2026 3:45 UTC

18 points

5 comments5 min readLW link

When Fluency Is Free

mcawesome19 May 2026 3:05 UTC

7 points

2 comments1 min readLW link

The anthropic argument against the existence of God.

usrnmtaken19 May 2026 3:05 UTC

−10 points

1 comment6 min readLW link

Should Rationalists Looksmaxx?

albertcai19 May 2026 3:03 UTC

9 points

2 comments6 min readLW link

(albertjcai.substack.com)

AI emotions and aligned behavior

lisunshiny19 May 2026 3:02 UTC

9 points

0 comments5 min readLW link

(liannsun.com)

Tracking Difficulty with Feature Portfolios

kaivu, leni, zef and rohuang

19 May 2026 2:25 UTC

22 points

0 comments5 min readLW link

Outsiders should focus on specs/constitutions (among other things)

Cleo Nardo19 May 2026 1:04 UTC

4 points

5 comments2 min readLW link

Logical Share Splitting for Intuitionists

DaemonicSigil19 May 2026 0:42 UTC

19 points

9 comments5 min readLW link

(notoneunusualthing.substack.com)

Coordinal: A Postmortem.

Ronak_Mehta18 May 2026 20:43 UTC

37 points

3 comments4 min readLW link

(ronakrm.github.io)

Noticing Confusion: A practice in staying curious

vmehra18 May 2026 19:31 UTC

10 points

1 comment6 min readLW link

Dating Roundup #12: Sex and Violence

Zvi18 May 2026 19:20 UTC

28 points

1 comment27 min readLW link

(thezvi.wordpress.com)

Negation Neglect: When models fail to learn negations in training

harrymayne, Lev McKinney and Owain_Evans

18 May 2026 18:37 UTC

119 points

37 comments8 min readLW link

So are you some kind of communist?

jchan18 May 2026 15:53 UTC

5 points

1 comment3 min readLW link

Thoughts on interviewing candidates for AI safety fellowships

beyarkay (Boyd Kane)18 May 2026 15:28 UTC

35 points

4 comments7 min readLW link

(boydkane.com)

PauseAI Munich Local Group Kickoff

mofeien18 May 2026 15:13 UTC

3 points

0 comments1 min readLW link

Classifier Context Rot: Monitor Performance Degrades with Context Length

Fabien Roger and Sam Martin

18 May 2026 14:05 UTC

54 points

1 comment4 min readLW link

How useful is cross-domain generalization for training LLM monitors?

Fabien Roger and Sam Martin

18 May 2026 13:52 UTC

21 points

0 comments4 min readLW link

Jhana Quick Start Guide

Zmavli Caimle18 May 2026 8:51 UTC

15 points

3 comments11 min readLW link

Links #1: 2026/05 Part 1

papetoast18 May 2026 5:04 UTC

10 points

0 comments18 min readLW link

why pollen allergies?

bhauth18 May 2026 4:44 UTC

33 points

6 comments6 min readLW link

(www.bhauth.com)

Why Physical Attractiveness Matters for Men’s Dating Prospects

johnswentworth18 May 2026 2:22 UTC

9 points

13 comments3 min readLW link

Bay Summer Solstice 2026

Raemon18 May 2026 0:34 UTC

16 points

4 comments1 min readLW link

How to Quit Fandom: Apostasy

Laiba Rehman ✦ RJ17 May 2026 21:09 UTC

58 points

3 comments4 min readLW link

Engineering a Safer World: Risk Modelling — and Safety Engineering? — for AI Loss of Control

Oliver Sourbut17 May 2026 16:02 UTC

10 points

1 comment9 min readLW link

(www.oliversourbut.net)

Next Token Prediction is a Misleading Term

Adam Newgas17 May 2026 11:58 UTC

12 points

2 comments6 min readLW link

(www.boristhebrave.com)

Can ELK be brute-forced? Intertheoretic reduction

Q Home17 May 2026 10:21 UTC

13 points

0 comments3 min readLW link

James C. Scott: Seeing Like a State

Martin Sustrik17 May 2026 8:40 UTC

56 points

6 comments7 min readLW link

(www.250bpm.com)

How to Reason about Your Health Issues

Taylor G. Lunt17 May 2026 5:10 UTC

23 points

28 comments5 min readLW link

Are You Not Rationalists?

J Thomas Moros17 May 2026 3:27 UTC

1 point

0 comments7 min readLW link

Falling for the statistical parrot

FlorianH17 May 2026 1:02 UTC

5 points

0 comments2 min readLW link

On getting unstuck

Joe Rogero17 May 2026 0:59 UTC

21 points

1 comment4 min readLW link

(subatomicarticles.com)

A relatively brief explanation of Boltzmann Brains

Eliezer Yudkowsky16 May 2026 21:19 UTC

206 points

155 comments4 min readLW link

Benchmarking Real Work

kaivu, leni, rohuang and zef

16 May 2026 20:43 UTC

30 points

2 comments4 min readLW link

Critique Systems, Not Reality

Morphism16 May 2026 19:11 UTC

5 points

1 comment25 min readLW link

(thothhermes.substack.com)

Trying to use NLAs to find out how Qwen 2.5 7B does multiplication

Hannes Thurnherr16 May 2026 19:05 UTC

23 points

4 comments6 min readLW link

A Year Late, Claude Finally Beats Pokémon

Julian Bradshaw16 May 2026 7:05 UTC

162 points

12 comments9 min readLW link

NLA Verbalizations on AuditBench: Llama 70B

Realmbird16 May 2026 5:25 UTC

10 points

0 comments3 min readLW link