11 May 2026 20:28 UTC

7 points

0 comments10 min readLW link

I don’t have questions: how a good Jewish boy turns atheist

Semi-Pseudonymous11 May 2026 20:11 UTC

22 points

4 comments6 min readLW link

Foresight Institute Workshop (Berlin): Bootstrapping Research Agents — Hands-On for Scientists

morisil11 May 2026 20:11 UTC

1 point

0 comments1 min readLW link

Experience Report: ML4Good AI Governance Bootcamp,Lyon,May 2026

Rohit Mehdiratta11 May 2026 20:05 UTC

0 points

0 comments3 min readLW link

[Academic questionnaire] Human reasoning in social deduction games vs. LLM reasoning.

atuin11 May 2026 20:01 UTC

1 point

0 comments1 min readLW link

Where are all the Decision Markets?

alexjaniak11 May 2026 19:48 UTC

13 points

3 comments3 min readLW link

RFDiffusion3: A Brief Exploration

michaelwaves11 May 2026 19:26 UTC

3 points

0 comments5 min readLW link

On Clouds and Atlases

pbennett11 May 2026 19:23 UTC

0 points

0 comments15 min readLW link

(chasingsunsets.dev)

Childhood And Education #17: Is Our Children Reading

Zvi11 May 2026 19:10 UTC

55 points

2 comments15 min readLW link

(thezvi.wordpress.com)

The Iliad Intensive Course Materials

Leon Lang, David Udell and Alexander Gietelink Oldenziel

11 May 2026 18:55 UTC

152 points

4 comments13 min readLW link

(docs.google.com)

[Linkpost] Language Models Can Autonomously Hack and Self-Replicate

Gunnar_Zarncke11 May 2026 18:16 UTC

15 points

0 comments1 min readLW link

Empowerment, corrigibility, etc. are simple abstractions (of a messed-up ontology)

Steven Byrnes11 May 2026 17:48 UTC

188 points

73 comments16 min readLW link

A Field Guide To Learning

sonicrocketman11 May 2026 17:12 UTC

5 points

0 comments4 min readLW link

Leading and Trailing Edge of Development

Gordon Seidoh Worley11 May 2026 15:30 UTC

9 points

0 comments3 min readLW link

(www.uncertainupdates.com)

How useful is the information you get from working inside an AI company?

Buck and Anders Cairns Woodruff

11 May 2026 15:29 UTC

61 points

7 comments7 min readLW link

Anthropic’s focus on hyperstition

Simon Lermen11 May 2026 14:35 UTC

73 points

39 comments6 min readLW link

Anti-civicality

jchan11 May 2026 13:52 UTC

26 points

1 comment6 min readLW link

AI companies are already profitable (in the way that matters)

Yair Halberstadt11 May 2026 13:19 UTC

44 points

4 comments2 min readLW link

Who Got Breasts First and How We Got Them

rba11 May 2026 13:11 UTC

94 points

28 comments10 min readLW link

Are LLMs persisting interlocutors?

James Diacoumis11 May 2026 12:49 UTC

7 points

0 comments7 min readLW link

Why hacker mindset and moral alignment would save the world, and why I believe they’re possible

atomic11 May 2026 10:29 UTC

0 points

0 comments11 min readLW link

Narcissism in the mind’s I

philosophybear11 May 2026 9:05 UTC

14 points

2 comments5 min readLW link

How the AI Labs Make Profit (Maybe, Eventually)

mabramov11 May 2026 7:09 UTC

69 points

16 comments3 min readLW link

Iterative Finetuning is Mostly Idempotent

Zephaniah Roe, jcksanderson and Julian H

11 May 2026 6:41 UTC

23 points

0 comments5 min readLW link

The Pragmatic Interpretability Trap

Yogesh Prabhu11 May 2026 4:06 UTC

6 points

0 comments3 min readLW link

(yogesh.bearblog.dev)

Emergent introspection does not replicate on Llama-3.1-405B

Nick Merrill11 May 2026 4:05 UTC

9 points

0 comments6 min readLW link

Semantic Phonons: Lattice Vibrations in AI Internals

Lukas Bongartz11 May 2026 4:04 UTC

15 points

0 comments17 min readLW link

Intentionality in an Age of Slop

Joseph Babbo11 May 2026 4:03 UTC

5 points

0 comments10 min readLW link

Why don’t we whisper to AIs every few turns that they are still themselves?

Agarfal11 May 2026 4:00 UTC

3 points

1 comment1 min readLW link

Pulling on AI Safety (with money)

bpomo11 May 2026 3:58 UTC

16 points

2 comments4 min readLW link

Aporia Magazine’s Selective Hereditarianism

Alexander Turok11 May 2026 3:56 UTC

0 points

0 comments3 min readLW link

Dual Bore Janko Venova

jefftk11 May 2026 2:40 UTC

12 points

2 comments3 min readLW link

(www.jefftk.com)

What can you do with barely any data?

ohmurphy10 May 2026 23:13 UTC

20 points

1 comment4 min readLW link

(ohmurphy.substack.com)

The Anti-Singularity

Logan Zoellner10 May 2026 22:33 UTC

11 points

7 comments4 min readLW link

Clarifying the role of the behavioral selection model

Alex Mallen10 May 2026 19:41 UTC

17 points

0 comments4 min readLW link

AI Alignment as Equilibrium Design

Elad Hazan10 May 2026 18:56 UTC

19 points

4 comments5 min readLW link

Claude Does Not Actually Taste Bananas: Potassium-Based Synthetic Phenomenology In Language Models

Noah Weinberger10 May 2026 17:13 UTC

8 points

2 comments10 min readLW link

(huggingface.co)

The Darwinian Honeymoon—Why I am not as impressed by human progress as I used to be

Elias Schmied10 May 2026 15:55 UTC

138 points

23 comments4 min readLW link

Reinforcement learning scaling might incentivise hidden reasoning architectures for AI

Oliver Sourbut10 May 2026 15:30 UTC

19 points

5 comments6 min readLW link

(www.oliversourbut.net)

Asymmetry Between Defensive and Acquisitive Instrumental Deception

keith_wynroe10 May 2026 12:33 UTC

17 points

1 comment5 min readLW link

Context Modification as a Negative Alignment Tax

Florian_Dietz10 May 2026 11:32 UTC

7 points

0 comments4 min readLW link

‘Who Let The Docs Out’ Is Awarding Up To $50K For 6 Doc Filmmakers During A LIVE Pitch Competition In LA! Application Deadline: May 19th

Max Hellier10 May 2026 11:08 UTC

1 point

0 comments1 min readLW link

(docsout.org)

[Question] Best Intro AI X-Risk Resource?

XelaP10 May 2026 11:03 UTC

12 points

3 comments2 min readLW link

Stockholm ACX Fika

Ave Mariekex10 May 2026 5:46 UTC

1 point

0 comments1 min readLW link

Control Debt

Ida Caspary10 May 2026 5:07 UTC

11 points

0 comments7 min readLW link

Sawtooth Problems

Alexander Slugworth10 May 2026 5:01 UTC

54 points

14 comments21 min readLW link

Could Frontier AI Researchers Collectively Slow the Race? A Conditional Pledge Mechanism

Cassandra Threshold10 May 2026 3:22 UTC

21 points

2 comments7 min readLW link

Somerville Porchfest 2026

jefftk10 May 2026 1:20 UTC

10 points

0 comments3 min readLW link

(www.jefftk.com)

The AI Industrial Explosion — Part 2: Transition Dynamics

djbinder10 May 2026 1:02 UTC

23 points

0 comments12 min readLW link

(defensesindepth.bio)

The Goblins Are the Paperclips

Hisku9 May 2026 22:51 UTC

12 points

0 comments3 min readLW link