All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All 1 2 3 456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Semiconductor Fabs I: The Equipment

nomagicpill4 Jun 2025 22:09 UTC

19 points

0 comments19 min readLW link

(nomagicpill.github.io)

The Stereotype of the Stereotype

Ike4 Jun 2025 21:06 UTC

58 points

17 comments9 min readLW link

2. Why intuitive comparisons of large-scale impact are unjustified

Anthony DiGiovanni4 Jun 2025 20:30 UTC

25 points

0 comments16 min readLW link

Dating Roundup #6

Zvi4 Jun 2025 20:00 UTC

36 points

2 comments55 min readLW link

(thezvi.wordpress.com)

Rational Prime Calendar

RickHull4 Jun 2025 19:30 UTC

−1 points

0 comments3 min readLW link

A Technique of Pure Reason

Adam Newgas4 Jun 2025 19:07 UTC

11 points

3 comments2 min readLW link

“Flaky breakthroughs” pervade inner work — but almost no one tracks them

Chris Lakin4 Jun 2025 19:02 UTC

216 points

45 comments2 min readLW link

(chrislakin.blog)

[Question] LessOnline saved my life. Now how do I let go of this house?

RedMan4 Jun 2025 18:47 UTC

24 points

7 comments1 min readLW link

Linkpost: Predicting Empirical AI Research Outcomes with Language Models

quetzal_rainbow4 Jun 2025 18:14 UTC

10 points

1 comment1 min readLW link

(arxiv.org)

Self-Coordinated Deception in Current AI Models

Avi Brach-Neufeld4 Jun 2025 17:59 UTC

8 points

5 comments4 min readLW link

To MAIM or Not to MAIM. Introducing MARS: The Nuclear Deterrent case for Hardened Datacenters

kinsman4 Jun 2025 17:56 UTC

1 point

0 comments7 min readLW link

The Belocrat: a servant leader

belos4 Jun 2025 17:25 UTC

1 point

0 comments10 min readLW link

(bestofagreatlot.substack.com)

A list of books which are adjacent to EA

marco moldo4 Jun 2025 12:31 UTC

−1 points

0 comments3 min readLW link

Philosophical Jailbreaks: Demo of LLM Nihilism

Artem Karpov4 Jun 2025 12:03 UTC

3 points

0 comments5 min readLW link

Notes from a mini-replication of the alignment faking paper

Ben_Snodin4 Jun 2025 11:01 UTC

13 points

5 comments9 min readLW link

(www.bensnodin.com)

ARENA 6.0 - Call for Applicants

JamesH, JScriven, David Quarel, CallumMcDougall and James Fox

4 Jun 2025 10:19 UTC

26 points

3 comments6 min readLW link

Quickly Assessing Reward Hacking-like Behavior in LLMs and its Sensitivity to Prompt Variations

AndresCampero4 Jun 2025 7:22 UTC

26 points

1 comment17 min readLW link

Draft: A concise theory of agentic consciousness

Martin Vlach4 Jun 2025 5:00 UTC

2 points

4 comments1 min readLW link

Individual AI representatives don’t solve Gradual Disempowerement

Jan_Kulveit4 Jun 2025 1:26 UTC

62 points

4 comments3 min readLW link

Lectures on AI for high school students (and others)

Radford Neal3 Jun 2025 23:54 UTC

6 points

0 comments1 min readLW link

(radfordneal.wordpress.com)

Does the Taiwan invasion prevent mankind from obtaining the aligned ASI?

StanislavKrym3 Jun 2025 23:35 UTC

−14 points

1 comment5 min readLW link

Self-inquiry

Vadim Golub3 Jun 2025 22:15 UTC

−3 points

0 comments5 min readLW link

Question to LW devs: does LessWrong tries to be facebooky?

Roman Malov3 Jun 2025 22:08 UTC

5 points

1 comment1 min readLW link

Your Strategy Roadmap: Expert Tips + Live Training

Deena Englander3 Jun 2025 21:10 UTC

−4 points

0 comments4 min readLW link

Steering Vectors Can Help LLM Judges Detect Subtle Dishonesty

Leon Eshuijs, mcbeth, Etha and Archie Chaudhury

3 Jun 2025 20:33 UTC

12 points

1 comment5 min readLW link

Schelling Coordination via Agentic Loops

Callum-Luis Kindred3 Jun 2025 20:13 UTC

10 points

1 comment9 min readLW link

Visual Prompt Injections: Results on testing AI spam-defense and AI vulnerability to deceptive web ads.

Seon Gunness3 Jun 2025 20:10 UTC

4 points

0 comments12 min readLW link

Broad-Spectrum Cancer Treatments

sarahconstantin3 Jun 2025 19:40 UTC

150 points

10 comments7 min readLW link

(sarahconstantin.substack.com)

How to work through the ARENA program on your own

Leon Lang3 Jun 2025 17:38 UTC

38 points

5 comments6 min readLW link

How the veil of ignorance grounds sentientism

HoVY3 Jun 2025 17:29 UTC

−3 points

23 comments6 min readLW link

(forum.effectivealtruism.org)

In Which I Make the Mistake of Fully Covering an Episode of the All-In Podcast

Zvi3 Jun 2025 15:50 UTC

42 points

2 comments28 min readLW link

(thezvi.wordpress.com)

Transformer Modular Addition Through A Signal Processing Lens

Benjamin Kelley3 Jun 2025 15:32 UTC

1 point

0 comments1 min readLW link

AXRP Episode 41 - Lee Sharkey on Attribution-based Parameter Decomposition

DanielFilan3 Jun 2025 3:40 UTC

28 points

1 comment61 min readLW link

Notes on dynamism, power, & virtue

Lizka3 Jun 2025 1:40 UTC

19 points

0 comments12 min readLW link

Trends – Artificial Intelligence

Archimedes3 Jun 2025 0:48 UTC

1 point

1 comment1 min readLW link

(www.bondcap.com)

LLMs might have subjective experiences, but no concepts for them

No77e2 Jun 2025 21:18 UTC

17 points

5 comments2 min readLW link

In defense of memes (and thought-terminating clichés)

Harjas2 Jun 2025 20:18 UTC

11 points

4 comments10 min readLW link

Hedonic adaptation: you should not seeks pleasure

Crazy philosopher2 Jun 2025 19:23 UTC

0 points

6 comments2 min readLW link

Unfaithful Reasoning Can Fool Chain-of-Thought Monitoring

Benjamin Arnav, Pablo Bernabeu-Pérez, Tim Kostolansky, HanneWhitt, Nathan Helm-Burger and Mary Phuong

2 Jun 2025 19:08 UTC

78 points

17 comments3 min readLW link

Frank Herbert’s great insight into human agency—Muad’Dib the tool?

Nerret2 Jun 2025 18:52 UTC

2 points

1 comment1 min readLW link

Hemingway Case

Martin Sustrik2 Jun 2025 18:50 UTC

19 points

2 comments1 min readLW link

(www.250bpm.com)

[Question] What AI apps are surprisingly absent given current capabilities?

azergante2 Jun 2025 18:46 UTC

4 points

8 comments1 min readLW link

[Beneath Psychology] Chronic pain challenge part 2: the solution

jimmy2 Jun 2025 17:30 UTC

39 points

3 comments34 min readLW link

The Value Proposition of Romantic Relationships

johnswentworth2 Jun 2025 13:51 UTC

208 points

43 comments13 min readLW link

1. The challenge of unawareness for impartial altruist action guidance: Introduction

Anthony DiGiovanni2 Jun 2025 8:54 UTC

48 points

6 comments13 min readLW link

‘Wicked’: thoughts

KatjaGrace2 Jun 2025 6:20 UTC

25 points

3 comments3 min readLW link

(worldspiritsockpuppet.com)

Humanity needs a Ulysses Pact for AI

Lukas N.P. Egger1 Jun 2025 20:56 UTC

1 point

2 comments1 min readLW link

Text Steers Vision

Woody Gan1 Jun 2025 20:30 UTC

5 points

0 comments7 min readLW link

[Question] Possible AI regulation emergency?

CronoDAS1 Jun 2025 20:30 UTC

19 points

1 comment1 min readLW link

Eliezer Yudkowsky & Connor Leahy | AI Risk, Safety & Alignment Q&A [4K Remaster + HQ Audio]

Dex Volkov1 Jun 2025 20:20 UTC

−8 points

2 comments1 min readLW link

(www.youtube.com)