All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All 1 2 345 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Lectures on AI for high school students (and others)

Radford Neal3 Jun 2025 23:54 UTC

6 points

0 comments1 min readLW link

(radfordneal.wordpress.com)

Does the Taiwan invasion prevent mankind from obtaining the aligned ASI?

StanislavKrym3 Jun 2025 23:35 UTC

−14 points

1 comment5 min readLW link

Self-inquiry

Vadim Golub3 Jun 2025 22:15 UTC

−3 points

0 comments5 min readLW link

Question to LW devs: does LessWrong tries to be facebooky?

Roman Malov3 Jun 2025 22:08 UTC

5 points

1 comment1 min readLW link

Your Strategy Roadmap: Expert Tips + Live Training

Deena Englander3 Jun 2025 21:10 UTC

−4 points

0 comments4 min readLW link

Steering Vectors Can Help LLM Judges Detect Subtle Dishonesty

Leon Eshuijs, mcbeth, Etha and Archie Chaudhury

3 Jun 2025 20:33 UTC

12 points

1 comment5 min readLW link

Schelling Coordination via Agentic Loops

Callum-Luis Kindred3 Jun 2025 20:13 UTC

10 points

1 comment9 min readLW link

Visual Prompt Injections: Results on testing AI spam-defense and AI vulnerability to deceptive web ads.

Seon Gunness3 Jun 2025 20:10 UTC

4 points

0 comments12 min readLW link

Broad-Spectrum Cancer Treatments

sarahconstantin3 Jun 2025 19:40 UTC

150 points

10 comments7 min readLW link

(sarahconstantin.substack.com)

How to work through the ARENA program on your own

Leon Lang3 Jun 2025 17:38 UTC

38 points

5 comments6 min readLW link

How the veil of ignorance grounds sentientism

HoVY3 Jun 2025 17:29 UTC

−3 points

23 comments6 min readLW link

(forum.effectivealtruism.org)

In Which I Make the Mistake of Fully Covering an Episode of the All-In Podcast

Zvi3 Jun 2025 15:50 UTC

42 points

2 comments28 min readLW link

(thezvi.wordpress.com)

Transformer Modular Addition Through A Signal Processing Lens

Benjamin Kelley3 Jun 2025 15:32 UTC

1 point

0 comments1 min readLW link

AXRP Episode 41 - Lee Sharkey on Attribution-based Parameter Decomposition

DanielFilan3 Jun 2025 3:40 UTC

28 points

1 comment61 min readLW link

Notes on dynamism, power, & virtue

Lizka3 Jun 2025 1:40 UTC

19 points

0 comments12 min readLW link

Trends – Artificial Intelligence

Archimedes3 Jun 2025 0:48 UTC

1 point

1 comment1 min readLW link

(www.bondcap.com)

LLMs might have subjective experiences, but no concepts for them

No77e2 Jun 2025 21:18 UTC

17 points

5 comments2 min readLW link

In defense of memes (and thought-terminating clichés)

Harjas2 Jun 2025 20:18 UTC

11 points

4 comments10 min readLW link

Hedonic adaptation: you should not seeks pleasure

Crazy philosopher2 Jun 2025 19:23 UTC

0 points

6 comments2 min readLW link

Unfaithful Reasoning Can Fool Chain-of-Thought Monitoring

Benjamin Arnav, Pablo Bernabeu-Pérez, Tim Kostolansky, HanneWhitt, Nathan Helm-Burger and Mary Phuong

2 Jun 2025 19:08 UTC

78 points

17 comments3 min readLW link

Frank Herbert’s great insight into human agency—Muad’Dib the tool?

Nerret2 Jun 2025 18:52 UTC

2 points

1 comment1 min readLW link

Hemingway Case

Martin Sustrik2 Jun 2025 18:50 UTC

19 points

2 comments1 min readLW link

(www.250bpm.com)

[Question] What AI apps are surprisingly absent given current capabilities?

azergante2 Jun 2025 18:46 UTC

4 points

8 comments1 min readLW link

[Beneath Psychology] Chronic pain challenge part 2: the solution

jimmy2 Jun 2025 17:30 UTC

39 points

3 comments34 min readLW link

The Value Proposition of Romantic Relationships

johnswentworth2 Jun 2025 13:51 UTC

210 points

43 comments13 min readLW link

1. The challenge of unawareness for impartial altruist action guidance: Introduction

Anthony DiGiovanni2 Jun 2025 8:54 UTC

48 points

6 comments13 min readLW link

‘Wicked’: thoughts

KatjaGrace2 Jun 2025 6:20 UTC

25 points

3 comments3 min readLW link

(worldspiritsockpuppet.com)

Humanity needs a Ulysses Pact for AI

Lukas N.P. Egger1 Jun 2025 20:56 UTC

1 point

2 comments1 min readLW link

Text Steers Vision

Woody Gan1 Jun 2025 20:30 UTC

5 points

0 comments7 min readLW link

[Question] Possible AI regulation emergency?

CronoDAS1 Jun 2025 20:30 UTC

19 points

1 comment1 min readLW link

Eliezer Yudkowsky & Connor Leahy | AI Risk, Safety & Alignment Q&A [4K Remaster + HQ Audio]

Dex Volkov1 Jun 2025 20:20 UTC

−8 points

2 comments1 min readLW link

(www.youtube.com)

Ownership: the principle of “Deprive first, ask questions later”

MillardJMelnyk1 Jun 2025 20:19 UTC

−27 points

22 comments1 min readLW link

Economists should track the speed and magnitude of AI implementation projects

ParrotRobot1 Jun 2025 20:15 UTC

3 points

0 comments2 min readLW link

Ingroup

JenniferRM1 Jun 2025 19:47 UTC

−3 points

12 comments1 min readLW link

Apply to the AI Security Bootcamp [Aug 4 - Aug 29]

Pranav Gade, Jan Michelfeit and Jinglin Li

1 Jun 2025 19:47 UTC

27 points

2 comments4 min readLW link

Seeing how well an agentic AI coding tool can do compared to me using an actual real-world example

Massimog1 Jun 2025 19:24 UTC

32 points

2 comments1 min readLW link

(blog.massimogauthier.com)

Nicotine addiction, cloves, and needing to take a shit

eyesack1 Jun 2025 19:13 UTC

4 points

1 comment1 min readLW link

2nd Germany-wide ACX/LW event

Fernand01 Jun 2025 13:56 UTC

1 point

0 comments1 min readLW link

An Opinionated Guide to P-Values

amitlevy491 Jun 2025 11:48 UTC

11 points

0 comments8 min readLW link

(ivy0.substack.com)

Legal Personhood for Models: Novelli et. al & Mocanu

Stephen Martin1 Jun 2025 8:18 UTC

2 points

0 comments10 min readLW link

Is Escalation Inevitable?

Lennart Wijers31 May 2025 22:10 UTC

5 points

0 comments3 min readLW link

Policy Entropy, Learning, and Alignment (Or Maybe Your LLM Needs Therapy)

sdeture31 May 2025 22:09 UTC

15 points

6 comments8 min readLW link

The Unseen Hand: AI’s Problem Preemption and the True Future of Labor

Ben Kassan31 May 2025 22:04 UTC

8 points

0 comments20 min readLW link

The 80/20 playbook for mitigating AI scheming in 2025

Charbel-Raphaël31 May 2025 21:17 UTC

40 points

2 comments4 min readLW link

Collective Action for AI Safety (June 4, NYC)

Jordan Braunstein31 May 2025 20:27 UTC

1 point

0 comments1 min readLW link

The best approaches for mitigating “the intelligence curse” (or gradual disempowerment); my quick guesses at the best object-level interventions

ryan_greenblatt31 May 2025 18:20 UTC

78 points

19 comments5 min readLW link

Would It Be Better to Dispense with Good and Evil?

arusarda31 May 2025 16:40 UTC

−2 points

10 comments6 min readLW link

How Epistemic Collapse Looks from Inside

Martin Sustrik31 May 2025 16:30 UTC

9 points

11 comments1 min readLW link

(250bpm.substack.com)

When will AI automate all mental work, and how fast?

aggliu and Writer

31 May 2025 16:18 UTC

10 points

0 comments7 min readLW link

(youtu.be)

Progress links and short notes, 2025-05-31: RPI fellowship deadline tomorrow, Edge Esmeralda next week, and more

jasoncrawford31 May 2025 15:20 UTC

11 points

0 comments7 min readLW link

(newsletter.rootsofprogress.org)