All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar AprMayJun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 293031

CFAR is running an experimental mini-workshop (June 2-6, Berkeley CA)!

Davis_Kingsley29 May 2025 22:02 UTC

65 points

2 comments2 min readLW link

Orphaned Policies (Post 5 of 7 on AI Governance)

Mass_Driver29 May 2025 21:42 UTC

72 points

5 comments16 min readLW link

Gradual Disempowerment: Concrete Research Projects

Raymond Douglas29 May 2025 18:55 UTC

103 points

10 comments10 min readLW link

Do you even have a system prompt? (PSA / repo)

Croissanthology29 May 2025 18:49 UTC

111 points

78 comments2 min readLW link

Incorrect Baseline Evaluations Call into Question Recent LLM-RL Claims

shash4229 May 2025 18:40 UTC

66 points

7 comments1 min readLW link

(safe-lip-9a8.notion.site)

Dimensionalization

Jordan Rubin29 May 2025 18:18 UTC

7 points

6 comments4 min readLW link

(jordanmrubin.substack.com)

Distilled Human Judgment: Reifying AI Alignment

Devansh Mehta29 May 2025 18:06 UTC

2 points

0 comments4 min readLW link

Summer AI Safety Intro Fellowships in Boston and Online (Policy & Technical) – Apply by June 6!

jandrade11229 May 2025 18:02 UTC

1 point

0 comments1 min readLW link

Digital sentience funding opportunities: Support for applied work and research

aog and zdgroff

29 May 2025 15:22 UTC

21 points

0 comments4 min readLW link

When to Be Nice vs Kind

Declan Molony29 May 2025 15:06 UTC

25 points

2 comments1 min readLW link

AI #118: Claude Ascendant

Zvi29 May 2025 14:10 UTC

45 points

8 comments57 min readLW link

(thezvi.wordpress.com)

Social Capital—Does it Matter?

Momcilo29 May 2025 12:26 UTC

−9 points

1 comment6 min readLW link

Alignment Crisis: Genocide Denial

_mp_29 May 2025 12:04 UTC

−11 points

5 comments4 min readLW link

Cross-posting to Substack

jefftk29 May 2025 11:10 UTC

12 points

0 comments1 min readLW link

(www.jefftk.com)

Reflections on AI Wisdom, plus announcing Wise AI Wednesdays

Chris_Leong29 May 2025 7:13 UTC

18 points

0 comments3 min readLW link

[Question] What was so great about Move 37?

Caleb Biddulph29 May 2025 7:00 UTC

24 points

4 comments3 min readLW link

Procedural vs. Causal Understanding

Caleb Biddulph29 May 2025 7:00 UTC

7 points

2 comments2 min readLW link

Security Mindset: Hacking Pinball High Scores

gwern29 May 2025 3:39 UTC

29 points

4 comments1 min readLW link

(gwern.net)

Quick Minimal Playhouse

jefftk29 May 2025 2:10 UTC

17 points

1 comment1 min readLW link

(www.jefftk.com)

Cognitive Exhaustion and Engineered Trust: Lessons from My Gym

Priyanka Bharadwaj29 May 2025 1:21 UTC

14 points

3 comments3 min readLW link

Truth or Dare

Duncan Sabien (Inactive)29 May 2025 0:07 UTC

263 points

61 comments69 min readLW link

[Question] What should I read to understand ancestral human society?

Lorec28 May 2025 23:36 UTC

9 points

4 comments1 min readLW link

The case for countermeasures to memetic spread of misaligned values

Alex Mallen28 May 2025 21:12 UTC

83 points

8 comments7 min readLW link

a case for a lesswrong private prediction market

don't_wanna_be_stupid_any_more28 May 2025 20:26 UTC

3 points

0 comments2 min readLW link

LessWrong Feed [new, now in beta]

Ruby28 May 2025 19:01 UTC

53 points

88 comments8 min readLW link

Fun With Veo 3 and Media Generation

Zvi28 May 2025 18:30 UTC

29 points

0 comments5 min readLW link

(thezvi.wordpress.com)

Reverse Auctions for Group Decision-Making

krishmatta28 May 2025 17:52 UTC

3 points

4 comments3 min readLW link

(krishmatta.net)

What college major should I choose if I am unsure?

contrejour28 May 2025 17:50 UTC

−1 points

6 comments1 min readLW link

Evaluation As Feedback Cycle

belos28 May 2025 17:02 UTC

1 point

0 comments18 min readLW link

(bestofagreatlot.substack.com)

How much might AI legislation cost in the U.S.?

will rinehart28 May 2025 16:21 UTC

−5 points

0 comments11 min readLW link

What LLMs lack

p.b.28 May 2025 16:19 UTC

15 points

5 comments3 min readLW link

Playlist Inspired by Manifest 2024

Commander Zander28 May 2025 16:03 UTC

4 points

0 comments1 min readLW link

(open.spotify.com)

AISN #56: Google Releases Veo 3

Corin Katzke and Dan H

28 May 2025 16:00 UTC

7 points

0 comments4 min readLW link

(newsletter.safe.ai)

How Self-Aware Are LLMs?

Christopher Ackerman28 May 2025 12:57 UTC

30 points

9 comments10 min readLW link

Can We Hack Hedonic Treadmills?

Vincent Li28 May 2025 11:42 UTC

3 points

0 comments3 min readLW link

AI’s goals may not match ours

Algon, steven0461 and Vishakha

28 May 2025 9:30 UTC

14 points

1 comment3 min readLW link

AI may pursue goals

Algon, steven0461 and Vishakha

28 May 2025 9:30 UTC

13 points

0 comments1 min readLW link

The Best Way to Align an LLM: Is Inner Alignment Now a Solved Problem?

RogerDearnaley28 May 2025 6:21 UTC

32 points

34 comments9 min readLW link

Spectral radii dimensionality reduction computed without gradient calculations

Joseph Van Name28 May 2025 5:06 UTC

5 points

4 comments6 min readLW link

If you’re not sure how to sort a list or grid—seriate it!

gwern28 May 2025 3:54 UTC

221 points

9 comments3 min readLW link

(www.jstatsoft.org)

Briefly analyzing the 10-year moratorium amendment

RobertM28 May 2025 3:11 UTC

73 points

1 comment3 min readLW link

Does Sort Really Fall Back to Disk?

jefftk28 May 2025 1:20 UTC

13 points

2 comments1 min readLW link

(www.jefftk.com)

Shift Resources to Advocacy Now (Post 4 of 7 on AI Governance)

Mass_Driver28 May 2025 1:19 UTC

65 points

18 comments32 min readLW link

[Question] Colonialism in space: Does a collection of minds have exactly two attractors?

StanislavKrym27 May 2025 23:35 UTC

7 points

8 comments1 min readLW link

[Question] What are the best arguments you’ve seen for the Litany of Gendlin?

flowerfeatherfocus27 May 2025 21:19 UTC

7 points

8 comments1 min readLW link

What We Learned from Briefing 70+ Lawmakers on the Threat from AI

leticiagarcia27 May 2025 18:23 UTC

514 points

17 comments16 min readLW link

(substack.com)

My script for organizing OBNYC meetups

Orioth27 May 2025 18:14 UTC

3 points

0 comments4 min readLW link

Untrusted AIs can exploit feedback in control protocols

Mia Hopman, BionicD0LPH1N and Tyler Tracy

27 May 2025 16:41 UTC

30 points

0 comments16 min readLW link

Requiem for the hopes of a pre-AI world

Mitchell_Porter27 May 2025 14:47 UTC

103 points

0 comments3 min readLW link

The Best of All Possible Worlds

Jakub Growiec27 May 2025 13:16 UTC

11 points

7 comments49 min readLW link