Sam Altman’s sister claims Sam sexually abused her—Part 4: Timeline, continued

pythagoras5015 · 13 Apr 2025 23:41 UTC
1 point
0 comments · 51 min read · LW link

The Structure of the Pain of Change

ReverendBayes · 13 Apr 2025 21:51 UTC
7 points
0 comments · 10 min read · LW link

Luna Lovegood and the Chamber of Secrets, Part 4

13 Apr 2025 20:55 UTC
3 points
0 comments · 4 min read · LW link

Thoughts on the Double Impact Project

Mati_Roy · 13 Apr 2025 19:07 UTC
27 points
14 comments · 2 min read · LW link

Intro to Multi-Agent Safety

james__p · 13 Apr 2025 17:40 UTC
12 points
0 comments · 5 min read · LW link

Vestigial reasoning in RL

Caleb Biddulph · 13 Apr 2025 15:40 UTC
54 points
8 comments · 9 min read · LW link

Four Types of Disagreement

silentbob · 13 Apr 2025 11:22 UTC
50 points
4 comments · 5 min read · LW link

How I switched careers from software engineer to AI policy operations

Lucie Philippon · 13 Apr 2025 6:37 UTC
58 points
1 comment · 5 min read · LW link

Steelmanning heuristic arguments

Dmitry Vaintrob · 13 Apr 2025 1:09 UTC
78 points
0 comments · 17 min read · LW link

MONA: Three Months Later—Updates and Steganography Without Optimization Pressure

12 Apr 2025 23:15 UTC
31 points
0 comments · 5 min read · LW link

The Era of the Dividual—are we falling apart?

James Stephen Brown · 12 Apr 2025 22:35 UTC
3 points
2 comments · 4 min read · LW link

Commitment Races are a technical problem ASI can easily solve

Knight Lee · 12 Apr 2025 22:22 UTC
7 points
6 comments · 6 min read · LW link

The King’s Gift: How Institutions Rebrand Responsibility into Illusion

Hu Yichao · 12 Apr 2025 19:38 UTC
1 point
0 comments · 1 min read · LW link

Experts have it easy

beyarkay · 12 Apr 2025 19:32 UTC
23 points
3 comments · 9 min read · LW link

find_purpose.exe

heatdeathandtaxes · 12 Apr 2025 19:31 UTC
−1 points
0 comments · 5 min read · LW link
(heatdeathandtaxes.substack.com)

The Cynic Wasps in the Beehive

mempko · 12 Apr 2025 19:30 UTC
−3 points
0 comments · 1 min read · LW link
(blog.mempko.com)

Luna Lovegood and the Chamber of Secrets, Part 3

12 Apr 2025 19:20 UTC
3 points
0 comments · 2 min read · LW link

[Question] What is autism?

Adam Zerner · 12 Apr 2025 18:12 UTC
18 points
7 comments · 1 min read · LW link

College Advice For People Like Me

henryj · 12 Apr 2025 14:36 UTC
50 points
5 comments · 17 min read · LW link
(www.henryjosephson.com)

Why does LW not put much more focus on AI governance and outreach?

12 Apr 2025 14:24 UTC
78 points
31 comments · 2 min read · LW link

[Question] Is Local Order a Clue to Universal Entropy? How a Failed Professor Searches for a ‘Sacred Motivational Order’

P. João · 12 Apr 2025 13:39 UTC
2 points
2 comments · 2 min read · LW link

What are good safety standards for open source AIs from China?

ChristianKl · 12 Apr 2025 13:06 UTC
10 points
2 comments · 1 min read · LW link

Will US tariffs push data centers for large model training offshore?

ChristianKl · 12 Apr 2025 12:47 UTC
20 points
3 comments · 1 min read · LW link

Self propagating story.

Canaletto · 12 Apr 2025 12:32 UTC
3 points
0 comments · 8 min read · LW link

Calling Bullshit—the Cheatsheet

Niklas Lehmann · 12 Apr 2025 11:43 UTC
13 points
4 comments · 2 min read · LW link

The Internal Model Principle: A Straightforward Explanation

Alfred Harwood · 12 Apr 2025 10:58 UTC
23 points
6 comments · 19 min read · LW link

ACX Spring Meetup 2025 @ Klang Valley, Malaysia

Yi-Yang · 12 Apr 2025 7:31 UTC
2 points
0 comments · 1 min read · LW link

Distributed whistleblowing

samuelshadrach · 12 Apr 2025 6:36 UTC
5 points
5 comments · 4 min read · LW link
(samuelshadrach.com)

[Question] How likely is the USA to decay, and how will it influence AI development?

StanislavKrym · 12 Apr 2025 4:42 UTC
10 points
0 comments · 1 min read · LW link

[Question] Does this game have a name?

Mis-Understandings · 12 Apr 2025 1:52 UTC
4 points
4 comments · 1 min read · LW link

Bias Mitigation in Language Models by Steering Features

akankshanc · 12 Apr 2025 0:10 UTC
1 point
0 comments · 9 min read · LW link
(akankshanc.io)

Do we want too much from a potentially godlike AGI?

StanislavKrym · 11 Apr 2025 23:33 UTC
−1 points
0 comments · 2 min read · LW link

How training-gamers might function (and win)

Vivek Hebbar · 11 Apr 2025 21:26 UTC
110 points
5 comments · 13 min read · LW link

The limits of black-box evaluations: two hypotheticals

TFD · 11 Apr 2025 20:45 UTC
1 point
0 comments · 4 min read · LW link
(www.thefloatingdroid.com)

Comments on “AI 2027”

Randaly · 11 Apr 2025 20:32 UTC
19 points
14 comments · 7 min read · LW link

Debunk the myth - Testing the generalized reasoning ability of LLMs

Defender7762 · 11 Apr 2025 20:17 UTC
1 point
5 comments · 4 min read · LW link

Theories of Impact for Causality in AI Safety

alexisbellot · 11 Apr 2025 20:16 UTC
11 points
1 comment · 6 min read · LW link

Why Bigger Models Generalize Better

PapersToAGI · 11 Apr 2025 19:54 UTC
1 point
0 comments · 2 min read · LW link

Can LLMs learn Steganographic Reasoning via RL?

11 Apr 2025 16:33 UTC
29 points
3 comments · 6 min read · LW link

My day in 2035

Tenoke · 11 Apr 2025 16:31 UTC
19 points
2 comments · 7 min read · LW link
(svilentodorov.xyz)

Youth Lockout

Xavi CF · 11 Apr 2025 15:05 UTC
47 points
6 comments · 5 min read · LW link

[Question] Is the ethics of interaction with primitive peoples already solved?

StanislavKrym · 11 Apr 2025 14:56 UTC
−4 points
0 comments · 1 min read · LW link

OpenAI Responses API changes models’ behavior

11 Apr 2025 13:27 UTC
53 points
6 comments · 2 min read · LW link

Weird Random Newcomb Problem

Tapatakt · 11 Apr 2025 13:09 UTC
21 points
16 comments · 4 min read · LW link

On Google’s Safety Plan

Zvi · 11 Apr 2025 12:51 UTC
57 points
6 comments · 33 min read · LW link
(thezvi.wordpress.com)

Luna Lovegood and the Chamber of Secrets, Part 2

11 Apr 2025 12:42 UTC
2 points
1 comment · 3 min read · LW link

Paper

dynomight · 11 Apr 2025 12:20 UTC
43 points
12 comments · 3 min read · LW link

Why are neuro-symbolic systems not considered when it comes to AI Safety?

Edy Nastase · 11 Apr 2025 9:41 UTC
3 points
6 comments · 1 min read · LW link

Crash scenario 1: Rapidly mobilise for a 2025 AI crash

Remmelt · 11 Apr 2025 6:54 UTC
12 points
4 comments · 1 min read · LW link

Currency Collapse

prue · 11 Apr 2025 3:48 UTC
27 points
3 comments · 9 min read · LW link
(www.prue0.com)