AI 2027: What Su­per­in­tel­li­gence Looks Like

3 Apr 2025 16:23 UTC
671 points
222 comments41 min readLW link
(ai-2027.com)

Ac­countabil­ity Sinks

Martin Sustrik22 Apr 2025 5:00 UTC
444 points
58 comments15 min readLW link
(250bpm.substack.com)

Play­ing in the Creek

Hastings10 Apr 2025 17:39 UTC
401 points
13 comments2 min readLW link
(hgreer.com)

LessWrong has been ac­quired by EA

habryka1 Apr 2025 13:09 UTC
362 points
55 comments1 min readLW link

VDT: a solu­tion to de­ci­sion theory

L Rudolf L1 Apr 2025 21:04 UTC
351 points
33 comments4 min readLW link

Why Have Sen­tence Lengths De­creased?

Arjun Panickssery3 Apr 2025 17:50 UTC
283 points
89 comments4 min readLW link
(arjunpanickssery.substack.com)

To Un­der­stand His­tory, Keep Former Pop­u­la­tion Distri­bu­tions In Mind

Arjun Panickssery23 Apr 2025 4:51 UTC
242 points
13 comments2 min readLW link
(arjunpanickssery.substack.com)

Jaan Tal­linn’s 2024 Philan­thropy Overview

jaan23 Apr 2025 11:06 UTC
227 points
8 comments1 min readLW link
(jaan.info)

Thoughts on AI 2027

Max Harms9 Apr 2025 21:26 UTC
222 points
61 comments21 min readLW link
(intelligence.org)

Learned pain as a lead­ing cause of chronic pain

SoerenMind9 Apr 2025 11:57 UTC
216 points
39 comments9 min readLW link

Im­pact, agency, and taste

benkuhn19 Apr 2025 21:10 UTC
205 points
10 comments8 min readLW link
(www.benkuhn.net)

Short Timelines Don’t De­value Long Hori­zon Research

Vladimir_Nesov9 Apr 2025 0:42 UTC
176 points
24 comments1 min readLW link

Sur­pris­ing LLM rea­son­ing failures make me think we still need qual­i­ta­tive break­throughs for AGI

Kaj_Sotala15 Apr 2025 15:56 UTC
174 points
52 comments18 min readLW link

Fron­tier AI Models Still Fail at Ba­sic Phys­i­cal Tasks: A Man­u­fac­tur­ing Case Study

Adam Karvonen14 Apr 2025 17:38 UTC
158 points
42 comments7 min readLW link
(adamkarvonen.github.io)

Align­ment Fak­ing Re­vis­ited: Im­proved Clas­sifiers and Open Source Extensions

8 Apr 2025 17:32 UTC
147 points
20 comments12 min readLW link

Train­ing AGI in Se­cret would be Un­safe and Unethical

Daniel Kokotajlo18 Apr 2025 12:27 UTC
140 points
15 comments6 min readLW link

AI-en­abled coups: a small group could use AI to seize power

16 Apr 2025 16:51 UTC
137 points
23 comments7 min readLW link

AI 2027 is a Bet Against Am­dahl’s Law

snewman21 Apr 2025 3:09 UTC
127 points
57 comments9 min readLW link

Ctrl-Z: Con­trol­ling AI Agents via Resampling

16 Apr 2025 16:21 UTC
126 points
0 comments20 min readLW link

Re­search Notes: Run­ning Claude 3.7, Gem­ini 2.5 Pro, and o3 on Poké­mon Red

Julian Bradshaw21 Apr 2025 3:52 UTC
124 points
20 comments14 min readLW link

Three Months In, Eval­u­at­ing Three Ra­tion­al­ist Cases for Trump

Arjun Panickssery18 Apr 2025 8:27 UTC
117 points
33 comments4 min readLW link

Show, not tell: GPT-4o is more opinionated in images than in text

2 Apr 2025 8:51 UTC
115 points
42 comments3 min readLW link

“The Era of Ex­pe­rience” has an un­solved tech­ni­cal al­ign­ment problem

Steven Byrnes24 Apr 2025 13:57 UTC
115 points
48 comments23 min readLW link

Among Us: A Sand­box for Agen­tic Deception

5 Apr 2025 6:24 UTC
114 points
7 comments7 min readLW link

We should try to au­to­mate AI safety work asap

Marius Hobbhahn26 Apr 2025 16:35 UTC
113 points
10 comments15 min readLW link

Mis­rep­re­sen­ta­tion as a Bar­rier for In­terp (Part I)

29 Apr 2025 17:07 UTC
113 points
12 comments7 min readLW link

AI 2027: Responses

Zvi8 Apr 2025 12:50 UTC
111 points
3 comments30 min readLW link
(thezvi.wordpress.com)

New Cause Area Proposal

CallumMcDougall1 Apr 2025 7:12 UTC
110 points
4 comments1 min readLW link

How train­ing-gamers might func­tion (and win)

Vivek Hebbar11 Apr 2025 21:26 UTC
110 points
5 comments13 min readLW link

The Lizard­man and the Black Hat Bobcat

Screwtape6 Apr 2025 19:02 UTC
109 points
15 comments9 min readLW link

One-shot steer­ing vec­tors cause emer­gent mis­al­ign­ment, too

Jacob Dunefsky14 Apr 2025 6:40 UTC
98 points
6 comments11 min readLW link

How to Build a Third Place on Focusmate

Parker Conley28 Apr 2025 23:46 UTC
97 points
10 comments5 min readLW link
(parconley.com)

Re­ward hack­ing is be­com­ing more so­phis­ti­cated and de­liber­ate in fron­tier LLMs

Kei Nishimura-Gasparian24 Apr 2025 16:03 UTC
96 points
6 comments1 min readLW link

ASI ex­is­ten­tial risk: Re­con­sid­er­ing Align­ment as a Goal

habryka15 Apr 2025 19:57 UTC
95 points
14 comments19 min readLW link
(michaelnotebook.com)

7+ tractable di­rec­tions in AI control

28 Apr 2025 17:12 UTC
93 points
1 comment13 min readLW link

How To Believe False Things

Eneasz2 Apr 2025 16:28 UTC
92 points
14 comments3 min readLW link

$500 Bounty Prob­lem: Are (Ap­prox­i­mately) Deter­minis­tic Nat­u­ral La­tents All You Need?

21 Apr 2025 20:19 UTC
92 points
24 comments3 min readLW link

Is Gem­ini now bet­ter than Claude at Poké­mon?

Julian Bradshaw19 Apr 2025 23:34 UTC
91 points
12 comments5 min readLW link

The Uses of Complacency

sarahconstantin21 Apr 2025 18:50 UTC
88 points
5 comments8 min readLW link
(sarahconstantin.substack.com)

Keltham’s Lec­tures in Pro­ject Lawful

Morpheus1 Apr 2025 10:39 UTC
86 points
6 comments2 min readLW link

GPT-4o Is An Ab­surd Sycophant

Zvi28 Apr 2025 19:00 UTC
84 points
7 comments19 min readLW link
(thezvi.wordpress.com)

A Slow Guide to Con­fronting Doom

Ruby6 Apr 2025 2:10 UTC
84 points
20 comments14 min readLW link

o3 Is a Ly­ing Liar

Zvi23 Apr 2025 20:00 UTC
84 points
26 comments9 min readLW link
(thezvi.wordpress.com)

How peo­ple use LLMs

Elizabeth27 Apr 2025 21:48 UTC
83 points
6 comments1 min readLW link
(www.gleech.org)

What Makes an AI Startup “Net Pos­i­tive” for Safety?

jacquesthibs18 Apr 2025 20:33 UTC
82 points
23 comments2 min readLW link

Band­width Rules Every­thing Around Me: Oliver Habryka on OpenPhil and GoodVentures

Elizabeth29 Apr 2025 20:40 UTC
81 points
15 comments1 min readLW link
(acesounderglass.com)

An­nounc­ing ILIAD2: ODYSSEY

3 Apr 2025 17:01 UTC
80 points
1 comment1 min readLW link

You will crash your car in front of my house within the next week

Richard Korzekwa 1 Apr 2025 21:43 UTC
80 points
6 comments1 min readLW link

Why does LW not put much more fo­cus on AI gov­er­nance and out­reach?

12 Apr 2025 14:24 UTC
78 points
31 comments2 min readLW link

New Paper: In­fra-Bayesian De­ci­sion-Es­ti­ma­tion Theory

10 Apr 2025 9:17 UTC
78 points
4 comments1 min readLW link
(arxiv.org)