The surprising adequacy of the Roblox game marketplace

Esteban Restrepo, 3 Jan 2026 14:15 UTC
26 points
3 comments, 8 min read, LW link
(papabos.substack.com)

Re: Anthropic Chinese Cyber-Attack. How Do We Protect Open-source Models?

Mayowa Osibodu, 3 Jan 2026 9:45 UTC
−1 points
2 comments, 6 min read, LW link

Give Skepticism a Try

Ape in the coat, 3 Jan 2026 8:57 UTC
12 points
17 comments, 3 min read, LW link
(apeinthecoat102771.substack.com)

Why We Should Talk Specifically Amid Uncertainty

sbaumohl, 3 Jan 2026 3:04 UTC
11 points
1 comment, 7 min read, LW link

Companies as “proto-ASI”

beyarkay (Boyd Kane), 3 Jan 2026 0:24 UTC
15 points
3 comments, 1 min read, LW link
(boydkane.com)

AXRP Episode 47 - David Rein on METR Time Horizons

DanielFilan, 3 Jan 2026 0:10 UTC
21 points
0 comments, 46 min read, LW link

The Weirdness of Dating/Mating: Deep Nonconsent Preference

johnswentworth, 2 Jan 2026 23:05 UTC
12 points
61 comments, 6 min read, LW link

Can AI learn human societal norms from social feedback (without recapitulating all the ways this has failed in human history?)

foodforthought, 2 Jan 2026 22:11 UTC
7 points
3 comments, 4 min read, LW link

Fertility Roundup #5: Causation

Zvi, 2 Jan 2026 22:00 UTC
19 points
5 comments, 25 min read, LW link
(thezvi.wordpress.com)

Scale-Free Goodness

testingthewaters, 2 Jan 2026 21:00 UTC
10 points
3 comments, 5 min read, LW link
(aclevername.substack.com)

Does developmental cognitive psychology provide any hints for making model alignment more robust?

foodforthought, 2 Jan 2026 20:31 UTC
7 points
0 comments, 3 min read, LW link

Does evolution provide any hints for making model alignment more robust?

foodforthought, 2 Jan 2026 19:06 UTC
5 points
0 comments, 4 min read, LW link

Where do AI Safety Fellows go? Analyzing a dataset of 600+ alumni

Christopher_Clay, 2 Jan 2026 18:14 UTC
20 points
2 comments, 5 min read, LW link
(forum.effectivealtruism.org)

Instruct Vectors—Base models can be instruct with activation vectors

Eriskii, 2 Jan 2026 18:14 UTC
21 points
0 comments, 8 min read, LW link

[Advanced Intro to AI Alignment] 2. What Values May an AI Learn? — 4 Key Problems

Towards_Keeperhood, 2 Jan 2026 14:51 UTC
33 points
10 comments, 19 min read, LW link

2025 Letter

zef, 2 Jan 2026 13:57 UTC
10 points
0 comments, 14 min read, LW link
(zephyyr.substack.com)

2025 in AI predictions

jessicata, 2 Jan 2026 4:29 UTC
245 points
19 comments, 11 min read, LW link

Debunking claims about subquadratic attention

Vladimir Ivanov, 2 Jan 2026 4:23 UTC
32 points
5 comments, 3 min read, LW link

The bio-pirate’s guide to GLP-1 agonists

quiet_NaN, 2 Jan 2026 3:32 UTC
40 points
3 comments, 5 min read, LW link

College Was Not That Terrible Now That I’m Not That Crazy

Zack_M_Davis, 1 Jan 2026 23:14 UTC
90 points
9 comments, 44 min read, LW link
(zackmdavis.net)

Taiwan war timelines might be shorter than AI timelines

Baram Sosis, 1 Jan 2026 22:30 UTC
108 points
21 comments, 5 min read, LW link

Split (Part 1)

Shoshannah Tekofsky, 1 Jan 2026 22:29 UTC
27 points
2 comments, 4 min read, LW link
(shoshanigans.substack.com)

[Question] Who is responsible for shutting down rogue AI?

Cole Wyeth, 1 Jan 2026 21:36 UTC
45 points
2 comments, 1 min read, LW link

$500 Write like lsusr competition—Results

lsusr, 1 Jan 2026 20:53 UTC
40 points
4 comments, 3 min read, LW link

Overwhelming Superintelligence

Raemon, 1 Jan 2026 20:51 UTC
80 points
30 comments, 1 min read, LW link

Reducing MDMA neurotoxicity

Pjain, 1 Jan 2026 20:13 UTC
5 points
0 comments, 12 min read, LW link

Is it possible to prevent AGI?

jrincayc, 1 Jan 2026 19:15 UTC
12 points
1 comment, 2 min read, LW link

Principled Interpretability of Reward Hacking in Closed Frontier Models

1 Jan 2026 16:37 UTC
24 points
0 comments, 23 min read, LW link

AI #149: 3

Zvi, 1 Jan 2026 15:40 UTC
39 points
7 comments, 23 min read, LW link
(thezvi.wordpress.com)

ML Engineer—MIT AI Risk Initiative, Contractor, Part-time, 6-months

peterslattery, 1 Jan 2026 14:23 UTC
4 points
0 comments, 1 min read, LW link

Recent LLMs can do 2-hop and 3-hop latent (no-CoT) reasoning on natural facts

ryan_greenblatt, 1 Jan 2026 13:36 UTC
129 points
11 comments, 3 min read, LW link

AGI and the structural foundations of democracy and the rule-based international order

PabloAMC, 1 Jan 2026 12:07 UTC
21 points
0 comments, 10 min read, LW link
(pabloamc.substack.com)

From Drift to Snap: Instruction Violation as a Phase Transition

James Hoffend, 1 Jan 2026 10:44 UTC
8 points
0 comments, 3 min read, LW link

Quick polls on AGI doom

denkenberger, 1 Jan 2026 6:23 UTC
2 points
0 comments, 1 min read, LW link
(forum.effectivealtruism.org)

Special Persona Training: Hyperstition Progress Report 2

jayterwahl, 1 Jan 2026 1:34 UTC
38 points
2 comments, 2 min read, LW link

You will be OK

Boaz Barak, 1 Jan 2026 0:33 UTC
57 points
57 comments, 4 min read, LW link

Speciesquest 2026

eukaryote, 31 Dec 2025 23:24 UTC
27 points
3 comments, 5 min read, LW link
(eukaryotewritesblog.com)

[Question] How Should Political Situations Be Classified In Order To Pick The Locally Best Voting System For Each Situation?

JenniferRM, 31 Dec 2025 22:49 UTC
19 points
7 comments, 6 min read, LW link

AI Futures Timelines and Takeoff Model: Dec 2025 Update

31 Dec 2025 22:34 UTC
147 points
34 comments, 25 min read, LW link

What drives LLM bail? A small Mech Interp study

Anton de la Fuente, 31 Dec 2025 21:19 UTC
8 points
0 comments, 6 min read, LW link

Lumenator 2.0

Keri Warr, 31 Dec 2025 20:48 UTC
36 points
5 comments, 3 min read, LW link
(keri.warr.ca)

[Question] Is intelligent induction even possible?

PickleBrine, 31 Dec 2025 20:11 UTC
6 points
2 comments, 1 min read, LW link

The Plan − 2025 Update

31 Dec 2025 20:10 UTC
96 points
21 comments, 7 min read, LW link

Safety Net When AIs Take Our Jobs

PeterMcCluskey, 31 Dec 2025 20:05 UTC
16 points
0 comments, 2 min read, LW link
(bayesianinvestor.com)

2025 Year in Review

Zvi, 31 Dec 2025 19:50 UTC
57 points
4 comments, 14 min read, LW link
(thezvi.wordpress.com)

The Essentialism of Lesswrong

milanrosko, 31 Dec 2025 17:34 UTC
−45 points
6 comments, 1 min read, LW link

Uncertain Updates: December 2025

Gordon Seidoh Worley, 31 Dec 2025 16:20 UTC
10 points
0 comments, 1 min read, LW link
(www.uncertainupdates.com)

Halfhaven Forever

Viliam, 31 Dec 2025 15:59 UTC
23 points
4 comments, 4 min read, LW link

Grading my 2022 predictions for 2025

Yitz, 31 Dec 2025 15:45 UTC
62 points
9 comments, 9 min read, LW link

My 2025 in review

jasoncrawford, 31 Dec 2025 14:46 UTC
12 points
0 comments, 5 min read, LW link
(newsletter.rootsofprogress.org)