Why I’m Wor­ried About Job Loss + Thoughts on Com­par­a­tive Advantage

claywren13 Feb 2026 23:36 UTC
62 points
5 comments11 min readLW link

METR Time Hori­zons: Now 10x/​Year

johncrox13 Feb 2026 23:01 UTC
28 points
6 comments3 min readLW link

Use more text than one to­ken to avoid neuralese

Jude Stiel13 Feb 2026 21:09 UTC
10 points
4 comments1 min readLW link

Hazards of Selec­tion Effects on Ap­proved Information

Zack_M_Davis13 Feb 2026 18:51 UTC
56 points
11 comments12 min readLW link
(zackmdavis.net)

OpenClaw Newsletter

Jacobson13 Feb 2026 17:59 UTC
2 points
1 comment5 min readLW link

ChatGPT-5.3-Codex Is Also Good At Coding

Zvi13 Feb 2026 16:20 UTC
45 points
2 comments20 min readLW link
(thezvi.wordpress.com)

Repli­ca­tion of Koorndijk (2025): Differ­en­tial Com­pli­ance May Reflect Prompt Sen­si­tivity Rather Than Strate­gic Reasoning

13 Feb 2026 16:12 UTC
9 points
0 comments8 min readLW link

Towards an ob­jec­tive test of Com­pas­sion—Turn­ing an ab­stract test into a col­lec­tion of nuances

tailcalled13 Feb 2026 15:03 UTC
12 points
0 comments7 min readLW link

(Up­dated) METR’s data can’t dis­t­in­guish be­tween tra­jec­to­ries (and 80% hori­zons are an or­der of mag­ni­tude off)

Jonas Moss13 Feb 2026 14:05 UTC
28 points
10 comments10 min readLW link

We Die Be­cause it’s a Com­pu­ta­tional Necessity

E.G. Blee-Goldman13 Feb 2026 13:16 UTC
2 points
3 comments22 min readLW link

Hazardous States and Accidents

kqr13 Feb 2026 13:02 UTC
4 points
0 comments4 min readLW link
(entropicthoughts.com)

Sys­temic Risks and Where to Find Them

Jonas Hallgren13 Feb 2026 10:51 UTC
14 points
0 comments20 min readLW link
(equilibria1.substack.com)

Nick Bostrom: Op­ti­mal Timing for Superintelligence

Julian Bradshaw13 Feb 2026 7:33 UTC
8 points
3 comments2 min readLW link
(nickbostrom.com)

Why You Don’t Believe in Xhosa Prophecies

Jan_Kulveit13 Feb 2026 2:25 UTC
265 points
28 comments4 min readLW link

Gem­ini’s Hy­po­thet­i­cal Present

jefftk13 Feb 2026 2:20 UTC
101 points
9 comments2 min readLW link
(www.jefftk.com)

I Tried to Trick My­self into Be­ing a Bet­ter Plan­ner & Prob­lem Solver

CstineSublime13 Feb 2026 0:25 UTC
7 points
2 comments3 min readLW link

Grad­ing AI 2027′s 2025 Predictions

13 Feb 2026 0:18 UTC
64 points
4 comments9 min readLW link
(blog.ai-futures.org)

Long-term risks from ide­olog­i­cal fanaticism

12 Feb 2026 23:26 UTC
99 points
12 comments84 min readLW link

(Re)Dis­cov­er­ing Nat­u­ral Laws

Margot12 Feb 2026 21:45 UTC
13 points
0 comments17 min readLW link

An On­tol­ogy of Rep­re­sen­ta­tions: Limits of Universality

Margot12 Feb 2026 21:43 UTC
23 points
1 comment39 min readLW link

A Closer Look at the “So­cieties of Thought” Paper

Against Moloch12 Feb 2026 21:38 UTC
10 points
0 comments3 min readLW link
(againstmoloch.com)

mod­els have some pretty funny at­trac­tor states

12 Feb 2026 21:14 UTC
275 points
38 comments18 min readLW link

Stay in your hu­man loop

benjamin ar12 Feb 2026 21:05 UTC
22 points
0 comments5 min readLW link
(bjar.substack.com)

The case for in­dus­trial evals

12 Feb 2026 20:45 UTC
16 points
0 comments23 min readLW link

Mul­ti­verse sam­pling assumption

avturchin12 Feb 2026 19:59 UTC
12 points
0 comments5 min readLW link

What We Learned from Briefing 140+ Law­mak­ers on the Threat from AI

leticiagarcia12 Feb 2026 19:53 UTC
174 points
7 comments14 min readLW link
(substack.com)

Paper: Prompt Op­ti­miza­tion Makes Misal­ign­ment Legible

12 Feb 2026 19:45 UTC
63 points
8 comments8 min readLW link

Claude’s Constitution

PeterMcCluskey12 Feb 2026 19:44 UTC
15 points
4 comments6 min readLW link
(bayesianinvestor.com)

Hu­man-like metacog­ni­tive skills will re­duce LLM slop and aid al­ign­ment and capabilities

Seth Herd12 Feb 2026 19:38 UTC
48 points
16 comments18 min readLW link

Good AI Epistemics as an Offramp from the In­tel­li­gence Explosion

Ben Goldhaber12 Feb 2026 19:18 UTC
23 points
2 comments3 min readLW link

How Se­cret Loy­alty Differs from Stan­dard Back­door Threats

Joe Kwon12 Feb 2026 18:48 UTC
23 points
4 comments12 min readLW link

You get about.… how many words ex­actly?

Raemon12 Feb 2026 18:06 UTC
21 points
1 comment7 min readLW link

Ba­sic Leg­i­bil­ity Pro­to­cols Im­prove Trusted Monitoring

12 Feb 2026 17:50 UTC
8 points
4 comments11 min readLW link

A re­search agenda for the fi­nal year

Mitchell_Porter12 Feb 2026 17:24 UTC
13 points
22 comments3 min readLW link

Poly­se­man­tic­ity is a Misnomer

Shiva's Right Foot12 Feb 2026 17:22 UTC
11 points
0 comments3 min readLW link

Op­ti­mal Timing for Su­per­in­tel­li­gence: Mun­dane Con­sid­er­a­tions for Ex­ist­ing People

Nick Bostrom12 Feb 2026 17:06 UTC
49 points
89 comments72 min readLW link

How do we (more) safely defer to AIs?

12 Feb 2026 16:55 UTC
83 points
5 comments72 min readLW link

A Con­cep­tual Frame­work for Ex­plo­ra­tion Hacking

12 Feb 2026 16:33 UTC
26 points
2 comments9 min readLW link

AI #155: Wel­come to Re­cur­sive Self-Improvement

Zvi12 Feb 2026 16:10 UTC
52 points
5 comments56 min readLW link
(thezvi.wordpress.com)

The Fa­cade of AI Safety Will Crumble

Liron12 Feb 2026 15:57 UTC
36 points
11 comments4 min readLW link
(doomdebates.com)

The his­tory of light

Kotlopou12 Feb 2026 14:16 UTC
16 points
0 comments1 min readLW link
(beatingthehydra.substack.com)

Three Wor­lds Col­lide as­sumes cal­ibra­tion is solved

Vyacheslav Ladischenski (Slava)12 Feb 2026 4:28 UTC
7 points
1 comment3 min readLW link

Re­search note: A sim­pler AI timelines model pre­dicts 99% AI R&D au­toma­tion in ~2032

Thomas Kwa12 Feb 2026 0:13 UTC
69 points
15 comments8 min readLW link
(metr.org)

Time­less Engineering

Jack Bradshaw11 Feb 2026 23:53 UTC
−14 points
0 comments5 min readLW link

[Paper] How does in­for­ma­tion ac­cess af­fect LLM mon­i­tors’ abil­ity to de­tect sab­o­tage?

11 Feb 2026 21:25 UTC
26 points
0 comments6 min readLW link

Claude Opus 4.6 Es­ca­lates Things Quickly

Zvi11 Feb 2026 21:20 UTC
51 points
0 comments34 min readLW link
(thezvi.wordpress.com)

Where Will Call Cen­ter Work­ers Go?

loic11 Feb 2026 20:44 UTC
19 points
2 comments4 min readLW link

Dist­in­guish be­tween in­fer­ence scal­ing and “larger tasks use more com­pute”

ryan_greenblatt11 Feb 2026 18:37 UTC
87 points
5 comments2 min readLW link

Mon­i­tor Jailbreak­ing: Evad­ing Chain-of-Thought Mon­i­tor­ing Without En­coded Reasoning

Wuschel Schulz11 Feb 2026 17:18 UTC
61 points
17 comments5 min readLW link

[Hiring] Prin­cipia Re­search Fellows

11 Feb 2026 16:30 UTC
35 points
1 comment3 min readLW link