Fairly Break­ing Ties Without Fair Coins

Brendan Long11 Nov 2025 21:48 UTC
11 points
10 comments4 min readLW link
(www.brendanlong.com)

Kimi K2 Thinking

Zvi11 Nov 2025 21:10 UTC
47 points
0 comments5 min readLW link
(thezvi.wordpress.com)

Not-A-Book Re­view: The At­trac­tive Man (Dat­ing Coach Ser­vice)

25Hour11 Nov 2025 20:03 UTC
15 points
0 comments1 min readLW link
(lifeimprovementschemes.substack.com)

Don’t Get One-Shotted

Jordan Rubin11 Nov 2025 17:07 UTC
2 points
2 comments6 min readLW link
(jordanmrubin.substack.com)

Learn­ings from the Zurich AI Safety Day

11 Nov 2025 17:00 UTC
13 points
0 comments6 min readLW link

Steer­ing Lan­guage Models with Weight Arithmetic

11 Nov 2025 16:30 UTC
88 points
6 comments5 min readLW link

An­nounc­ing the So­ciety of Teen Scientists

rogersbacon11 Nov 2025 16:08 UTC
8 points
0 comments1 min readLW link

What is Hap­pen­ing in AI Gover­nance?

11 Nov 2025 15:59 UTC
6 points
0 comments5 min readLW link

Hu­man Agency at Stake

11 Nov 2025 15:57 UTC
8 points
0 comments6 min readLW link

Om­ni­science one bit at a time: Chap­ter 3

Dentosal11 Nov 2025 13:34 UTC
2 points
0 comments2 min readLW link

Evolu­tion’s Align­ment Solu­tion: Why Burnout Prevents Monsters

Elias_Kunnas11 Nov 2025 13:32 UTC
9 points
0 comments6 min readLW link

Thick prac­tices for AI tools

Alexandre Variengien11 Nov 2025 13:13 UTC
19 points
2 comments20 min readLW link
(alexandrevariengien.com)

The prob­lem of grace­ful deference

TsviBT11 Nov 2025 8:17 UTC
108 points
41 comments4 min readLW link

See Your Word Count While You Write

dreeves11 Nov 2025 8:02 UTC
7 points
3 comments2 min readLW link

On Stance

Screwtape11 Nov 2025 7:50 UTC
24 points
5 comments6 min readLW link

Break­ing the He­donic Rub­ber Band

Ben Pace11 Nov 2025 7:00 UTC
20 points
4 comments4 min readLW link

Re­ject­ing “Good­ness” Does Not Mean Ham­mer­ing The Defect Button

johnswentworth11 Nov 2025 6:50 UTC
25 points
6 comments2 min readLW link

Strength­en­ing Red Teams: A Mo­du­lar Scaf­fold for Con­trol Evaluations

Chloe Loughridge11 Nov 2025 6:20 UTC
7 points
0 comments1 min readLW link
(alignment.anthropic.com)

On the Nor­ma­tivity of De­bate: A Dis­cus­sion With Said Achmiz

Zack_M_Davis11 Nov 2025 5:49 UTC
21 points
1 comment22 min readLW link

Ques­tion the Requirements

habryka11 Nov 2025 5:25 UTC
95 points
12 comments3 min readLW link

France is ready to stand alone

Lucie Philippon11 Nov 2025 5:09 UTC
32 points
6 comments2 min readLW link
(aelerinya.substack.com)

Love is Willing­ness to do Violence

Eneasz11 Nov 2025 5:09 UTC
16 points
9 comments2 min readLW link
(deathisbad.substack.com)

Don’t can­cel out your re­wards!

Sneha Bangalore11 Nov 2025 5:04 UTC
1 point
0 comments15 min readLW link

Turn­ing Grey

Taylor G. Lunt11 Nov 2025 4:40 UTC
8 points
0 comments11 min readLW link

The AI bub­ble cov­ered in the Atlantic

Remmelt11 Nov 2025 4:11 UTC
4 points
0 comments2 min readLW link
(www.theatlantic.com)

A Sim­ple Sing-along Solstice

maia11 Nov 2025 2:49 UTC
34 points
3 comments1 min readLW link
(tigrennatenn.neocities.org)

Univer­sal Ba­sic In­come in an AGI Future

Simon Lermen11 Nov 2025 2:26 UTC
21 points
1 comment2 min readLW link
(simonlermen.substack.com)

Ternary plots are underrated

Adam Scherlis11 Nov 2025 2:19 UTC
17 points
1 comment3 min readLW link
(adam.scherl.is)

How likely is dan­ger­ous AI in the short term?

Nikola Jurkovic11 Nov 2025 2:14 UTC
26 points
3 comments4 min readLW link
(nikolajurkovic.substack.com)

On model weight preser­va­tion: An­thropic’s new initiative

Olle Häggström11 Nov 2025 1:12 UTC
16 points
2 comments1 min readLW link
(haggstrom.substack.com)

Pause from Be­hind /​ Los­ing Heroically

enterthewoods11 Nov 2025 1:12 UTC
0 points
0 comments5 min readLW link

[Linkpost] Galaxy brain resistance

derikk11 Nov 2025 0:43 UTC
4 points
0 comments1 min readLW link
(vitalik.eth.limo)

A pen­cil is not a pen­cil is not a pencil

Algon10 Nov 2025 23:59 UTC
18 points
4 comments2 min readLW link

The Open Strat­egy Dic­ta­tor Game: An Ex­per­i­ment in Trans­par­ent Cooperation

Michael Glass10 Nov 2025 23:26 UTC
13 points
2 comments1 min readLW link

DC/​Mary­land Sec­u­lar Solstice

maia10 Nov 2025 23:25 UTC
13 points
2 comments1 min readLW link

What I learned build­ing a lan­guage-learn­ing app

depressurize10 Nov 2025 21:04 UTC
5 points
0 comments10 min readLW link
(chadnauseam.com)

An­drej Karpa­thy on LLM cog­ni­tive deficits

Nina Panickssery10 Nov 2025 21:02 UTC
45 points
3 comments5 min readLW link
(www.dwarkesh.com)

Con­scious­ness as a Distributed Ponzi Scheme

abramdemski10 Nov 2025 20:18 UTC
34 points
11 comments4 min readLW link

Maat—In­tro Post

TristanTrim10 Nov 2025 20:09 UTC
3 points
0 comments1 min readLW link

Var­i­ously Effec­tive Altruism

Zvi10 Nov 2025 19:21 UTC
14 points
3 comments8 min readLW link
(thezvi.wordpress.com)

Why does ev­ery­thing feel so ur­gent?

mingyuan10 Nov 2025 18:11 UTC
19 points
8 comments3 min readLW link
(mingyuan.substack.com)

Om­ni­science one bit at a time: Chap­ter 2

Dentosal10 Nov 2025 15:47 UTC
4 points
0 comments2 min readLW link

So­cial drives 1: “Sym­pa­thy Re­ward”, from com­pas­sion to dehumanization

Steven Byrnes10 Nov 2025 14:53 UTC
36 points
7 comments13 min readLW link

On­tol­ogy for AI Cults and Cy­borg Egregores

Jan_Kulveit10 Nov 2025 13:19 UTC
65 points
14 comments2 min readLW link

From Vi­talik: Galaxy brain resistance

Gabriel Alfour10 Nov 2025 13:06 UTC
115 points
2 comments1 min readLW link
(vitalik.eth.limo)

The jailbreak ar­gu­ment against LLM values

technicalities10 Nov 2025 12:05 UTC
25 points
2 comments6 min readLW link

The grapefruit juice effect

Adam Scherlis10 Nov 2025 8:49 UTC
38 points
1 comment5 min readLW link
(adam.scherl.is)

Against Pow­er­ful Text Editors

dreeves10 Nov 2025 8:11 UTC
10 points
11 comments2 min readLW link

MtG Colour Wheel ap­plied to Politics

samuelshadrach10 Nov 2025 5:05 UTC
−5 points
6 comments6 min readLW link
(samuelshadrach.com)

The only im­por­tant ASI timeline

beyarkay (Boyd Kane)10 Nov 2025 4:53 UTC
2 points
4 comments1 min readLW link
(boydkane.com)