Taiwan Trip Report

nomagicpill · 7 Jan 2026 23:40 UTC
11 points
0 comments · 9 min read · LW link
(nomagicpill.substack.com)

Public intellectuals need to say what they actually believe

Aaron Bergman · 7 Jan 2026 21:22 UTC
79 points
12 comments · 14 min read · LW link
(www.aaronbergman.net)

Two Aspects of Situational Awareness: World Modelling & Indexical Information

David Scott Krueger · 7 Jan 2026 20:24 UTC
40 points
7 comments · 2 min read · LW link

Advancements In Self-Driving Cars

Zvi · 7 Jan 2026 19:50 UTC
30 points
2 comments · 17 min read · LW link
(thezvi.wordpress.com)

Two ways non-U.S. folks can contribute to AI going well

Joe Rogero · 7 Jan 2026 19:37 UTC
21 points
1 comment · 2 min read · LW link
(subatomicarticles.com)

Everything is Political Now, or, A Review of “Fraggle Rock: Back to the Rock”

Gordon Seidoh Worley · 7 Jan 2026 17:00 UTC
13 points
0 comments · 8 min read · LW link
(www.uncertainupdates.com)

FirstPrinciples Talks: Quantum machines learning quantum

Carly Turini · 7 Jan 2026 16:44 UTC
3 points
0 comments · 1 min read · LW link

Does mindfulness meditation lead to awakening?

Vadim Golub · 7 Jan 2026 14:08 UTC
16 points
0 comments · 4 min read · LW link

An interactive toy model for exploring AI’s effect on the labour market

CharlesD · 7 Jan 2026 12:57 UTC
12 points
0 comments · 7 min read · LW link

OpenForecaster: How to train language models for open-ended forecasting?

7 Jan 2026 11:03 UTC
10 points
1 comment · 7 min read · LW link

ML research directions for preventing catastrophic data poisoning

Tom Davidson · 7 Jan 2026 10:16 UTC
35 points
1 comment · 10 min read · LW link
(newsletter.forethought.org)

A Loser’s Reflections

L.M.Sherlock · 7 Jan 2026 7:15 UTC
9 points
12 comments · 18 min read · LW link
(lmsherlock.substack.com)

Algorithmic Dating

denzit · 7 Jan 2026 2:39 UTC
−2 points
0 comments · 3 min read · LW link
(denzit.substack.com)

Simple summary of AI Safety laws

7 Jan 2026 1:51 UTC
46 points
4 comments · 3 min read · LW link

Results: A self-randomized study of the impacts of glycine on sleep (Science is still hard)

thedissonance.net · 6 Jan 2026 20:54 UTC
6 points
1 comment · 3 min read · LW link
(thedissonance.net)

My 2003 Post on the Evolutionary Argument for AI Misalignment

Wei Dai · 6 Jan 2026 20:45 UTC
37 points
7 comments · 2 min read · LW link

Mainstream approach for alignment evals is a dead end

Igor Ivanov · 6 Jan 2026 19:52 UTC
60 points
9 comments · 5 min read · LW link

Fertility Roundup #6: The Art of More Dakka

Zvi · 6 Jan 2026 19:50 UTC
32 points
5 comments · 26 min read · LW link
(thezvi.wordpress.com)

On Owning Galaxies

Simon Lermen · 6 Jan 2026 18:16 UTC
154 points
62 comments · 3 min read · LW link
(simonlermen.substack.com)

How hard is it to inoculate against misalignment generalization?

Jozdien · 6 Jan 2026 17:30 UTC
46 points
4 comments · 14 min read · LW link

How AI Is Learning to Think in Secret

Nicholas Andresen · 6 Jan 2026 16:31 UTC
382 points
32 comments · 18 min read · LW link
(nickandresen.substack.com)

Should you be posting on the open internet

zef · 6 Jan 2026 15:50 UTC
22 points
9 comments · 2 min read · LW link

Catching misreporting about ML hardware use by turning noise into signal - Part II

Naci Cankaya · 6 Jan 2026 12:38 UTC
8 points
0 comments · 1 min read · LW link
(nacicankaya.substack.com)

Meditations on Moloch in the AI Rat Race

Alexander Müller · 6 Jan 2026 9:46 UTC
11 points
1 comment · 6 min read · LW link

[Question] Is anyone doing a real-world test of agentic misalignment?

Jamie Milton Freestone · 6 Jan 2026 7:45 UTC
2 points
1 comment · 1 min read · LW link

Do we need sparsity afterall?

Giuseppe Birardi · 6 Jan 2026 6:06 UTC
20 points
5 comments · 29 min read · LW link

Exploring Reinforcement Learning Effects on Chain-of-Thought Legibility

6 Jan 2026 3:04 UTC
41 points
3 comments · 21 min read · LW link

The Evolution Argument Sucks

peralice · 6 Jan 2026 2:32 UTC
30 points
6 comments · 8 min read · LW link

Festival Stats 2025

jefftk · 6 Jan 2026 1:40 UTC
10 points
1 comment · 1 min read · LW link
(www.jefftk.com)

Oversight Assistants: Turning Compute into Understanding

jsteinhardt · 6 Jan 2026 0:50 UTC
85 points
7 comments · 9 min read · LW link
(bounded-regret.ghost.io)

Aether is hiring technical AI safety researchers

5 Jan 2026 22:27 UTC
22 points
0 comments · 2 min read · LW link

[Question] Continual Learning Achieved?

PeterMcCluskey · 5 Jan 2026 22:22 UTC
−7 points
11 comments · 1 min read · LW link

AGI will not be one specific system, it’ll be the unity of all systems

henophilia · 5 Jan 2026 18:21 UTC
−4 points
0 comments · 11 min read · LW link

How to tame a complex system

jasoncrawford · 5 Jan 2026 18:20 UTC
27 points
0 comments · 2 min read · LW link
(newsletter.rootsofprogress.org)

Broadening the training set for alignment

Seth Herd · 5 Jan 2026 17:30 UTC
40 points
11 comments · 9 min read · LW link

Dos Capital

Zvi · 5 Jan 2026 16:40 UTC
71 points
10 comments · 17 min read · LW link
(thezvi.wordpress.com)

Announcing the CLR Fundamentals Program

Tristan Cook · 5 Jan 2026 15:16 UTC
12 points
0 comments · 2 min read · LW link

AI Risk timelines: 10% chance (by year X) should be the headline (and deadline), not 50%. And 10% is _this year_!

Greg C · 5 Jan 2026 11:57 UTC
61 points
18 comments · 1 min read · LW link

Transformers, Intuitively

atharva · 5 Jan 2026 11:34 UTC
5 points
0 comments · 4 min read · LW link

The Technology of Liberalism

L Rudolf L · 5 Jan 2026 11:04 UTC
41 points
7 comments · 29 min read · LW link
(www.nosetgauge.com)

Axiological Stopsigns

JenniferRM · 5 Jan 2026 7:30 UTC
34 points
6 comments · 16 min read · LW link

Artifical Expert/Expanded Narrow Intelligence, and Proto-AGI

Yuli_Ban · 5 Jan 2026 3:40 UTC
15 points
0 comments · 7 min read · LW link

An Aphoristic Overview of Technical AI Alignment proposals

wassname · 5 Jan 2026 3:01 UTC
11 points
3 comments · 2 min read · LW link

Claude Wrote Me a 400-Commit RSS Reader App

Brendan Long · 5 Jan 2026 2:52 UTC
35 points
11 comments · 3 min read · LW link
(www.brendanlong.com)

The inaugural Redwood Research podcast

4 Jan 2026 22:11 UTC
146 points
10 comments · 142 min read · LW link

LessOnline 2026 Improvement Ideas

nomagicpill · 4 Jan 2026 21:56 UTC
16 points
0 comments · 1 min read · LW link

The economy is a graph, not a pipeline

anithite · 4 Jan 2026 21:48 UTC
33 points
10 comments · 4 min read · LW link

Calling all college students (and new readers)

neo · 4 Jan 2026 21:20 UTC
15 points
0 comments · 1 min read · LW link

Rock bottom terminal value

ihatenumbersinusernames7 · 4 Jan 2026 20:43 UTC
4 points
9 comments · 2 min read · LW link

In My Misanthropy Era

jenn · 4 Jan 2026 18:34 UTC
352 points
153 comments · 8 min read · LW link
(jenn.site)