All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 2024 20252026

All JanFebMar Apr May Jun

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 192021 22 23 24 25 26 27 28

Cooperationism: first draft for a moral framework that does not require consciousness

Épiphanie Gédéon19 Feb 2026 21:07 UTC

26 points

5 comments8 min readLW link

Flamingos (among other things) reduce emergent misalignment

eekay19 Feb 2026 19:17 UTC

13 points

3 comments7 min readLW link

Funkering!

flying buttress19 Feb 2026 18:14 UTC

13 points

0 comments1 min readLW link

Subjectivity vs Agency: AI “Waking Up”?

Jonathan Moregård19 Feb 2026 17:19 UTC

4 points

0 comments5 min readLW link

(honestliving.substack.com)

You May Already Be Canadian

jefftk19 Feb 2026 16:00 UTC

120 points

14 comments1 min readLW link

(www.jefftk.com)

AI Researchers and Executives Continue to Underestimate the Near-Future Risks of Open Models

Andrew Dickson19 Feb 2026 15:56 UTC

23 points

1 comment16 min readLW link

AI #156 Part 1: They Do Mean The Effect On Jobs

Zvi19 Feb 2026 14:20 UTC

53 points

7 comments36 min readLW link

(thezvi.wordpress.com)

Terminal Cynicism

PranavG and Gabriel Alfour

19 Feb 2026 13:51 UTC

24 points

25 comments10 min readLW link

(cognition.cafe)

How much information does an optimal policy contain about its environment?

Alfred Harwood, Alex_Altair and JoseFaustino

19 Feb 2026 13:05 UTC

30 points

0 comments10 min readLW link

All hands on deck to build the datacenter lie detector

Naci Cankaya19 Feb 2026 11:42 UTC

32 points

2 comments5 min readLW link

(open.substack.com)

A Technical Primer on Mechanistic Interpretability

Alexei G19 Feb 2026 7:42 UTC

1 point

0 comments11 min readLW link

(alexeigannon.com)

Power Laws Are Not Enough

CarolusRenniusVitellius19 Feb 2026 4:31 UTC

10 points

3 comments4 min readLW link

(charlesr-w.github.io)

Be skeptical of milestone announcements by young AI startups

lc19 Feb 2026 4:19 UTC

25 points

0 comments3 min readLW link

Opus 4.5 made a biodevice (w me)

Raye19 Feb 2026 2:31 UTC

23 points

0 comments10 min readLW link

Review of If Anyone Builds It, Everyone Dies

James Brobin19 Feb 2026 1:53 UTC

23 points

4 comments5 min readLW link

I want to actually get good at forecasting this year (Group Invite)

Vojtech Brynych19 Feb 2026 1:41 UTC

12 points

4 comments1 min readLW link

Does GPT-2 Represent Controversy? A Small Mech Interp Investigation

CharlesL19 Feb 2026 1:36 UTC

6 points

0 comments2 min readLW link

Emotional Dispersion and Patience

Astrid Callender19 Feb 2026 1:35 UTC

6 points

5 comments4 min readLW link

What AI-safely topics are missing from the mainstream media? What underreported but underestimated issues need to be addressed? This is your chance to collaborate with filmmakers & have your worries addressed.

Max Hellier19 Feb 2026 1:30 UTC

2 points

0 comments1 min readLW link

Manifold spin off MNX, a real money decentralized market for AI-related bets. Includes levered prediction markets, perpetual futures

mako yass18 Feb 2026 22:36 UTC

10 points

3 comments1 min readLW link

(x.com)

AI and Nationalism Are a Deadly Combination

Matrice Jacobine18 Feb 2026 21:46 UTC

11 points

0 comments4 min readLW link

(www.currentaffairs.org)

Todd, Ord, Galef, Yudkowsky: German Podcast Sums Up EA/LW Books

jorges18 Feb 2026 21:44 UTC

7 points

0 comments1 min readLW link

The near-term potential of AI forecasting for public epistemics

Lawrence Phillips18 Feb 2026 20:37 UTC

21 points

0 comments16 min readLW link

Monthly Roundup #39: February 2026

Zvi18 Feb 2026 20:30 UTC

32 points

5 comments40 min readLW link

(thezvi.wordpress.com)

How to Reset

Logan Riggs18 Feb 2026 19:49 UTC

10 points

2 comments2 min readLW link

Karl Popper, meet the Hydra

Kotlopou18 Feb 2026 18:55 UTC

14 points

4 comments21 min readLW link

(beatingthehydra.substack.com)

Altruism Survey

ozymandias18 Feb 2026 18:40 UTC

9 points

0 comments1 min readLW link

Building Technology to Drive AI Governance

jsteinhardt18 Feb 2026 18:30 UTC

59 points

4 comments10 min readLW link

(bounded-regret.ghost.io)

Alignment Is Proven Tractable

SE Gyges18 Feb 2026 17:55 UTC

10 points

0 comments10 min readLW link

(www.verysane.ai)

Why we should expect ruthless sociopath ASI

Steven Byrnes18 Feb 2026 17:28 UTC

163 points

63 comments8 min readLW link

Is the Invisible Hand an Agent?

Gunnar_Zarncke18 Feb 2026 16:26 UTC

13 points

4 comments4 min readLW link

(substack.com)

Nine Flavors of Not Enough

Gordon Seidoh Worley18 Feb 2026 15:10 UTC

13 points

0 comments6 min readLW link

(www.uncertainupdates.com)

Grown from Us

ben_levinstein18 Feb 2026 14:57 UTC

10 points

0 comments2 min readLW link

How much superposition is there?

chanind and Adrià Garriga-alonso

18 Feb 2026 13:53 UTC

25 points

0 comments3 min readLW link

Irrationality is Socially Strategic

Valentine18 Feb 2026 13:28 UTC

119 points

18 comments13 min readLW link

Announcement: Technical AI Safety Evals Course

meriton, Alexandra R, July Kim, Bogoed, Alex the L and CommissarNeutrino

18 Feb 2026 13:24 UTC

7 points

0 comments1 min readLW link

Managed vs Unmanaged Agency

plex18 Feb 2026 13:23 UTC

52 points

23 comments3 min readLW link

Genomic emancipation contra eugenics

TsviBT18 Feb 2026 10:35 UTC

56 points

8 comments51 min readLW link

Already Optimized

Florian_Dietz18 Feb 2026 10:01 UTC

52 points

14 comments14 min readLW link

Statistical Literacy

kqr18 Feb 2026 6:50 UTC

0 points

2 comments8 min readLW link

(entropicthoughts.com)

AXRP Episode 49 - Caspar Oesterheld on Program Equilibrium

DanielFilan18 Feb 2026 1:30 UTC

10 points

1 comment72 min readLW link

Thoughts about Understanding

azergante18 Feb 2026 0:19 UTC

4 points

1 comment5 min readLW link

Monday AI Radar #13

Against Moloch18 Feb 2026 0:13 UTC

9 points

0 comments8 min readLW link

(againstmoloch.com)

Deception Channeling: Training Models to Always Verbalize Alignment Faking

Florian_Dietz17 Feb 2026 22:28 UTC

7 points

2 comments9 min readLW link

Rephrasing Reduces Eval Awareness...

atharva17 Feb 2026 22:23 UTC

23 points

4 comments3 min readLW link

The Math And The Territory

cylonator17 Feb 2026 21:53 UTC

2 points

0 comments8 min readLW link

Words are not dead

William tirkey17 Feb 2026 21:42 UTC

−2 points

2 comments5 min readLW link

Review of the System Theory as a Field of Knowledge

siarshai17 Feb 2026 21:34 UTC

4 points

1 comment18 min readLW link

You’re an AI Expert – Not an Influencer

Max Winga17 Feb 2026 21:03 UTC

180 points

25 comments6 min readLW link

(maxwinga.substack.com)

“We are confused about agency”

Cole Wyeth17 Feb 2026 19:51 UTC

57 points

37 comments3 min readLW link