Positional kernels of attention heads

Alex Gibson · 10 Mar 2025 23:17 UTC

10 points

0 comments · 4 min read · LW link

Progress links and short notes, 2025-03-10

jasoncrawford · 10 Mar 2025 20:27 UTC

8 points

0 comments · 4 min read · LW link

(newsletter.rootsofprogress.org)

The Manus Marketing Madness

Zvi · 10 Mar 2025 20:10 UTC

54 points

0 comments · 24 min read · LW link

(thezvi.wordpress.com)

You can just play

aswath krishnan · 10 Mar 2025 20:00 UTC

−5 points

0 comments · 2 min read · LW link

How to Use Prompt Engineering to Rewire Your Brain

aswath krishnan · 10 Mar 2025 20:00 UTC

1 point

0 comments · 5 min read · LW link

(www.aswathkrishnan.com)

When Independent Optimization Is Worse Than Randomness

Chaotic rationalist · 10 Mar 2025 19:46 UTC

−4 points

0 comments · 2 min read · LW link

Stress exists only where the Mind makes it

Noahh · 10 Mar 2025 19:44 UTC

5 points

2 comments · 4 min read · LW link

Counterargument to Godel’s Modal Ontological Argument

Wynn · 10 Mar 2025 19:38 UTC

−1 points

0 comments · 4 min read · LW link

[Question] How much do frontier LLMs code and browse while in training?

Joe Rogero · 10 Mar 2025 19:34 UTC

7 points

0 comments · 1 min read · LW link

Observations on self-supervised Learning for vision

Dinkar Juyal · 10 Mar 2025 19:31 UTC

3 points

0 comments · 5 min read · LW link

Introducing 11 New AI Safety Organizations—Catalyze’s Winter 24/25 London Incubation Program Cohort

Alexandra Bos · 10 Mar 2025 19:26 UTC

75 points

0 comments · 14 min read · LW link

The Jackpot Jinx (or why “Superintelligence Strategy” is wrong)

E.G. Blee-Goldman · 10 Mar 2025 19:18 UTC

13 points

0 comments · 5 min read · LW link

Effective AI Outreach | A Data Driven Approach

NoahCWilson · 10 Mar 2025 19:18 UTC

1 point

0 comments · 15 min read · LW link

Emergent AI Society. Tasks, Scarcity, Talks

Andrey Seryakov · 10 Mar 2025 19:18 UTC

1 point

0 comments · 5 min read · LW link

Sentinel minutes #10/2025: Trump tariffs, US/China tensions, Claude code reward hacking.

NunoSempere · 10 Mar 2025 19:00 UTC

25 points

0 comments · 10 min read · LW link

(blog.sentinel-team.org)

Have you actually tried raising the birth rate?

Yair Halberstadt · 10 Mar 2025 18:06 UTC

6 points

5 comments · 1 min read · LW link

Split Personality Training: Revealing Latent Knowledge Through Personality-Shift Tokens

Florian_Dietz · 10 Mar 2025 16:07 UTC

44 points

7 comments · 9 min read · LW link

We Have No Plan for Preventing Loss of Control in Open Models

Andrew Dickson · 10 Mar 2025 15:35 UTC

46 points

11 comments · 22 min read · LW link

Lock-In Threat Models

alamerton · 10 Mar 2025 10:22 UTC

5 points

0 comments · 8 min read · LW link

Book Review: Affective Neuroscience

sarahconstantin · 10 Mar 2025 6:50 UTC

62 points

8 comments · 13 min read · LW link

(sarahconstantin.substack.com)

The chessboard world

phdead · 10 Mar 2025 1:26 UTC

5 points

0 comments · 8 min read · LW link

[Question] when will LLMs become human-level bloggers?

nostalgebraist · 9 Mar 2025 21:10 UTC

125 points

34 comments · 6 min read · LW link

Everything I Know About Semantics I Learned From Music Notation

J Bostock · 9 Mar 2025 18:09 UTC

34 points

2 comments · 10 min read · LW link

Phoenix Rising

Metacelsus · 9 Mar 2025 11:53 UTC

67 points

7 comments · 5 min read · LW link

(denovo.substack.com)

How well can Claude write coding questions?

bodry · 9 Mar 2025 5:29 UTC

3 points

1 comment · 12 min read · LW link

A model of the final phase: the current frontier AIs as de facto CEOs of their own companies

Mitchell_Porter · 8 Mar 2025 22:15 UTC

23 points

2 comments · 1 min read · LW link

Harry Potter and the Methods of Rationality 10 Year Anniversary Party!

Robert Cousineau · 8 Mar 2025 21:29 UTC

6 points

0 comments · 1 min read · LW link

A case for peer-reviewed conspiracy theories

Sam G · 8 Mar 2025 20:41 UTC

13 points

3 comments · 4 min read · LW link

The machine has no mouth and it must scream

zef · 8 Mar 2025 16:40 UTC

80 points

1 comment · 7 min read · LW link

(zephyyr.substack.com)

How Do We Fix the Education Crisis?

James Camacho · 8 Mar 2025 2:59 UTC

12 points

5 comments · 8 min read · LW link

GPT-4.5 Can Play Losing Chess

GoteNoSente · 8 Mar 2025 0:58 UTC

9 points

0 comments · 1 min read · LW link

(chatgpt.com)

[Question] are “almost-p-zombies” possible?

KvmanThinking · 7 Mar 2025 22:58 UTC

4 points

3 comments · 1 min read · LW link

Sufficiently Decentralized Intelligence is Indistinguishable from Synchronicity

Sahil · 7 Mar 2025 21:50 UTC

61 points

1 comment · 19 min read · LW link

Amplifying the Computational No-Coincidence Conjecture

glauberdebona · 7 Mar 2025 21:29 UTC

8 points

6 comments · 7 min read · LW link

[ages 16-21] Apply to PAIR & ESPR, Summer AI & Rationality Programs

Anna Gajdova · 7 Mar 2025 19:49 UTC

4 points

0 comments · 1 min read · LW link

What if consciousness emerges from a predictive loop?

JohnMarkNorman · 7 Mar 2025 19:46 UTC

2 points

0 comments · 2 min read · LW link

Forecasting newsletter #3/2025: Long march through the institutions

NunoSempere · 7 Mar 2025 18:17 UTC

8 points

0 comments · 1 min read · LW link

(forecasting.substack.com)

Childhood and Education #9: School is Hell

Zvi · 7 Mar 2025 12:40 UTC

53 points

36 comments · 37 min read · LW link

(thezvi.wordpress.com)

The Insanity Detector and Writing

Johannes C. Mayer · 7 Mar 2025 11:19 UTC

20 points

3 comments · 1 min read · LW link

So how well is Claude playing Pokémon?

Julian Bradshaw · 7 Mar 2025 5:54 UTC

173 points

76 comments · 5 min read · LW link

Of Loving Grace

Charlie Sanders · 7 Mar 2025 4:48 UTC

−3 points

0 comments · 3 min read · LW link

(www.dailymicrofiction.com)

In-Context Scheming: A Run is Worth a Thousand Words

noise-field · 7 Mar 2025 2:47 UTC

10 points

0 comments · 1 min read · LW link

(github.com)

AI for Music, A Tool for Manipulation or Expression?

Sunny Huiseon Lee · 7 Mar 2025 2:47 UTC

1 point

0 comments · 1 min read · LW link

Are recent LLMs better at reasoning or better at memorizing?

Jude Khouja, harrymayne, ryanothnielkearns and karolinakorgul · 7 Mar 2025 2:44 UTC

11 points

0 comments · 4 min read · LW link

The Dead Planet Theory

arealsociety · 7 Mar 2025 2:43 UTC

17 points

0 comments · 1 min read · LW link

(open.substack.com)

How Can Average People Contribute to AI Safety?

Stephen McAleese · 6 Mar 2025 22:50 UTC

16 points

4 comments · 8 min read · LW link

Anthropic’s Recommendations to OSTP for the U.S. AI Action Plan

UnofficialLinkpostBot · 6 Mar 2025 22:38 UTC

11 points

2 comments · 2 min read · LW link

(www.anthropic.com)

Lots of brief thoughts on Software Engineering

Yair Halberstadt · 6 Mar 2025 19:50 UTC

47 points

17 comments · 10 min read · LW link

What the Headlines Miss About the Latest Decision in the Musk vs. OpenAI Lawsuit

garrison · 6 Mar 2025 19:49 UTC

98 points

0 comments · 6 min read · LW link

(garrisonlovely.substack.com)

The optimizer won’t just guess your intended semantics

Thomas Kehrenberg · 6 Mar 2025 19:42 UTC

20 points

1 comment · 6 min read · LW link