All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 678 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

How Can Average People Contribute to AI Safety?

Stephen McAleese6 Mar 2025 22:50 UTC

16 points

4 comments8 min readLW link

Anthropic’s Recommendations to OSTP for the U.S. AI Action Plan

UnofficialLinkpostBot6 Mar 2025 22:38 UTC

11 points

2 comments2 min readLW link

(www.anthropic.com)

Lots of brief thoughts on Software Engineering

Yair Halberstadt6 Mar 2025 19:50 UTC

47 points

17 comments10 min readLW link

What the Headlines Miss About the Latest Decision in the Musk vs. OpenAI Lawsuit

garrison6 Mar 2025 19:49 UTC

98 points

0 comments6 min readLW link

(garrisonlovely.substack.com)

The optimizer won’t just guess your intended semantics

Thomas Kehrenberg6 Mar 2025 19:42 UTC

20 points

1 comment6 min readLW link

AISN #49: Superintelligence Strategy

Corin Katzke and Dan H

6 Mar 2025 17:46 UTC

6 points

1 comment5 min readLW link

(newsletter.safe.ai)

Decision-Relevance of worlds and ADT implementations

Maxime Riché6 Mar 2025 16:57 UTC

9 points

0 comments15 min readLW link

AI #106: Not so Fast

Zvi6 Mar 2025 15:40 UTC

34 points

5 comments38 min readLW link

(thezvi.wordpress.com)

Can a finite physical device be Turing equivalent?

Noosphere896 Mar 2025 15:02 UTC

0 points

10 comments2 min readLW link

(lifeiscomputation.com)

We should start looking for scheming “in the wild”

Marius Hobbhahn6 Mar 2025 13:49 UTC

91 points

4 comments5 min readLW link

Bounded AI might be viable

Mateusz Bagiński and JustinShovelain

6 Mar 2025 12:55 UTC

24 points

4 comments20 min readLW link

Publish your genomic data

samuelshadrach6 Mar 2025 12:39 UTC

1 point

0 comments1 min readLW link

Which meat to eat: CO₂ vs Animal suffering

B Jacobs6 Mar 2025 12:37 UTC

3 points

5 comments3 min readLW link

(bobjacobs.substack.com)

Musings on Scenario Forecasting and AI

Alvin Ånestrand6 Mar 2025 12:28 UTC

10 points

0 comments11 min readLW link

(forecastingaifutures.substack.com)

Minor interpretability exploration #2: Extending superposition to different activation functions

Rareș Baron6 Mar 2025 11:22 UTC

3 points

0 comments4 min readLW link

What is Lock-In?

alamerton6 Mar 2025 11:09 UTC

5 points

0 comments9 min readLW link

ASI Game Theory: The Cosmic Dark Forest Deterrent

tavurth6 Mar 2025 10:28 UTC

1 point

4 comments1 min readLW link

The Hidden Cost of Our Lies to AI

Nicholas Andresen6 Mar 2025 5:03 UTC

151 points

19 comments7 min readLW link

(substack.com)

Camps Should List Bands

jefftk6 Mar 2025 3:00 UTC

7 points

0 comments1 min readLW link

(www.jefftk.com)

Give Neo a Chance

ank6 Mar 2025 1:48 UTC

3 points

7 comments7 min readLW link

[Question] Sparks of Original Thought?

Annapurna6 Mar 2025 0:53 UTC

6 points

4 comments1 min readLW link

Social Dilemmas — public goods, free riders, and exploitation

James Stephen Brown5 Mar 2025 23:31 UTC

7 points

0 comments3 min readLW link

(nonzerosum.games)

Introducing MASK: A Benchmark for Measuring Honesty in AI Systems

Richard Ren, Mantas Mazeika and Dan H

5 Mar 2025 22:56 UTC

37 points

5 comments2 min readLW link

(www.mask-benchmark.ai)

The Hardware-Software Framework: A New Perspective on Economic Growth with AI

Jakub Growiec5 Mar 2025 19:59 UTC

13 points

0 comments3 min readLW link

NY State Has a New Frontier Model Bill (+quick takes)

henryj5 Mar 2025 19:29 UTC

9 points

0 comments1 min readLW link

(www.henryjosephson.com)

The old memories tree

Yair Halberstadt5 Mar 2025 19:03 UTC

7 points

1 comment1 min readLW link

Reply to Vitalik on d/acc

samuelshadrach5 Mar 2025 18:55 UTC

8 points

0 comments3 min readLW link

(samuelshadrach.com)

A Bear Case: My Predictions Regarding AI Progress

Thane Ruthenis5 Mar 2025 16:41 UTC

377 points

163 comments9 min readLW link

On the Rationality of Deterring ASI

Dan H5 Mar 2025 16:11 UTC

171 points

34 comments4 min readLW link

(nationalsecurity.ai)

On OpenAI’s Safety and Alignment Philosophy

Zvi5 Mar 2025 14:00 UTC

58 points

5 comments17 min readLW link

(thezvi.wordpress.com)

The Alignment Imperative: Act Now or Lose Everything

racinkc15 Mar 2025 5:49 UTC

−14 points

0 comments1 min readLW link

Contra Dance Pay and Inflation

jefftk5 Mar 2025 2:40 UTC

12 points

0 comments2 min readLW link

(www.jefftk.com)

NYT Op-Ed The Government Knows A.G.I. Is Coming

worse5 Mar 2025 1:53 UTC

11 points

12 comments2 min readLW link

(www.nytimes.com)

Could this be an unusually good time to Earn To Give?

TomGardiner4 Mar 2025 21:51 UTC

−1 points

0 comments3 min readLW link

(forum.effectivealtruism.org)

What is the best / most proper definition of “Feeling the AGI” there is?

Annapurna4 Mar 2025 20:13 UTC

8 points

5 comments1 min readLW link

Energy Markets Temporal Arbitrage with Batteries

NickyP4 Mar 2025 17:37 UTC

28 points

3 comments16 min readLW link

Distillation of Meta’s Large Concept Models Paper

NickyP4 Mar 2025 17:33 UTC

19 points

3 comments4 min readLW link

Top AI safety newsletters, books, podcasts, etc – new AISafety.com resource

Bryce Robertson and Søren Elverlin

4 Mar 2025 17:01 UTC

33 points

2 comments1 min readLW link

2028 Should Not Be AI Safety’s First Foray Into Politics

Jesse Richardson4 Mar 2025 16:46 UTC

5 points

0 comments2 min readLW link

[Question] How Much Are LLMs Actually Boosting Real-World Programmer Productivity?

Thane Ruthenis4 Mar 2025 16:23 UTC

141 points

52 comments3 min readLW link

Validating against a misalignment detector is very different to training against one

mattmacdermott4 Mar 2025 15:41 UTC

46 points

4 comments4 min readLW link

For scheming, we should first focus on detection and then on prevention

Marius Hobbhahn4 Mar 2025 15:22 UTC

53 points

7 comments5 min readLW link

Progress links and short notes, 2025-03-03

jasoncrawford4 Mar 2025 15:20 UTC

8 points

0 comments6 min readLW link

(newsletter.rootsofprogress.org)

Formation Research: Organisation Overview

alamerton4 Mar 2025 15:03 UTC

6 points

0 comments11 min readLW link

On Writing #1

Zvi4 Mar 2025 13:30 UTC

38 points

2 comments15 min readLW link

(thezvi.wordpress.com)

The Semi-Rational Militar Firefighter

P. João4 Mar 2025 12:23 UTC

73 points

10 comments2 min readLW link

Observations About LLM Inference Pricing

Aaron_Scher4 Mar 2025 3:03 UTC

40 points

2 comments9 min readLW link

(techgov.intelligence.org)

[Question] How much should I worry about the Atlanta Fed’s GDP estimates?

Brendan Long4 Mar 2025 2:03 UTC

16 points

2 comments1 min readLW link

[Question] shouldn’t we try to get media attention?

KvmanThinking4 Mar 2025 1:39 UTC

6 points

1 comment1 min readLW link

The Milton Friedman Model of Policy Change

JohnofCharleston4 Mar 2025 0:38 UTC

152 points

17 comments4 min readLW link