All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All JanFebMar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 101112 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28

Logical Correlation

niplav10 Feb 2025 23:29 UTC

24 points

7 comments10 min readLW link

Proof idea: SLT to AIT

Lucius Bushnaq10 Feb 2025 23:14 UTC

42 points

15 comments6 min readLW link

LW/ACX social meetup

Stefan10 Feb 2025 21:12 UTC

2 points

0 comments1 min readLW link

A Bearish Take on AI, as a Treat

rats10 Feb 2025 19:22 UTC

11 points

0 comments4 min readLW link

(open.substack.com)

Beyond ELO: Rethinking Chess Skill as a Multidimensional Random Variable

Oliver Oswald10 Feb 2025 19:19 UTC

6 points

7 comments2 min readLW link

Claude is More Anxious than GPT; Personality is an axis of interpretability in language models

future_detective10 Feb 2025 19:19 UTC

2 points

2 comments8 min readLW link

(dhealy.substack.com)

Notes on Occam via Solomonoff vs. hierarchical Bayes

JesseClifton10 Feb 2025 17:55 UTC

29 points

7 comments4 min readLW link

Sleeping Beauty: an Accuracy-based Approach

glauberdebona10 Feb 2025 15:40 UTC

7 points

2 comments7 min readLW link

Political Idolatry

Arturo Macias10 Feb 2025 15:26 UTC

−8 points

7 comments2 min readLW link

ML4Good Colombia—Applications Open to LatAm Participants

Alejandro Acelas and Manuela García

10 Feb 2025 15:03 UTC

5 points

0 comments1 min readLW link

Nonpartisan AI safety

Yair Halberstadt10 Feb 2025 14:55 UTC

30 points

4 comments2 min readLW link

Opinion Article Scoring System

ciaran 10 Feb 2025 14:32 UTC

1 point

0 comments5 min readLW link

Levels of Friction

Zvi10 Feb 2025 13:10 UTC

155 points

8 comments12 min readLW link

(thezvi.wordpress.com)

Baumol effect vs Jevons paradox

Hzn10 Feb 2025 8:28 UTC

0 points

0 comments1 min readLW link

(hzn33.neocities.org)

[Question] A Simulation of Automation economics?

qbolec10 Feb 2025 8:11 UTC

10 points

1 comment1 min readLW link

[Question] Should I Divest from AI?

Oliver Kuperman10 Feb 2025 3:29 UTC

6 points

4 comments1 min readLW link

OpenAI lied about SFT vs. RLHF

sanxiyn10 Feb 2025 3:24 UTC

10 points

2 comments1 min readLW link

(x.com)

“Self-Blackmail” and Alternatives

jessicata9 Feb 2025 23:20 UTC

20 points

12 comments7 min readLW link

(unstableontology.com)

Altman blog on post-AGI world

Julian Bradshaw9 Feb 2025 21:52 UTC

29 points

10 comments1 min readLW link

(blog.samaltman.com)

Forecasting newsletter #2/2025: Forecasting meetup network

NunoSempere9 Feb 2025 18:07 UTC

13 points

0 comments4 min readLW link

(forecasting.substack.com)

How identical twin sisters feel about nieces vs their own daughters

Dave92F19 Feb 2025 17:36 UTC

4 points

19 comments1 min readLW link

Two hemispheres—I do not think it means what you think it means

Viliam9 Feb 2025 15:33 UTC

112 points

21 comments14 min readLW link

The Structure of Professional Revolutions

SebastianG 9 Feb 2025 13:23 UTC

8 points

0 comments4 min readLW link

Gary Marcus now saying AI can’t do things it can already do

Benjamin_Todd9 Feb 2025 12:24 UTC

62 points

12 comments1 min readLW link

(benjamintodd.substack.com)

How do you make a 250x better vaccine at 1/10 the cost? Develop it in India.

Abhishaike Mahajan9 Feb 2025 3:53 UTC

4 points

5 comments1 min readLW link

(www.owlposting.com)

Less Laptop Velcro

jefftk9 Feb 2025 3:30 UTC

19 points

0 comments1 min readLW link

(www.jefftk.com)

AXRP Episode 38.7 - Anthony Aguirre on the Future of Life Institute

DanielFilan9 Feb 2025 1:10 UTC

10 points

0 comments12 min readLW link

[Job ad] LISA CEO

Ryan Kidd, James Fox and mike_safeAI

9 Feb 2025 0:18 UTC

18 points

4 comments2 min readLW link

“Think it Faster” worksheet

Raemon8 Feb 2025 22:02 UTC

69 points

11 comments4 min readLW link

Seven sources of goals in LLM agents

Seth Herd8 Feb 2025 21:54 UTC

23 points

3 comments2 min readLW link

[Question] p(s-risks to contemporary humans)?

MattAlexander8 Feb 2025 21:19 UTC

6 points

5 comments6 min readLW link

Cross-Layer Feature Alignment and Steering in Large Language Model

dlaptev8 Feb 2025 20:18 UTC

9 points

0 comments6 min readLW link

Towards building blocks of ontologies

Daniel C, Alex_Altair, Dalcy, Alfred Harwood and JoseFaustino

8 Feb 2025 16:03 UTC

29 points

0 comments26 min readLW link

Can Knowledge Hurt You? The Dangers of Infohazards (and Exfohazards)

aggliu and Writer

8 Feb 2025 15:51 UTC

19 points

0 comments5 min readLW link

(www.youtube.com)

Distilling the Internal Model Principle

JoseFaustino8 Feb 2025 14:59 UTC

21 points

0 comments16 min readLW link

Knocking Down My AI Optimist Strawman

tailcalled8 Feb 2025 10:52 UTC

31 points

3 comments6 min readLW link

Preserving Epistemic Novelty in AI: Experiments, Insights, and the Case for Decentralized Collective Intelligence

Andy E Williams8 Feb 2025 10:25 UTC

−4 points

8 comments7 min readLW link

Chaos Investments v0.31

Screwtape8 Feb 2025 6:53 UTC

19 points

1 comment9 min readLW link

AI Safety Oversights

Davey Morse8 Feb 2025 6:15 UTC

3 points

0 comments1 min readLW link

Wiki on Suspects in Lind, Zajko, and Maland Killings

Rebecca_Records8 Feb 2025 4:16 UTC

20 points

4 comments1 min readLW link

Research directions Open Phil wants to fund in technical AI safety

jake_mendel, maxnadeau and Peter Favaloro

8 Feb 2025 1:40 UTC

117 points

21 comments58 min readLW link

(www.openphilanthropy.org)

So You Want To Make Marginal Progress...

johnswentworth7 Feb 2025 23:22 UTC

304 points

42 comments4 min readLW link

Reasons-based choice and cluelessness

JesseClifton7 Feb 2025 22:21 UTC

34 points

0 comments10 min readLW link

[Translation] In the Age of AI don’t Look for Unicorns

mushroomsoup7 Feb 2025 21:06 UTC

3 points

0 comments10 min readLW link

Racing Towards Fusion and AI

Jeffrey Heninger7 Feb 2025 20:40 UTC

49 points

11 comments7 min readLW link

‘High-Level Machine Intelligence’ and ‘Full Automation of Labor’ in the AI Impacts Surveys

Jeffrey Heninger7 Feb 2025 20:40 UTC

11 points

1 comment7 min readLW link

Request for Information for a new US AI Action Plan (OSTP RFI)

agucova7 Feb 2025 20:40 UTC

5 points

0 comments2 min readLW link

(www.federalregister.gov)

A Problem to Solve Before Building a Deception Detector

Eleni Angelou and lewis smith

7 Feb 2025 19:35 UTC

78 points

12 comments14 min readLW link

Request for proposals: improving capability evaluations

cb7 Feb 2025 18:51 UTC

1 point

0 comments1 min readLW link

(www.openphilanthropy.org)

How AI Takeover Might Happen in 2 Years

joshc7 Feb 2025 17:10 UTC

426 points

142 comments29 min readLW link

(x.com)