Poll on AI opinions.

Niclas Kupper

23 Feb 2025 22:39 UTC

1 point

2 comments · 1 min read · LW link

The Geometry of Linear Regression versus PCA

criticalpoints

23 Feb 2025 21:01 UTC

20 points

7 comments · 6 min read · LW link

(eregis.github.io)

Judgements: Merging Prediction & Evidence

abramdemski

23 Feb 2025 19:35 UTC

107 points

7 comments · 6 min read · LW link

Intelligence as Privilege Escalation

Cole Wyeth

23 Feb 2025 19:31 UTC

29 points

2 comments · 5 min read · LW link

[Question] Have LLMs Generated Novel Insights?

abramdemski and Cole Wyeth

23 Feb 2025 18:22 UTC

169 points

41 comments · 2 min read · LW link

The case for corporal punishment

Yair Halberstadt

23 Feb 2025 15:05 UTC

28 points

5 comments · 2 min read · LW link

Reflections on the state of the race to superintelligence, February 2025

Mitchell_Porter

23 Feb 2025 13:58 UTC

22 points

7 comments · 4 min read · LW link

List of most interesting ideas I encountered in my life, ranked

Lucien

23 Feb 2025 12:36 UTC

21 points

6 comments · 1 min read · LW link

Test of the Bene Gesserit

lsusr

23 Feb 2025 11:51 UTC

19 points

3 comments · 3 min read · LW link

Moral gauge theory: A speculative suggestion for AI alignment

James Diacoumis

23 Feb 2025 11:42 UTC

6 points

3 comments · 8 min read · LW link

[Question] Does human (mis)alignment pose a significant and imminent existential threat?

jr

23 Feb 2025 10:03 UTC

6 points

3 comments · 1 min read · LW link

Deep sparse autoencoders yield interpretable features too

Armaan A. Abraham

23 Feb 2025 5:46 UTC

31 points

8 comments · 8 min read · LW link

New Report: Multi-Agent Risks from Advanced AI

Lewis Hammond

23 Feb 2025 0:32 UTC

25 points

0 comments · 2 min read · LW link

(www.cooperativeai.com)

Power Lies Trembling: a three-book review

Richard_Ngo

22 Feb 2025 22:57 UTC

214 points

29 comments · 15 min read · LW link

(www.mindthefuture.info)

Transformer Dynamics: a neuro-inspired approach to MechInterp

guitchounts and jfernando

22 Feb 2025 21:33 UTC

11 points

0 comments · 5 min read · LW link

Recursive Cognitive Refinement (RCR): A Self-Correcting Approach for LLM Hallucinations

mxTheo

22 Feb 2025 21:32 UTC

0 points

0 comments · 2 min read · LW link

Gradual Disempowerment: Simplified

Annapurna

22 Feb 2025 16:59 UTC

10 points

1 comment · 1 min read · LW link

(jorgevelez.substack.com)

AI Apocalypse and the Buddha

pchvykov

22 Feb 2025 16:33 UTC

−17 points

6 comments · 9 min read · LW link

Unaligned AGI & Brief History of Inequality

ank

22 Feb 2025 16:26 UTC

−20 points

4 comments · 7 min read · LW link

HPMOR Anniversary Guide

Screwtape

22 Feb 2025 16:17 UTC

64 points

7 comments · 3 min read · LW link

Forecasting Uncontrolled Spread of AI

Alvin Ånestrand

22 Feb 2025 13:05 UTC

2 points

0 comments · 10 min read · LW link

(forecastingaifutures.substack.com)

Seeing Through the Eyes of the Algorithm

silentbob

22 Feb 2025 11:54 UTC

18 points

3 comments · 10 min read · LW link

Proselytizing

lsusr

22 Feb 2025 11:54 UTC

49 points

3 comments · 2 min read · LW link

Workshop: Interpretability in LLMs using Geometric and Statistical Methods

Karthik Viswanathan

22 Feb 2025 9:39 UTC

17 points

0 comments · 8 min read · LW link

Information throughput of biological humans and frontier LLMs

benwr

22 Feb 2025 7:15 UTC

12 points

0 comments · 1 min read · LW link

Inefficiencies in Pharmaceutical Research Practices

ErioirE

22 Feb 2025 4:43 UTC

20 points

2 comments · 5 min read · LW link

Build a Metaculus Forecasting Bot in 30 Minutes: A Practical Guide

ChristianWilliams

22 Feb 2025 3:52 UTC

7 points

0 comments · 1 min read · LW link

Intelligence–Agency Equivalence ≈ Mass–Energy Equivalence: On Static Nature of Intelligence & Physicalization of Ethics

ank

22 Feb 2025 0:12 UTC

1 point

0 comments · 6 min read · LW link

Alignment can be the ‘clean energy’ of AI

Cameron Berg, Kvee and Trent Hodgeson

22 Feb 2025 0:08 UTC

69 points

8 comments · 8 min read · LW link

The Sorry State of AI X-Risk Advocacy, and Thoughts on Doing Better

Thane Ruthenis

21 Feb 2025 20:15 UTC

157 points

53 comments · 6 min read · LW link

ParaScopes: Do Language Models Plan the Upcoming Paragraph?

NickyP

21 Feb 2025 16:50 UTC

43 points

2 comments · 20 min read · LW link

Linguistic Imperialism in AI: Enforcing Human-Readable Chain-of-Thought

Lukas Petersson

21 Feb 2025 15:45 UTC

5 points

0 comments · 5 min read · LW link

(lukaspetersson.com)

On OpenAI’s Model Spec 2.0

Zvi

21 Feb 2025 14:10 UTC

52 points

4 comments · 43 min read · LW link

(thezvi.wordpress.com)

Longtermist implications of aliens Space-Faring Civilizations—Introduction

Maxime Riché

21 Feb 2025 12:08 UTC

21 points

0 comments · 6 min read · LW link

MAISU—Minimal AI Safety Unconference

Linda Linsefors

21 Feb 2025 11:36 UTC

19 points

2 comments · 2 min read · LW link

The case for the death penalty

Yair Halberstadt

21 Feb 2025 8:30 UTC

24 points

81 comments · 5 min read · LW link

Make Superintelligence Loving

Davey Morse

21 Feb 2025 6:07 UTC

8 points

9 comments · 5 min read · LW link

The Takeoff Speeds Model Predicts We May Be Entering Crunch Time

johncrox

21 Feb 2025 2:26 UTC

56 points

7 comments · 18 min read · LW link

(readtheoom.substack.com)

Humans are Just Self Aware Intelligent Biological Machines

asksathvik

21 Feb 2025 1:03 UTC

3 points

9 comments · 2 min read · LW link

(asksathvik.substack.com)

Pre-ASI: The case for an enlightened mind, capital, and AI literacy in maximizing the good life

Noahh

21 Feb 2025 0:03 UTC

5 points

5 comments · 6 min read · LW link

(open.substack.com)

Timaeus in 2024

Jesse Hoogland, Stan van Wingerden, Alexander Gietelink Oldenziel and Daniel Murfet

20 Feb 2025 23:54 UTC

99 points

1 comment · 8 min read · LW link

Biological humans collectively exert at most 400 gigabits/s of control over the world.

benwr

20 Feb 2025 23:44 UTC

15 points

3 comments · 1 min read · LW link

The first RCT for GLP-1 drugs and alcoholism isn’t what we hoped

dynomight

20 Feb 2025 22:30 UTC

62 points

4 comments · 6 min read · LW link

(dynomight.net)

Published report: Pathways to short TAI timelines

Zershaaneh Qureshi

20 Feb 2025 22:10 UTC

22 points

0 comments · 17 min read · LW link

(www.convergenceanalysis.org)

Neural Scaling Laws Rooted in the Data Distribution

aribrill

20 Feb 2025 21:22 UTC

8 points

0 comments · 1 min read · LW link

(arxiv.org)

Demonstrating specification gaming in reasoning models

Matrice Jacobine

20 Feb 2025 19:26 UTC

4 points

0 comments · 1 min read · LW link

(arxiv.org)

What makes a theory of intelligence useful?

Cole Wyeth

20 Feb 2025 19:22 UTC

16 points

0 comments · 11 min read · LW link

AI #104: American State Capacity on the Brink

Zvi

20 Feb 2025 14:50 UTC

37 points

9 comments · 44 min read · LW link

(thezvi.wordpress.com)

US AI Safety Institute will be ‘gutted,’ Axios reports

Matrice Jacobine

20 Feb 2025 14:40 UTC

11 points

1 comment · 1 min read · LW link

(www.zdnet.com)

Human-AI Relationality is Already Here

bridgebot

20 Feb 2025 7:08 UTC

17 points

0 comments · 15 min read · LW link