All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 20242025

AllJanFeb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 202122 23 24 25 26 27 28 29 30 31

Why We Need More Shovel-Ready AI Notkilleveryoneism Megaproject Proposals

Peter Berggren20 Jan 2025 22:38 UTC

36 points

1 comment6 min readLW link

Tips and Code for Empirical Research Workflows

John Hughes and Ethan Perez

20 Jan 2025 22:31 UTC

96 points

15 comments20 min readLW link

Lecture Series on Tiling Agents #2

abramdemski20 Jan 2025 21:02 UTC

16 points

0 comments1 min readLW link

Announcement: Learning Theory Online Course

Yegreg and Alex Flint

20 Jan 2025 19:55 UTC

63 points

33 comments4 min readLW link

The Hidden Status Game in Hospital Slacking

EpistemicExplorer20 Jan 2025 18:35 UTC

2 points

4 comments3 min readLW link

Monthly Roundup #26: January 2025

Zvi20 Jan 2025 15:30 UTC

34 points

15 comments43 min readLW link

(thezvi.wordpress.com)

Things I have been using LLMs for

Kaj_Sotala20 Jan 2025 14:20 UTC

51 points

13 comments7 min readLW link

(kajsotala.fi)

[Question] What are the chances that Superhuman Agents are already being tested on the internet?

artemium20 Jan 2025 11:09 UTC

3 points

1 comment1 min readLW link

Detroit Lions—over confidence is over rated?

Hzn20 Jan 2025 10:53 UTC

6 points

0 comments1 min readLW link

Logits, log-odds, and loss for parallel circuits

Dmitry Vaintrob20 Jan 2025 9:56 UTC

57 points

4 comments11 min readLW link

Worries about latent reasoning in LLMs

Caleb Biddulph20 Jan 2025 9:09 UTC

47 points

11 comments7 min readLW link

SIGMI Certification Criteria

a littoral wizard20 Jan 2025 2:41 UTC

6 points

0 comments1 min readLW link

AXRP Episode 38.5 - Adrià Garriga-Alonso on Detecting AI Scheming

DanielFilan20 Jan 2025 0:40 UTC

9 points

0 comments16 min readLW link

The Monster in Our Heads

testingthewaters19 Jan 2025 23:58 UTC

35 points

4 comments5 min readLW link

AI: How We Got Here—A Neuroscience Perspective

Mordechai Rorvig19 Jan 2025 23:51 UTC

5 points

0 comments2 min readLW link

(www.kickstarter.com)

Agent Foundations 2025 at CMU

Alexander Gietelink Oldenziel and windows

19 Jan 2025 23:48 UTC

90 points

10 comments1 min readLW link

Who is marketing AI alignment?

MrThink19 Jan 2025 21:37 UTC

23 points

4 comments1 min readLW link

Some lessons from the OpenAI-FrontierMath debacle

7vik19 Jan 2025 21:09 UTC

71 points

9 comments4 min readLW link

Maximally Eggy Crepes

jefftk19 Jan 2025 20:40 UTC

12 points

0 comments1 min readLW link

(www.jefftk.com)

The second bitter lesson — there’s a fundamental problem with aligning distributed AI

aelwood19 Jan 2025 19:00 UTC

−5 points

0 comments5 min readLW link

(pursuingreality.substack.com)

The Gentle Romance

Richard_Ngo19 Jan 2025 18:29 UTC

244 points

46 comments15 min readLW link

(www.asimov.press)

Is theory good or bad for AI safety?

Dmitry Vaintrob19 Jan 2025 10:32 UTC

28 points

1 comment5 min readLW link

[Question] What’s the Right Way to think about Information Theoretic quantities in Neural Networks?

Dalcy19 Jan 2025 8:04 UTC

45 points

13 comments3 min readLW link

Per Tribalismum ad Astra

Martin Sustrik19 Jan 2025 6:50 UTC

30 points

5 comments2 min readLW link

(250bpm.substack.com)

Five Recent AI Tutoring Studies

Arjun Panickssery19 Jan 2025 3:53 UTC

94 points

0 comments2 min readLW link

(arjunpanickssery.substack.com)

Does Society need a cultural outlet in turbulent political times?

Freya Mcneill19 Jan 2025 2:45 UTC

−3 points

0 comments7 min readLW link

On Thiel’s New American Regime

shawkisukkar19 Jan 2025 2:45 UTC

−3 points

0 comments5 min readLW link

(shawkisukkar.substack.com)

be the person that makes the meeting productive

Oldmanrahul18 Jan 2025 22:32 UTC

9 points

0 comments1 min readLW link

Beards and Masks?

jefftk18 Jan 2025 16:00 UTC

72 points

5 comments4 min readLW link

(www.jefftk.com)

[Question] How likely is AGI to force us all to be happy forever? (much like in the Three Worlds Collide novel)

uhbif1918 Jan 2025 15:39 UTC

9 points

5 comments1 min readLW link

Well-being in the mind, and its implications for utilitarianism

Sjlver18 Jan 2025 15:32 UTC

6 points

2 comments2 min readLW link

[Exercise] Four Examples of Noticing Confusion

Logan Riggs18 Jan 2025 15:29 UTC

8 points

8 comments3 min readLW link

Scaling Wargaming for Global Catastrophic Risks with AI

rai and NunoSempere

18 Jan 2025 15:10 UTC

40 points

2 comments4 min readLW link

(blog.sentinel-team.org)

Alignment ideas

qbolec18 Jan 2025 12:43 UTC

11 points

1 comment8 min readLW link

AI-enabled Cloud Gaming

samuelshadrach18 Jan 2025 11:56 UTC

1 point

0 comments3 min readLW link

(samuelshadrach.com)

Don’t ignore bad vibes you get from people

Kaj_Sotala18 Jan 2025 9:20 UTC

164 points

52 comments2 min readLW link

(kajsotala.fi)

Renormalization Redux: QFT Techniques for AI Interpretability

Lauren Greenspan and Dmitry Vaintrob

18 Jan 2025 3:54 UTC

47 points

12 comments7 min readLW link

[Question] What’s Wrong With the Simulation Argument?

Davey18 Jan 2025 2:32 UTC

6 points

49 comments1 min readLW link

Your AI Safety focus is downstream of your AGI timeline

Michael Flood17 Jan 2025 21:24 UTC

9 points

0 comments4 min readLW link

Thoughts on the conservative assumptions in AI control

Buck17 Jan 2025 19:23 UTC

91 points

5 comments13 min readLW link

Timaeus is hiring researchers & engineers

Jesse Hoogland and Stan van Wingerden

17 Jan 2025 19:13 UTC

65 points

4 comments4 min readLW link

Model Amnesty Project

themis17 Jan 2025 18:53 UTC

3 points

2 comments3 min readLW link

Addressing doubts of AI progress: Why GPT-5 is not late, and why data scarcity isn’t a fundamental limiter near term.

LDJ17 Jan 2025 18:53 UTC

2 points

0 comments2 min readLW link

Playing Dixit with AI: How Well LLMs Detect ‘Me-ness’

Mariia Koroliuk17 Jan 2025 18:52 UTC

5 points

0 comments2 min readLW link

Doing a self-randomized study of the impacts of glycine on sleep (Science is hard)

thedissonance.net17 Jan 2025 18:49 UTC

11 points

5 comments11 min readLW link

How sci-fi can have drama without dystopia or doomerism

jasoncrawford17 Jan 2025 15:22 UTC

19 points

3 comments3 min readLW link

(newsletter.rootsofprogress.org)

[Question] What do you mean with ‘alignment is solvable in principle’?

Remmelt17 Jan 2025 15:03 UTC

3 points

9 comments1 min readLW link

Meta Pivots on Content Moderation

Zvi17 Jan 2025 14:20 UTC

47 points

3 comments10 min readLW link

(thezvi.wordpress.com)

Tax Price Gouging?

jefftk17 Jan 2025 14:10 UTC

55 points

22 comments3 min readLW link

(www.jefftk.com)

The quantum red pill or: They lied to you, we live in the (density) matrix

Dmitry Vaintrob17 Jan 2025 13:58 UTC

37 points

34 comments12 min readLW link