All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 20242025

AllJanFeb Mar Apr May Jun Jul Aug

All1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Heritability: Five Battles

Steven ByrnesJan 14, 2025, 6:21 PM

90 points

23 comments60 min readLW link

Agent Foundations 2025 at CMU

Alexander Gietelink Oldenziel and windows

Jan 19, 2025, 11:48 PM

90 points

10 comments1 min readLW link

Scaling Sparse Feature Circuit Finding to Gemma 9B

Diego Caples, Jatin Nainani, CallumMcDougall and rrenaud

Jan 10, 2025, 11:08 AM

86 points

11 comments17 min readLW link

Stargate AI-1

ZviJan 24, 2025, 3:20 PM

85 points

1 comment18 min readLW link

(thezvi.wordpress.com)

I’m offering free math consultations!

GurkenglasJan 14, 2025, 4:30 PM

83 points

7 comments1 min readLW link

MONA: Managed Myopia with Approval Feedback

Seb Farquhar, David Lindner and Rohin Shah

Jan 23, 2025, 12:24 PM

81 points

30 comments9 min readLW link

On the OpenAI Economic Blueprint

ZviJan 15, 2025, 2:30 PM

81 points

2 comments9 min readLW link

(thezvi.wordpress.com)

No one has the ball on 1500 Russian olympiad winners who’ve received HPMOR

Mikhail SaminJan 12, 2025, 11:43 AM

80 points

21 comments1 min readLW link

Human study on AI spear phishing campaigns

Simon Lermen, Fred Heiding and Andrew Kao

Jan 3, 2025, 3:11 PM

79 points

8 comments5 min readLW link

Stream Entry

lsusrJan 7, 2025, 11:56 PM

76 points

11 comments4 min readLW link

Moderately More Than You Wanted To Know: Depressive Realism

JustisMillsJan 13, 2025, 2:57 AM

73 points

4 comments6 min readLW link

(justismills.substack.com)

Beards and Masks?

jefftkJan 18, 2025, 4:00 PM

72 points

5 comments4 min readLW link

(www.jefftk.com)

New, improved multiple-choice TruthfulQA

Owain_Evans, James Chua and Steph Lin

Jan 15, 2025, 11:32 PM

72 points

0 comments3 min readLW link

Numberwang: LLMs Doing Autonomous Research, and a Call for Input

eggsyntax and ncase

Jan 16, 2025, 5:20 PM

71 points

30 comments31 min readLW link

Yudkowsky on The Trajectory podcast

Seth HerdJan 24, 2025, 7:52 PM

71 points

39 comments2 min readLW link

(www.youtube.com)

Policymakers don’t have access to paywalled articles

Adam JonesJan 5, 2025, 10:56 AM

71 points

11 comments2 min readLW link

(adamjones.me)

Detect Goodhart and shut down

Jeremy GillenJan 22, 2025, 6:45 PM

70 points

21 comments7 min readLW link

Tail SP 500 Call Options

sapphireJan 23, 2025, 5:21 AM

70 points

28 comments2 min readLW link

Kessler’s Second Syndrome

Jesse HooglandJan 26, 2025, 7:04 AM

70 points

2 comments3 min readLW link

Some lessons from the OpenAI-FrontierMath debacle

7vikJan 19, 2025, 9:09 PM

70 points

9 comments4 min readLW link

Inference-Time-Compute: More Faithful? A Research Note

James Chua and Owain_Evans

Jan 15, 2025, 4:43 AM

69 points

10 comments11 min readLW link

Retrospective: 12 [sic] Months Since MIRI

james.lucassenJan 21, 2025, 2:52 AM

68 points

0 comments9 min readLW link

Paper: Open Problems in Mechanistic Interpretability

Lee Sharkey and bilalchughtai

Jan 29, 2025, 10:25 AM

68 points

0 comments1 min readLW link

(arxiv.org)

Chance is in the Map, not the Territory

Daniel Herrmann, ben_levinstein and Aydin Mohseni

Jan 13, 2025, 7:17 PM

67 points

18 comments7 min readLW link

Should you go with your best guess?: Against precise Bayesianism and related views

Anthony DiGiovanniJan 27, 2025, 8:25 PM

65 points

15 comments22 min readLW link

Timaeus is hiring researchers & engineers

Jesse Hoogland and Stan van Wingerden

Jan 17, 2025, 7:13 PM

65 points

4 comments4 min readLW link

Recommendations for Technical AI Safety Research Directions

Sam MarksJan 10, 2025, 7:34 PM

64 points

1 comment17 min readLW link

(alignment.anthropic.com)

Gaming TruthfulQA: Simple Heuristics Exposed Dataset Weaknesses

TurnTroutJan 16, 2025, 2:14 AM

64 points

3 comments1 min readLW link

(turntrout.com)

Read The Sequences As If They Were Written Today

Peter BerggrenJan 2, 2025, 2:51 AM

63 points

7 comments4 min readLW link

Announcement: Learning Theory Online Course

Yegreg and Alex Flint

Jan 20, 2025, 7:55 PM

63 points

33 comments4 min readLW link

“We know how to build AGI”—Sam Altman

Nikola JurkovicJan 6, 2025, 2:05 AM

62 points

5 comments1 min readLW link

(blog.samaltman.com)

Testing for Scheming with Model Deletion

GuiveJan 7, 2025, 1:54 AM

59 points

21 comments21 min readLW link

(guive.substack.com)

new chinese stealth aircraft

bhauthJan 1, 2025, 12:19 AM

58 points

3 comments6 min readLW link

(bhauth.com)

Logits, log-odds, and loss for parallel circuits

Dmitry VaintrobJan 20, 2025, 9:56 AM

57 points

4 comments11 min readLW link

A sketch of an AI control safety case

Tomek Korbak, joshc, Benjamin Hilton, Buck and Geoffrey Irving

Jan 30, 2025, 5:28 PM

57 points

0 comments5 min readLW link

A Novel Emergence of Meta-Awareness in LLM Fine-Tuning

rifeJan 15, 2025, 10:59 PM

57 points

32 comments2 min readLW link

AI Safety as a YC Startup

Lukas PeterssonJan 8, 2025, 10:46 AM

56 points

9 comments5 min readLW link

Introducing the WeirdML Benchmark

Håvard Tveit IhleJan 16, 2025, 11:38 AM

56 points

13 comments11 min readLW link

On polytopes

Dmitry VaintrobJan 25, 2025, 1:56 PM

56 points

5 comments12 min readLW link

What’s Behind the SynBio Bust?

sarahconstantinJan 30, 2025, 10:30 PM

55 points

8 comments6 min readLW link

(sarahconstantin.substack.com)

Tax Price Gouging?

jefftkJan 17, 2025, 2:10 PM

55 points

22 comments3 min readLW link

(www.jefftk.com)

Predict 2025 AI capabilities (by Sunday)

Jonas V, elifland and Sage Future

Jan 15, 2025, 12:16 AM

55 points

3 comments1 min readLW link

On DeepSeek’s r1

ZviJan 22, 2025, 7:50 PM

55 points

2 comments35 min readLW link

(thezvi.wordpress.com)

Finding Features Causally Upstream of Refusal

Daniel Lee, Eric Breck and Andy Arditi

Jan 14, 2025, 2:30 AM

54 points

5 comments12 min readLW link

AI #99: Farewell to Biden

ZviJan 16, 2025, 2:20 PM

54 points

5 comments58 min readLW link

(thezvi.wordpress.com)

Preference Inversion

BenquoJan 2, 2025, 6:15 PM

53 points

48 comments4 min readLW link

(benjaminrosshoffman.com)

The OODA Loop—Observe, Orient, Decide, Act

Davis_KingsleyJan 1, 2025, 8:00 AM

53 points

2 comments11 min readLW link

Dario Amodei: On DeepSeek and Export Controls

Zach Stein-PerlmanJan 29, 2025, 5:15 PM

53 points

3 comments1 min readLW link

(darioamodei.com)

You should read Hobbes, Locke, Hume, and Mill via EarlyModernTexts.com

Arjun PanicksseryJan 30, 2025, 12:35 PM

52 points

3 comments3 min readLW link

(arjunpanickssery.substack.com)

Discursive Warfare and Faction Formation

BenquoJan 9, 2025, 4:47 PM

52 points

3 comments3 min readLW link

(benjaminrosshoffman.com)

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer