All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

How useful is “AI Control” as a framing on AI X-Risk?

habryka and ryan_greenblatt

Mar 14, 2024, 6:06 PM

70 points

4 comments34 min readLW link

Understanding SAE Features with the Logit Lens

Joseph Bloom and Johnny Lin

Mar 11, 2024, 12:16 AM

68 points

0 comments14 min readLW link

AE Studio @ SXSW: We need more AI consciousness research (and further resources)

AE Studio, Cameron Berg, Judd Rosenblatt, phgubbins and Diogo de Lucena

Mar 26, 2024, 8:59 PM

67 points

8 comments3 min readLW link

Social status part 2/2: everything else

Steven ByrnesMar 5, 2024, 4:29 PM

65 points

2 comments23 min readLW link

All About Concave and Convex Agents

mako yassMar 24, 2024, 9:37 PM

64 points

24 comments8 min readLW link

Superforecasting the Origins of the Covid-19 Pandemic

DanielFilanMar 12, 2024, 7:01 PM

64 points

0 comments1 min readLW link

(goodjudgment.substack.com)

On the Gladstone Report

ZviMar 20, 2024, 7:50 PM

64 points

11 comments40 min readLW link

(thezvi.wordpress.com)

We Inspected Every Head In GPT-2 Small using SAEs So You Don’t Have To

robertzk, Connor Kissane, Arthur Conmy and Neel Nanda

Mar 6, 2024, 5:03 AM

63 points

0 comments12 min readLW link

AI #55: Keep Clauding Along

ZviMar 14, 2024, 3:40 PM

62 points

16 comments70 min readLW link

(thezvi.wordpress.com)

Do not delete your misaligned AGI.

mako yassMar 24, 2024, 9:37 PM

62 points

13 comments3 min readLW link

More people getting into AI safety should do a PhD

AdamGleaveMar 14, 2024, 10:14 PM

61 points

24 comments12 min readLW link

(gleave.me)

DeepMind: Evaluating Frontier Models for Dangerous Capabilities

Zach Stein-PerlmanMar 21, 2024, 3:00 AM

61 points

8 comments1 min readLW link

(arxiv.org)

Research Report: Sparse Autoencoders find only 9/180 board state features in OthelloGPT

Robert_AIZIMar 5, 2024, 1:55 PM

61 points

24 comments10 min readLW link

(aizi.substack.com)

Results from an Adversarial Collaboration on AI Risk (FRI)

Josh Rosenberg, AvitalM, Molly and rosehadshar

Mar 11, 2024, 8:00 PM

61 points

3 comments9 min readLW link

(forecastingresearch.org)

[Question] What do we know about the AI knowledge and views, especially about existential risk, of the new OpenAI board members?

ZviMar 11, 2024, 2:55 PM

60 points

2 comments2 min readLW link

5 Physics Problems

DaemonicSigil and Muireall

Mar 18, 2024, 8:05 AM

60 points

0 comments15 min readLW link

0th Person and 1st Person Logic

Adele LopezMar 10, 2024, 12:56 AM

60 points

28 comments6 min readLW link

Measuring Coherence of Policies in Toy Environments

dx26 and Richard_Ngo

Mar 18, 2024, 5:59 PM

59 points

9 comments14 min readLW link

D&D.Sci: The Mad Tyrant’s Pet Turtles

abstractapplicMar 29, 2024, 4:22 PM

59 points

18 comments2 min readLW link

Woods’ new preprint on object permanence

Steven ByrnesMar 7, 2024, 9:29 PM

58 points

1 comment6 min readLW link

On the Latest TikTok Bill

ZviMar 13, 2024, 6:50 PM

58 points

7 comments29 min readLW link

(thezvi.wordpress.com)

AI things that are perhaps as important as human-controlled AI

Chi NguyenMar 3, 2024, 6:07 PM

55 points

4 comments LW link

Come to Manifest 2024 (June 7-9 in Berkeley)

Saul MunnMar 27, 2024, 9:30 PM

54 points

2 comments LW link

(news.manifold.markets)

Be More Katja

Nathan YoungMar 11, 2024, 9:12 PM

53 points

0 comments3 min readLW link

Was Releasing Claude-3 Net-Negative?

Logan RiggsMar 27, 2024, 5:41 PM

52 points

5 comments4 min readLW link

On Lex Fridman’s Second Podcast with Altman

ZviMar 25, 2024, 12:20 PM

51 points

10 comments10 min readLW link

(thezvi.wordpress.com)

Vipassana Meditation and Active Inference: A Framework for Understanding Suffering and its Cessation

sturbMar 21, 2024, 12:32 PM

50 points

8 comments19 min readLW link

Scenario Forecasting Workshop: Materials and Learnings

elifland and charlie_griffin

Mar 8, 2024, 2:30 AM

50 points

3 comments2 min readLW link

The Broken Screwdriver and other parables

bhauthMar 4, 2024, 3:34 AM

49 points

1 comment2 min readLW link

Should rationalists be spiritual / Spirituality as overcoming delusion

Kaj_Sotala and romeostevensit

Mar 25, 2024, 4:48 PM

49 points

57 comments29 min readLW link

Highlights from Lex Fridman’s interview of Yann LeCun

Joel BurgetMar 13, 2024, 8:58 PM

48 points

15 comments41 min readLW link

Constructive Cauchy sequences vs. Dedekind cuts

jessicataMar 14, 2024, 11:04 PM

47 points

23 comments4 min readLW link

(unstableontology.com)

How to safely use an optimizer

Simon FischerMar 28, 2024, 4:11 PM

47 points

21 comments7 min readLW link

AI Safety 101 : Capabilities—Human Level AI, What? How? and When?

markov and Charbel-Raphaël

Mar 7, 2024, 5:29 PM

46 points

8 comments54 min readLW link

Metascience of the Vesuvius Challenge

Maxwell TabarrokMar 30, 2024, 12:02 PM

46 points

2 comments6 min readLW link

(www.maximum-progress.com)

Some costs of superposition

Linda LinseforsMar 3, 2024, 4:08 PM

46 points

11 comments3 min readLW link

How people stopped dying from diarrhea so much (& other life-saving decisions)

WriterMar 16, 2024, 4:00 PM

45 points

0 comments LW link

(youtu.be)

AI #54: Clauding Along

ZviMar 7, 2024, 4:00 PM

45 points

11 comments51 min readLW link

(thezvi.wordpress.com)

Laying the Foundations for Vision and Multimodal Mechanistic Interpretability & Open Problems

Sonia Joseph and Neel Nanda

Mar 13, 2024, 5:09 PM

44 points

13 comments14 min readLW link

Back to Basics: Truth is Unitary

lsusrMar 29, 2024, 9:10 PM

44 points

13 comments6 min readLW link

Housing Roundup #7

ZviMar 4, 2024, 3:00 PM

42 points

1 comment44 min readLW link

(thezvi.wordpress.com)

One-shot strategy games?

RaemonMar 11, 2024, 12:19 AM

41 points

42 comments1 min readLW link

A Teacher vs. Everyone Else

ronak69Mar 21, 2024, 5:45 PM

41 points

8 comments2 min readLW link

Jobs, Relationships, and Other Cults

Ruby and Elizabeth

Mar 13, 2024, 5:58 AM

40 points

9 comments35 min readLW link

Movie posters

KatjaGraceMar 6, 2024, 6:20 AM

40 points

0 comments2 min readLW link

(worldspiritsockpuppet.com)

Neuroscience and Alignment

Garrett BakerMar 18, 2024, 9:09 PM

40 points

25 comments2 min readLW link

Mud and Despair (Part 4 of “The Sense Of Physical Necessity”)

LoganStrohlMar 7, 2024, 12:14 AM

38 points

0 comments2 min readLW link

Elon files grave charges against OpenAI

mako yassMar 1, 2024, 5:42 PM

38 points

10 comments1 min readLW link

(www.courthousenews.com)

Increasing IQ is trivial

George3d6Mar 1, 2024, 10:43 PM

38 points

61 comments6 min readLW link

(epistemink.substack.com)

Simple Kelly betting in prediction markets

jessicataMar 6, 2024, 6:59 PM

38 points

3 comments3 min readLW link

(unstablerontology.substack.com)

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer