All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

AI Alignment Research Engineer Accelerator (ARENA): call for applicants

CallumMcDougallApr 17, 2023, 8:30 PM

100 points

9 comments7 min readLW link

AI #8: People Can Do Reasonable Things

ZviApr 20, 2023, 3:50 PM

100 points

16 comments55 min readLW link

(thezvi.wordpress.com)

The Social Alignment Problem

irvingApr 28, 2023, 2:16 PM

99 points

13 comments8 min readLW link

Would we even want AI to solve all our problems?

So8resApr 21, 2023, 6:04 PM

98 points

15 comments2 min readLW link

Given the Restrict Act, Don’t Ban TikTok

ZviApr 4, 2023, 2:40 PM

97 points

9 comments4 min readLW link

(thezvi.wordpress.com)

Why Simulator AIs want to be Active Inference AIs

Jan_Kulveit and rosehadshar

Apr 10, 2023, 6:23 PM

96 points

9 comments8 min readLW link 1 review

Communicating effectively under Knightian norms

Richard_NgoApr 3, 2023, 10:39 PM

96 points

54 comments6 min readLW link

Scaffolded LLMs as natural language computers

berenApr 12, 2023, 10:47 AM

95 points

10 comments11 min readLW link

Contra Yudkowsky on Doom from Foom #2

jacob_cannellApr 27, 2023, 12:07 AM

94 points

76 comments6 min readLW link

Exposure to Lizardman is Lethal

Duncan Sabien (Inactive)Apr 2, 2023, 6:57 PM

91 points

97 comments3 min readLW link

Contra Yudkowsky on AI Doom

jacob_cannellApr 24, 2023, 12:20 AM

89 points

111 comments9 min readLW link

Capabilities and alignment of LLM cognitive architectures

Seth HerdApr 18, 2023, 4:29 PM

88 points

18 comments20 min readLW link

A Confession about the LessWrong Team

RubyApr 1, 2023, 9:47 PM

87 points

5 comments2 min readLW link

Singularities against the Singularity: Announcing Workshop on Singular Learning Theory and Alignment

Jesse Hoogland, Alexander Gietelink Oldenziel and Daniel Murfet

Apr 1, 2023, 9:58 AM

87 points

0 comments1 min readLW link

(singularlearningtheory.com)

You can use GPT-4 to create prompt injections against GPT-4

WitchBOTApr 6, 2023, 8:39 PM

87 points

8 comments2 min readLW link

The Agency Overhang

Jeffrey LadishApr 21, 2023, 7:47 AM

85 points

6 comments6 min readLW link

No convincing evidence for gradient descent in activation space

BlaineApr 12, 2023, 4:48 AM

85 points

9 comments20 min readLW link

The benevolence of the butcher

dr_sApr 8, 2023, 4:29 PM

84 points

33 comments6 min readLW link 1 review

AI Safety via Luck

JozdienApr 1, 2023, 8:13 PM

82 points

7 comments11 min readLW link

Polio Lab Leak Caught with Wastewater Sampling

CullenApr 7, 2023, 1:06 AM

82 points

3 comments LW link

Anthropic is further accelerating the Arms Race?

sapphireApr 6, 2023, 11:29 PM

82 points

22 comments1 min readLW link

(techcrunch.com)

The surprising parameter efficiency of vision models

berenApr 8, 2023, 7:44 PM

81 points

28 comments4 min readLW link

AISafety.world is a map of the AIS ecosystem

Hamish DoodlesApr 6, 2023, 6:37 PM

80 points

0 comments1 min readLW link

AI #6: Agents of Change

ZviApr 6, 2023, 2:00 PM

79 points

13 comments47 min readLW link

(thezvi.wordpress.com)

Introducing AlignmentSearch: An AI Alignment-Informed Conversional Agent

BionicD0LPH1N, Fraser and TheBayesian

Apr 1, 2023, 4:39 PM

79 points

14 comments4 min readLW link

My experience getting funding for my biological research

MetacelsusApr 16, 2023, 10:53 PM

78 points

10 comments5 min readLW link

(denovo.substack.com)

Locating Fulcrum Experiences

LoganStrohlApr 28, 2023, 8:14 PM

78 points

31 comments17 min readLW link

Introducing the Nuts and Bolts Of Naturalism

LoganStrohlApr 22, 2023, 6:31 PM

77 points

2 comments3 min readLW link

Research agenda: Supervising AIs improving AIs

Quintin Pope, Owen D, Roman Engeler and jacquesthibs

Apr 29, 2023, 5:09 PM

76 points

5 comments19 min readLW link

Romance, misunderstanding, social stances, and the human LLM

Kaj_SotalaApr 27, 2023, 12:59 PM

75 points

32 comments16 min readLW link

I was Wrong, Simulator Theory is Real

Robert_AIZIApr 26, 2023, 5:45 PM

75 points

7 comments3 min readLW link

(aizi.substack.com)

The Computational Anatomy of Human Values

berenApr 6, 2023, 10:33 AM

74 points

30 comments30 min readLW link

[Question] Is this true? @tyler_m_john: [If we had started using CFCs earlier, we would have ended most life on the planet]

tailcalledApr 10, 2023, 2:22 PM

73 points

15 comments1 min readLW link

(twitter.com)

All images from the WaitButWhy sequence on AI

trevorApr 8, 2023, 7:36 AM

73 points

5 comments2 min readLW link

Repugnant levels of violins

Solenoid_EntityApr 12, 2023, 5:11 PM

73 points

10 comments12 min readLW link

The Toxoplasma of AGI Doom and Capabilities?

Robert_AIZIApr 24, 2023, 6:11 PM

72 points

12 comments1 min readLW link

Japan AI Alignment Conference Postmortem

Chris Scammell and Katrina Joslin

Apr 20, 2023, 10:58 AM

71 points

8 comments8 min readLW link

Power laws in Speedrunning and Machine Learning

Jsevillamol and Ege Erdil

Apr 24, 2023, 10:06 AM

71 points

1 comment1 min readLW link

(arxiv.org)

SmartyHeaderCode: anomalous tokens for GPT3.5 and GPT-4

AdamYedidiaApr 15, 2023, 10:35 PM

71 points

18 comments6 min readLW link

SERI MATS—Summer 2023 Cohort

Aris, Ryan Kidd and Christian Smith

Apr 8, 2023, 3:32 PM

71 points

25 comments4 min readLW link

A decade of lurking, a month of posting

Max HApr 9, 2023, 12:21 AM

70 points

4 comments5 min readLW link

[Linkpost] Sam Altman’s 2015 Blog Posts Machine Intelligence Parts 1 & 2

OliviaJApr 28, 2023, 4:02 PM

70 points

4 comments9 min readLW link

Approximation is expensive, but the lunch is cheap

Jesse Hoogland and Zach Furman

Apr 19, 2023, 2:19 PM

70 points

3 comments16 min readLW link

AGI ruin mostly rests on strong claims about alignment and deployment, not about society

Rob BensingerApr 24, 2023, 1:06 PM

70 points

8 comments6 min readLW link

Getting Started With Naturalism

LoganStrohlApr 23, 2023, 9:02 PM

69 points

4 comments11 min readLW link 1 review

Why Are Maximum Entropy Distributions So Ubiquitous?

johnswentworthApr 5, 2023, 8:12 PM

68 points

6 comments9 min readLW link

Mechanistically interpreting time in GPT-2 small

rgould, Elizabeth Ho and Arthur Conmy

Apr 16, 2023, 5:57 PM

68 points

6 comments21 min readLW link

Subscripts for Probabilities

niplavApr 13, 2023, 6:32 PM

67 points

9 comments5 min readLW link

Green goo is plausible

anithiteApr 18, 2023, 12:04 AM

67 points

31 comments4 min readLW link 1 review

On “aiming for convergence on truth”

gjmApr 11, 2023, 6:19 PM

67 points

55 comments13 min readLW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer