Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
2026
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
Page
1
SolidGoldMagikarp (plus, prompt generation)
Jessica Rumbelow
and
mwatkins
5 Feb 2023 22:02 UTC
677
points
208
comments
12
min read
LW
link
1
review
The Waluigi Effect (mega-post)
Cleo Nardo
3 Mar 2023 3:22 UTC
648
points
188
comments
16
min read
LW
link
The Talk: a brief explanation of sexual dimorphism
Malmesbury
18 Sep 2023 16:23 UTC
553
points
79
comments
16
min read
LW
link
3
reviews
How much do you believe your results?
Eric Neyman
6 May 2023 20:31 UTC
524
points
18
comments
15
min read
LW
link
4
reviews
(ericneyman.wordpress.com)
The ants and the grasshopper
Richard_Ngo
4 Jun 2023 22:00 UTC
505
points
45
comments
5
min read
LW
link
4
reviews
(www.narrativeark.xyz)
Focus on the places where you feel shocked everyone’s dropping the ball
So8res
2 Feb 2023 0:27 UTC
502
points
65
comments
4
min read
LW
link
3
reviews
Significantly Enhancing Adult Intelligence With Gene Editing May Be Possible
GeneSmith
and
kman
12 Dec 2023 18:14 UTC
474
points
212
comments
33
min read
LW
link
2
reviews
Things I Learned by Spending Five Thousand Hours In Non-EA Charities
jenn
1 Jun 2023 20:48 UTC
451
points
37
comments
8
min read
LW
link
1
review
(jenn.site)
Steering GPT-2-XL by adding an activation vector
TurnTrout
,
Monte M
,
David Udell
,
lisathiergart
and
Ulisse Mini
13 May 2023 18:42 UTC
441
points
98
comments
50
min read
LW
link
1
review
Douglas Hofstadter changes his mind on Deep Learning & AI risk (June 2023)?
gwern
3 Jul 2023 0:48 UTC
431
points
54
comments
7
min read
LW
link
(www.youtube.com)
GPTs are Predictors, not Imitators
Eliezer Yudkowsky
8 Apr 2023 19:59 UTC
429
points
100
comments
3
min read
LW
link
3
reviews
Please don’t throw your mind away
TsviBT
15 Feb 2023 21:41 UTC
419
points
50
comments
18
min read
LW
link
1
review
Bing Chat is blatantly, aggressively misaligned
evhub
15 Feb 2023 5:29 UTC
408
points
181
comments
2
min read
LW
link
1
review
Social Dark Matter
Duncan Sabien (Inactive)
16 Nov 2023 20:00 UTC
388
points
131
comments
34
min read
LW
link
2
reviews
Statement on AI Extinction—Signed by AGI Labs, Top Academics, and Many Other Notable Figures
Dan H
30 May 2023 9:05 UTC
383
points
78
comments
1
min read
LW
link
1
review
(www.safe.ai)
Noting an error in Inadequate Equilibria
Matthew Barnett
8 Feb 2023 1:33 UTC
379
points
60
comments
2
min read
LW
link
2
reviews
How it feels to have your mind hacked by an AI
blaked
12 Jan 2023 0:33 UTC
374
points
222
comments
17
min read
LW
link
How to have Polygenically Screened Children
GeneSmith
7 May 2023 16:01 UTC
372
points
128
comments
27
min read
LW
link
1
review
My Objections to “We’re All Gonna Die with Eliezer Yudkowsky”
Quintin Pope
21 Mar 2023 0:06 UTC
365
points
233
comments
39
min read
LW
link
1
review
Fucking Goddamn Basics of Rationalist Discourse
LoganStrohl
4 Feb 2023 1:47 UTC
364
points
104
comments
1
min read
LW
link
3
reviews
Childhoods of exceptional people
Henrik Karlsson
6 Feb 2023 17:27 UTC
353
points
62
comments
15
min read
LW
link
1
review
(escapingflatland.substack.com)
Shallow review of live agendas in alignment & safety
technicalities
and
Stag
27 Nov 2023 11:10 UTC
351
points
73
comments
29
min read
LW
link
1
review
Guide to rationalist interior decorating
mingyuan
19 Jun 2023 6:47 UTC
344
points
53
comments
12
min read
LW
link
4
reviews
Shutting Down the Lightcone Offices
habryka
and
Ben Pace
14 Mar 2023 22:47 UTC
339
points
103
comments
17
min read
LW
link
2
reviews
Inside Views, Impostor Syndrome, and the Great LARP
johnswentworth
25 Sep 2023 16:08 UTC
339
points
54
comments
5
min read
LW
link
Cyborgism
Niki Dupuis
and
janus
10 Feb 2023 14:47 UTC
339
points
47
comments
35
min read
LW
link
2
reviews
Against Almost Every Theory of Impact of Interpretability
Charbel-Raphaël
17 Aug 2023 18:44 UTC
336
points
93
comments
26
min read
LW
link
2
reviews
Understanding and controlling a maze-solving policy network
TurnTrout
,
peligrietzer
,
Ulisse Mini
,
Monte M
and
David Udell
11 Mar 2023 18:59 UTC
335
points
28
comments
23
min read
LW
link
EA Vegan Advocacy is not truthseeking, and it’s everyone’s problem
Elizabeth
28 Sep 2023 23:30 UTC
334
points
250
comments
22
min read
LW
link
2
reviews
(acesounderglass.com)
Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research
evhub
,
Nicholas Schiefer
,
Carson Denison
and
Ethan Perez
8 Aug 2023 1:30 UTC
331
points
30
comments
18
min read
LW
link
1
review
Book Review: How Minds Change
bc4026bd4aaa5b7fe
25 May 2023 17:55 UTC
329
points
53
comments
15
min read
LW
link
When do “brains beat brawn” in Chess? An experiment
titotal
28 Jun 2023 13:33 UTC
327
points
107
comments
7
min read
LW
link
2
reviews
(titotal.substack.com)
Sharing Information About Nonlinear
Ben Pace
7 Sep 2023 6:51 UTC
323
points
324
comments
34
min read
LW
link
The Parable of the King and the Random Process
moridinamael
1 Mar 2023 22:18 UTC
317
points
26
comments
6
min read
LW
link
3
reviews
Speaking to Congressional staffers about AI risk
Orpheus16
and
hath
4 Dec 2023 23:08 UTC
314
points
25
comments
15
min read
LW
link
1
review
Alignment Grantmaking is Funding-Limited Right Now
johnswentworth
19 Jul 2023 16:49 UTC
312
points
68
comments
1
min read
LW
link
On not getting contaminated by the wrong obesity ideas
Natália
28 Jan 2023 20:18 UTC
310
points
69
comments
30
min read
LW
link
LW Team is adjusting moderation policy
Raemon
4 Apr 2023 20:41 UTC
307
points
185
comments
3
min read
LW
link
AI Timelines
habryka
,
Daniel Kokotajlo
,
Ajeya Cotra
and
Ege Erdil
10 Nov 2023 5:28 UTC
302
points
143
comments
51
min read
LW
link
2
reviews
Predictable updating about AI risk
Joe Carlsmith
8 May 2023 21:53 UTC
297
points
25
comments
36
min read
LW
link
1
review
The 101 Space You Will Always Have With You
Screwtape
29 Nov 2023 4:56 UTC
296
points
23
comments
6
min read
LW
link
1
review
Notes on Teaching in Prison
jsd
19 Apr 2023 1:53 UTC
295
points
13
comments
12
min read
LW
link
Pausing AI Developments Isn’t Enough. We Need to Shut it All Down by Eliezer Yudkowsky
jacquesthibs
29 Mar 2023 23:16 UTC
294
points
297
comments
3
min read
LW
link
(time.com)
Accidentally Load Bearing
jefftk
13 Jul 2023 16:10 UTC
291
points
19
comments
1
min read
LW
link
1
review
(www.jefftk.com)
Hooray for stepping out of the limelight
So8res
1 Apr 2023 2:45 UTC
289
points
26
comments
1
min read
LW
link
Towards Monosemanticity: Decomposing Language Models With Dictionary Learning
Zac Hatfield-Dodds
5 Oct 2023 21:01 UTC
289
points
22
comments
2
min read
LW
link
1
review
(transformer-circuits.pub)
The 6D effect: When companies take risks, one email can be very powerful.
scasper
4 Nov 2023 20:08 UTC
289
points
42
comments
3
min read
LW
link
Basics of Rationalist Discourse
Duncan Sabien (Inactive)
27 Jan 2023 2:40 UTC
287
points
193
comments
31
min read
LW
link
4
reviews
My Model Of EA Burnout
LoganStrohl
25 Jan 2023 17:52 UTC
282
points
50
comments
5
min read
LW
link
1
review
We don’t trade with ants
KatjaGrace
10 Jan 2023 23:50 UTC
281
points
110
comments
7
min read
LW
link
1
review
(worldspiritsockpuppet.com)
Back to top
Next