Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
2
Visualisation of Probability Mass
brook
Jan 25, 2023, 3:09 PM
7
points
0
comments
LW
link
When Did EA Start?
jefftk
Jan 25, 2023, 2:30 PM
37
points
2
comments
2
min read
LW
link
(www.jefftk.com)
Some Thoughts on AI Art
abramdemski
Jan 25, 2023, 2:18 PM
74
points
20
comments
7
min read
LW
link
Quick thoughts on “scalable oversight” / “super-human feedback” research
David Scott Krueger (formerly: capybaralet)
Jan 25, 2023, 12:55 PM
27
points
9
comments
2
min read
LW
link
Sapir-Whorf for Rationalists
Duncan Sabien (Inactive)
Jan 25, 2023, 7:58 AM
155
points
49
comments
19
min read
LW
link
ChatGPT vs the 2-4-6 Task
cwillu
Jan 25, 2023, 6:59 AM
20
points
4
comments
3
min read
LW
link
Pessimistic Shard Theory
Garrett Baker
Jan 25, 2023, 12:59 AM
72
points
13
comments
3
min read
LW
link
Thatcher’s Axiom
Edward P. Könings
Jan 24, 2023, 10:35 PM
10
points
22
comments
4
min read
LW
link
[Question]
Some questions about free will compatibilism
Asking Questions
Jan 24, 2023, 9:54 PM
3
points
21
comments
6
min read
LW
link
Alexander and Yudkowsky on AGI goals
Scott Alexander
and
Eliezer Yudkowsky
Jan 24, 2023, 9:09 PM
179
points
53
comments
26
min read
LW
link
1
review
[Question]
Is _The Age of AI: And Our Human Future_ worth reading
jmh
Jan 24, 2023, 9:05 PM
4
points
0
comments
1
min read
LW
link
Inverse Scaling Prize: Second Round Winners
Ian McKenzie
,
Sam Bowman
and
Ethan Perez
Jan 24, 2023, 8:12 PM
58
points
17
comments
15
min read
LW
link
ChatGPT intimates a tantalizing future; its core LLM is organized on multiple levels; and it has broken the idea of thinking.
Bill Benzon
Jan 24, 2023, 7:05 PM
5
points
0
comments
5
min read
LW
link
How-to Transformer Mechanistic Interpretability—in 50 lines of code or less!
StefanHex
Jan 24, 2023, 6:45 PM
47
points
5
comments
13
min read
LW
link
The Cabinet of Wikipedian Curiosities
Sam Enright
Jan 24, 2023, 6:22 PM
36
points
5
comments
6
min read
LW
link
(samenright.com)
Explanatory Parsimony, Explanatory Superfluousness and Uselessness of Newton’s First Law
Jimdrix_Hendri
Jan 24, 2023, 5:21 PM
−2
points
7
comments
2
min read
LW
link
Guesstimate: Why and how to use it
brook
and
chanamessinger
Jan 24, 2023, 4:24 PM
8
points
0
comments
3
min read
LW
link
(forum.effectivealtruism.org)
GWWC Pledge History
jefftk
Jan 24, 2023, 3:50 PM
15
points
0
comments
3
min read
LW
link
(www.jefftk.com)
Gradient hacking is extremely difficult
beren
Jan 24, 2023, 3:45 PM
170
points
22
comments
5
min read
LW
link
[Question]
What sci-fi books are most relevant to a future with transformative AI?
sid
Jan 24, 2023, 3:30 PM
2
points
9
comments
1
min read
LW
link
Grant-making in EA should consider peer-reviewing grant applications along the public-sector model
Ben Smith
Jan 24, 2023, 3:01 PM
0
points
3
comments
LW
link
“Endgame safety” for AGI
Steven Byrnes
Jan 24, 2023, 2:15 PM
85
points
10
comments
6
min read
LW
link
Thoughts on hardware / compute requirements for AGI
Steven Byrnes
Jan 24, 2023, 2:03 PM
63
points
32
comments
24
min read
LW
link
Parameter Scaling Comes for RL, Maybe
1a3orn
Jan 24, 2023, 1:55 PM
100
points
3
comments
14
min read
LW
link
How to find cool things in a new place
Sam F. Brown
Jan 24, 2023, 11:20 AM
12
points
0
comments
1
min read
LW
link
[Crosspost] ACX 2022 Prediction Contest Results
Scott Alexander
,
Eric Neyman
and
Sam Marks
Jan 24, 2023, 6:56 AM
48
points
6
comments
8
min read
LW
link
The Human-AI Reflective Equilibrium
Allison Duettmann
Jan 24, 2023, 1:32 AM
22
points
1
comment
24
min read
LW
link
“Status” can be corrosive; here’s how I handle it
Orpheus16
Jan 24, 2023, 1:25 AM
71
points
8
comments
6
min read
LW
link
[Question]
What area of the digital domain seems safe from AI in the next 5-10 years?
Adrien Chauvet
Jan 24, 2023, 1:16 AM
11
points
14
comments
1
min read
LW
link
Some of my disagreements with List of Lethalities
TurnTrout
Jan 24, 2023, 12:25 AM
70
points
7
comments
10
min read
LW
link
Rounding Someone Off
David Udell
Jan 24, 2023, 12:03 AM
25
points
0
comments
5
min read
LW
link
Life Has a Cruel Symmetry
philh
Jan 23, 2023, 11:40 PM
21
points
5
comments
11
min read
LW
link
(reasonableapproximation.net)
Highlights and Prizes from the 2021 Review Phase
Raemon
Jan 23, 2023, 9:41 PM
38
points
14
comments
21
min read
LW
link
[Question]
AI safety milestones?
Zach Stein-Perlman
Jan 23, 2023, 9:00 PM
7
points
5
comments
1
min read
LW
link
[Question]
A post-quantum theory of classical gravity?
Logan Zoellner
Jan 23, 2023, 8:39 PM
13
points
5
comments
1
min read
LW
link
Meals For Unclear Dietary Restrictions
jefftk
Jan 23, 2023, 8:00 PM
17
points
3
comments
2
min read
LW
link
(www.jefftk.com)
It’s ok
stratospher
Jan 23, 2023, 6:11 PM
1
point
0
comments
2
min read
LW
link
Experimenting with beta.character.ai
svemirski
Jan 23, 2023, 5:31 PM
−3
points
5
comments
1
min read
LW
link
This week in fashion
Jan
Jan 23, 2023, 5:23 PM
29
points
7
comments
7
min read
LW
link
(universalprior.substack.com)
Movie Review: Megan
Zvi
Jan 23, 2023, 12:50 PM
60
points
19
comments
24
min read
LW
link
(thezvi.wordpress.com)
[Question]
Has private AGI research made independent safety research ineffective already? What should we do about this?
Roman Leventov
Jan 23, 2023, 7:36 AM
43
points
5
comments
5
min read
LW
link
Deconfusing “Capabilities vs. Alignment”
RobertM
Jan 23, 2023, 4:46 AM
27
points
7
comments
2
min read
LW
link
What a compute-centric framework says about AI takeoff speeds
Tom Davidson
Jan 23, 2023, 4:02 AM
188
points
30
comments
16
min read
LW
link
1
review
Philly Rat Fest
LoganChipkin
Jan 23, 2023, 4:01 AM
9
points
0
comments
1
min read
LW
link
EA & LW Forum Weekly Summary (16th − 22nd Jan ’23)
Zoe Williams
Jan 23, 2023, 3:46 AM
13
points
0
comments
LW
link
Consider Trying Dictation
jefftk
Jan 22, 2023, 10:50 PM
23
points
10
comments
2
min read
LW
link
(www.jefftk.com)
Emotional attachment to AIs opens doors to problems
Igor Ivanov
Jan 22, 2023, 8:28 PM
20
points
10
comments
4
min read
LW
link
What fills a vacuum?
Logan Kieller
Jan 22, 2023, 7:25 PM
11
points
6
comments
2
min read
LW
link
Gemini modeling
TsviBT
Jan 22, 2023, 2:28 PM
12
points
8
comments
11
min read
LW
link
Large language models learn to represent the world
gjm
Jan 22, 2023, 1:10 PM
101
points
20
comments
3
min read
LW
link
1
review
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel