Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Why Not Subagents?
johnswentworth
and
David Lorell
Jun 22, 2023, 10:16 PM
130
points
52
comments
14
min read
LW
link
1
review
Catastrophic Risks from AI #2: Malicious Use
Dan H
,
Mantas Mazeika
and
TW123
Jun 22, 2023, 5:10 PM
38
points
1
comment
17
min read
LW
link
(arxiv.org)
Catastrophic Risks from AI #1: Introduction
Dan H
,
Mantas Mazeika
and
TW123
Jun 22, 2023, 5:09 PM
40
points
1
comment
5
min read
LW
link
(arxiv.org)
AI #17: The Litany
Zvi
Jun 22, 2023, 2:30 PM
95
points
34
comments
56
min read
LW
link
(thezvi.wordpress.com)
[Research Update] Sparse Autoencoder features are bimodal
Robert_AIZI
Jun 22, 2023, 1:15 PM
24
points
1
comment
5
min read
LW
link
(aizi.substack.com)
The Hubinger lectures on AGI safety: an introductory lecture series
evhub
Jun 22, 2023, 12:59 AM
126
points
0
comments
1
min read
LW
link
(www.youtube.com)
How to Search Multiple Websites Quickly
Nicholas / Heather Kross
Jun 22, 2023, 12:42 AM
16
points
1
comment
1
min read
LW
link
[Question]
Newbie questions about information theory and transformers
Misaligned-Semi-intelligence
Jun 21, 2023, 10:45 PM
10
points
1
comment
1
min read
LW
link
Progress links and tweets, 2023-06-21: Stewart Brand wants your comments
jasoncrawford
Jun 21, 2023, 8:52 PM
11
points
1
comment
1
min read
LW
link
(rootsofprogress.org)
What—ideally—should young and intelligent people do?
veterxiph
Jun 21, 2023, 8:21 PM
1
point
4
comments
3
min read
LW
link
Using Claude to convert dialog transcripts into great posts?
mako yass
Jun 21, 2023, 8:19 PM
6
points
4
comments
4
min read
LW
link
Which personality traits are real? Stress-testing the lexical hypothesis
tailcalled
Jun 21, 2023, 7:46 PM
65
points
5
comments
9
min read
LW
link
1
review
“textbooks are all you need”
bhauth
Jun 21, 2023, 5:06 PM
66
points
18
comments
2
min read
LW
link
(arxiv.org)
Philosophical Cyborg (Part 2)...or, The Good Successor
ukc10014
Jun 21, 2023, 3:43 PM
21
points
1
comment
31
min read
LW
link
Relational Speaking
jefftk
Jun 21, 2023, 2:40 PM
11
points
0
comments
2
min read
LW
link
(www.jefftk.com)
My side of an argument with Jacob Cannell about chip interconnect losses
Steven Byrnes
Jun 21, 2023, 1:33 PM
144
points
11
comments
11
min read
LW
link
Short timelines and slow, continuous takeoff as the safest path to AGI
rosehadshar
and
LintzA
Jun 21, 2023, 8:56 AM
65
points
15
comments
7
min read
LW
link
A way to make solving alignment 10.000 times easier. The shorter case for a massive open source simbox project.
AlexFromSafeTransition
Jun 21, 2023, 8:08 AM
2
points
16
comments
14
min read
LW
link
My tentative best guess on how EAs and Rationalists sometimes turn crazy
habryka
Jun 21, 2023, 4:11 AM
199
points
110
comments
8
min read
LW
link
The Importance of Judging: A Reflection on Rational Thought
CrimsonChin
Jun 20, 2023, 10:49 PM
2
points
0
comments
4
min read
LW
link
“Natural is better” is a valuable heuristic
Neil
Jun 20, 2023, 10:25 PM
35
points
16
comments
4
min read
LW
link
№.6 For Those About To Dress...
party girl
Jun 20, 2023, 9:14 PM
5
points
0
comments
4
min read
LW
link
(affale.substack.com)
Frame Bridging v0.8 - an inquiry and a technique
Unreal
Jun 20, 2023, 7:46 PM
11
points
9
comments
6
min read
LW
link
Public Transit is not Infinitely Safe
jefftk
Jun 20, 2023, 6:40 PM
97
points
34
comments
1
min read
LW
link
(www.jefftk.com)
why I’m here now
bhauth
Jun 20, 2023, 5:13 PM
8
points
3
comments
1
min read
LW
link
Causality: A Brief Introduction
tom4everitt
,
Lewis Hammond
,
Jonathan Richens
,
Francis Rhys Ward
,
RyanCarey
,
sbenthall
and
James Fox
Jun 20, 2023, 3:01 PM
49
points
18
comments
6
min read
LW
link
Lightning Post: Things people in AI Safety should stop talking about
Prometheus
Jun 20, 2023, 3:00 PM
23
points
6
comments
2
min read
LW
link
Having a headache and not having a headache
Jim Pivarski
Jun 20, 2023, 2:59 PM
7
points
9
comments
3
min read
LW
link
Never Fight The Last War
ChristianKl
Jun 20, 2023, 12:35 PM
32
points
4
comments
1
min read
LW
link
[Question]
Why didn’t virologists run the studies necessary to determine which viruses are airborne?
ChristianKl
Jun 20, 2023, 11:58 AM
28
points
19
comments
1
min read
LW
link
A Friendly Face (Another Failure Story)
Karl von Wendt
,
Sofia Bharadia
,
PeterDrotos
,
Artem Korotkov
,
mespa
and
mruwnik
Jun 20, 2023, 10:31 AM
65
points
21
comments
16
min read
LW
link
[Question]
Are the majority of your ancestors farmers or non-farmers?
Linch
Jun 20, 2023, 8:55 AM
19
points
47
comments
1
min read
LW
link
DSLT 3. Neural Networks are Singular
Liam Carroll
Jun 20, 2023, 8:20 AM
29
points
5
comments
19
min read
LW
link
10 quick takes about AGI
Max H
Jun 20, 2023, 2:22 AM
35
points
17
comments
7
min read
LW
link
OpenAI introduces function calling for GPT-4
mic
and
André Ferretti
Jun 20, 2023, 1:58 AM
24
points
3
comments
4
min read
LW
link
(openai.com)
Approaches to Thump
jefftk
Jun 20, 2023, 1:50 AM
8
points
0
comments
2
min read
LW
link
(www.jefftk.com)
Ban development of unpredictable powerful models?
TurnTrout
Jun 20, 2023, 1:43 AM
46
points
25
comments
4
min read
LW
link
Capture today’s market, capture tomorrow’s game board
SimonBiggs
Jun 20, 2023, 12:45 AM
9
points
0
comments
5
min read
LW
link
Lessons On How To Get Things Right On The First Try
johnswentworth
and
David Lorell
Jun 19, 2023, 11:58 PM
252
points
57
comments
10
min read
LW
link
1
review
Mode collapse in RL may be fueled by the update equation
TurnTrout
and
MichaelEinhorn
Jun 19, 2023, 9:51 PM
53
points
10
comments
8
min read
LW
link
New reference standard on LLM Application security started by OWASP
QuantumForest
Jun 19, 2023, 8:54 PM
2
points
0
comments
1
min read
LW
link
Experiments in Evaluating Steering Vectors
Gytis Daujotas
Jun 19, 2023, 3:11 PM
34
points
4
comments
4
min read
LW
link
Provisionality
TsviBT
Jun 19, 2023, 11:49 AM
7
points
2
comments
7
min read
LW
link
[Question]
When did you orient?
lemonhope
Jun 19, 2023, 7:22 AM
12
points
7
comments
1
min read
LW
link
Guide to rationalist interior decorating
mingyuan
Jun 19, 2023, 6:47 AM
327
points
53
comments
12
min read
LW
link
4
reviews
A Multidisciplinary Approach to Alignment (MATA) and Archetypal Transfer Learning (ATL)
MiguelDev
Jun 19, 2023, 2:32 AM
4
points
2
comments
7
min read
LW
link
resolving some neural network mysteries
bhauth
Jun 19, 2023, 12:09 AM
44
points
6
comments
2
min read
LW
link
(www.bhauth.com)
Why I am not an AI extinction cautionista
Shmi
Jun 18, 2023, 9:28 PM
22
points
40
comments
2
min read
LW
link
My impression of singular learning theory
Ege Erdil
Jun 18, 2023, 3:34 PM
47
points
30
comments
2
min read
LW
link
Berlin AI Alignment Open Meetup July 2023
GuyP
Jun 18, 2023, 2:13 PM
1
point
0
comments
1
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel