Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
The Parable Of The Fallen Pendulum—Part 2
johnswentworth
Mar 12, 2024, 9:41 PM
78
points
8
comments
4
min read
LW
link
Open consultancy: Letting untrusted AIs choose what answer to argue for
Fabien Roger
Mar 12, 2024, 8:38 PM
35
points
5
comments
5
min read
LW
link
[Question]
Is anyone working on formally verified AI toolchains?
metachirality
Mar 12, 2024, 7:36 PM
17
points
4
comments
1
min read
LW
link
Transformer Debugger
Henk Tillman
Mar 12, 2024, 7:08 PM
26
points
0
comments
1
min read
LW
link
(github.com)
Superforecasting the Origins of the Covid-19 Pandemic
DanielFilan
Mar 12, 2024, 7:01 PM
64
points
0
comments
1
min read
LW
link
(goodjudgment.substack.com)
minimum viable action
Sindhu Prasad
Mar 12, 2024, 4:06 PM
1
point
0
comments
3
min read
LW
link
Hardball questions for the Gemini Congressional Hearing
Michael Thiessen
Mar 12, 2024, 3:27 PM
−11
points
2
comments
1
min read
LW
link
OpenAI: The Board Expands
Zvi
Mar 12, 2024, 2:00 PM
92
points
1
comment
30
min read
LW
link
(thezvi.wordpress.com)
Update on Developing an Ethics Calculator to Align an AGI to
sweenesm
Mar 12, 2024, 12:33 PM
4
points
2
comments
8
min read
LW
link
[Question]
How do you identify and counteract your biases in decision-making?
warrenjordan
Mar 12, 2024, 5:01 AM
2
points
1
comment
1
min read
LW
link
How Much Have I Been Playing?
jefftk
Mar 12, 2024, 2:10 AM
9
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought
Miles Turpin
Mar 11, 2024, 11:46 PM
16
points
0
comments
1
min read
LW
link
(arxiv.org)
AI Safety Action Plan—A report commissioned by the US State Department
agucova
Mar 11, 2024, 10:14 PM
22
points
1
comment
LW
link
(www.gladstone.ai)
A discussion of AI risk and the cost/benefit calculation of stopping or pausing AI development
DuncanFowler
Mar 11, 2024, 9:41 PM
1
point
0
comments
1
min read
LW
link
Among the A.I. Doomsayers—The New Yorker
agucova
Mar 11, 2024, 9:35 PM
12
points
1
comment
LW
link
(www.newyorker.com)
Be More Katja
Nathan Young
Mar 11, 2024, 9:12 PM
53
points
0
comments
3
min read
LW
link
AI Incident Reporting: A Regulatory Review
Deric Cheng
and
Elliot Mckernon
Mar 11, 2024, 9:03 PM
16
points
0
comments
6
min read
LW
link
Results from an Adversarial Collaboration on AI Risk (FRI)
Josh Rosenberg
,
AvitalM
,
Molly
and
rosehadshar
Mar 11, 2024, 8:00 PM
60
points
3
comments
9
min read
LW
link
(forecastingresearch.org)
The Astronomical Sacrifice Dilemma
Matthew McRedmond
Mar 11, 2024, 7:58 PM
15
points
3
comments
4
min read
LW
link
Epiphenomenalism leads to eliminativism about qualia
Clément L
Mar 11, 2024, 7:53 PM
4
points
0
comments
7
min read
LW
link
The Best Essay (Paul Graham)
Chris_Leong
Mar 11, 2024, 7:25 PM
25
points
2
comments
1
min read
LW
link
(paulgraham.com)
Open Thread Spring 2024
habryka
Mar 11, 2024, 7:17 PM
22
points
160
comments
1
min read
LW
link
New social credit formalizations
KatjaGrace
Mar 11, 2024, 7:00 PM
23
points
3
comments
2
min read
LW
link
(worldspiritsockpuppet.com)
How disagreements about Evidential Correlations could be settled
Martín Soto
Mar 11, 2024, 6:28 PM
11
points
3
comments
4
min read
LW
link
“Artificial General Intelligence”: an extremely brief FAQ
Steven Byrnes
Mar 11, 2024, 5:49 PM
74
points
6
comments
2
min read
LW
link
Some (problematic) aesthetics of what constitutes good work in academia
Steven Byrnes
Mar 11, 2024, 5:47 PM
148
points
12
comments
12
min read
LW
link
Storable Votes with a Pay as you win mechanism: a contribution for institutional design
Arturo Macias
Mar 11, 2024, 3:58 PM
17
points
19
comments
2
min read
LW
link
Tend to your clarity, not your confusion
Severin T. Seehrich
Mar 11, 2024, 3:09 PM
23
points
1
comment
6
min read
LW
link
[Question]
What do we know about the AI knowledge and views, especially about existential risk, of the new OpenAI board members?
Zvi
Mar 11, 2024, 2:55 PM
60
points
2
comments
2
min read
LW
link
“How could I have thought that faster?”
mesaoptimizer
Mar 11, 2024, 10:56 AM
235
points
32
comments
2
min read
LW
link
(twitter.com)
Simple versus Short: Higher-order degeneracy and error-correction
Daniel Murfet
Mar 11, 2024, 7:52 AM
110
points
8
comments
13
min read
LW
link
Deconstructing Bostrom’s Classic Argument for AI Doom
Nora Belrose
Mar 11, 2024, 5:58 AM
16
points
14
comments
1
min read
LW
link
(www.youtube.com)
Advice Needed: Does Using a LLM Compomise My Personal Epistemic Security?
Naomi
Mar 11, 2024, 5:57 AM
17
points
7
comments
2
min read
LW
link
Some Thoughts on Concept Formation and Use in Agents
CatGoddess
Mar 11, 2024, 5:03 AM
12
points
0
comments
8
min read
LW
link
Steelmanning as an especially insidious form of strawmanning
Cornelius Dybdahl
Mar 11, 2024, 2:25 AM
10
points
13
comments
5
min read
LW
link
One-shot strategy games?
Raemon
Mar 11, 2024, 12:19 AM
41
points
42
comments
1
min read
LW
link
Understanding SAE Features with the Logit Lens
Joseph Bloom
and
Johnny Lin
Mar 11, 2024, 12:16 AM
68
points
0
comments
14
min read
LW
link
Replacing the Water Heater’s Anode
jefftk
Mar 11, 2024, 12:00 AM
22
points
0
comments
2
min read
LW
link
(www.jefftk.com)
Briefly Extending Differential Optimization to Distributions
J Bostock
Mar 10, 2024, 8:41 PM
4
points
0
comments
2
min read
LW
link
Evolution did a surprising good job at aligning humans...to social status
Eli Tyre
Mar 10, 2024, 7:34 PM
24
points
37
comments
1
min read
LW
link
Pausing AI is Positive Expected Value
Liron
Mar 10, 2024, 5:10 PM
9
points
2
comments
3
min read
LW
link
(twitter.com)
W2SG: Introduction
Maria Kapros
Mar 10, 2024, 4:25 PM
2
points
2
comments
10
min read
LW
link
An Optimistic Solution to the Fermi Paradox
Glenn Clayton
10 Mar 2024 14:39 UTC
4
points
6
comments
13
min read
LW
link
Counterfactual Civilization Simulation Version −1.0 aka my application to Johannes Mayer’s SPAR project
Morphism
10 Mar 2024 10:10 UTC
1
point
0
comments
14
min read
LW
link
Notes from a Prompt Factory
Richard_Ngo
10 Mar 2024 5:13 UTC
104
points
19
comments
9
min read
LW
link
(www.narrativeark.xyz)
Investigating Basin Volume with XOR Networks
CatGoddess
10 Mar 2024 1:35 UTC
10
points
0
comments
5
min read
LW
link
[Linkpost] MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data
Bogdan Ionut Cirstea
10 Mar 2024 1:30 UTC
10
points
0
comments
1
min read
LW
link
(openreview.net)
0th Person and 1st Person Logic
Adele Lopez
10 Mar 2024 0:56 UTC
60
points
28
comments
6
min read
LW
link
Completion Estimates
scarcegreengrass
9 Mar 2024 22:56 UTC
7
points
2
comments
3
min read
LW
link
Semi-Simplicial Types, Part I: Motivation and History
astradiol
9 Mar 2024 22:07 UTC
20
points
3
comments
10
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel