Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
2
Progress links and short notes, 2024-12-16
jasoncrawford
Dec 16, 2024, 5:24 PM
7
points
0
comments
2
min read
LW
link
(newsletter.rootsofprogress.org)
Effective Altruism FAQ
omnizoid
Dec 16, 2024, 4:27 PM
0
points
7
comments
12
min read
LW
link
Variably compressibly studies are fun
dkl9
Dec 16, 2024, 4:00 PM
0
points
0
comments
2
min read
LW
link
(dkl9.net)
AIs Will Increasingly Attempt Shenanigans
Zvi
Dec 16, 2024, 3:20 PM
114
points
2
comments
26
min read
LW
link
(thezvi.wordpress.com)
Testing which LLM architectures can do hidden serial reasoning
Filip Sondej
Dec 16, 2024, 1:48 PM
81
points
9
comments
4
min read
LW
link
NeuroAI for AI safety: A Differential Path
nz
and
Patrick Mineault
Dec 16, 2024, 1:17 PM
22
points
0
comments
7
min read
LW
link
(arxiv.org)
Circling as practice for “just be yourself”
Kaj_Sotala
Dec 16, 2024, 7:40 AM
86
points
5
comments
4
min read
LW
link
(kajsotala.fi)
Reanalyzing the 2023 Expert Survey on Progress in AI
AI Impacts
Dec 16, 2024, 6:10 AM
8
points
0
comments
1
min read
LW
link
(blog.aiimpacts.org)
Ideas for benchmarking LLM creativity
gwern
Dec 16, 2024, 5:18 AM
60
points
11
comments
1
min read
LW
link
(gwern.net)
Comparing the AirFanta 3Pro to the Coway AP-1512
jefftk
Dec 16, 2024, 1:40 AM
13
points
0
comments
1
min read
LW
link
(www.jefftk.com)
[Question]
are IQ tests a good measure of intelligence?
KvmanThinking
Dec 15, 2024, 11:06 PM
0
points
5
comments
1
min read
LW
link
Madison Secular Solstice
svfritz
Dec 15, 2024, 9:52 PM
1
point
0
comments
1
min read
LW
link
[Question]
Is AI alignment a purely functional property?
Roko
Dec 15, 2024, 9:42 PM
13
points
8
comments
1
min read
LW
link
[Question]
How counterfactual are logical counterfactuals?
Donald Hobson
Dec 15, 2024, 9:16 PM
11
points
10
comments
1
min read
LW
link
Debunking the myth of safe AI
henophilia
Dec 15, 2024, 5:44 PM
−11
points
8
comments
1
min read
LW
link
(henophilia.substack.com)
Introducing Avatarism: A Rational Framework for Building actual Heaven
ratiba ro
Dec 15, 2024, 5:17 PM
2
points
2
comments
2
min read
LW
link
A Public Choice Take on Effective Altruism
vaishnav92
Dec 15, 2024, 4:58 PM
9
points
4
comments
3
min read
LW
link
(www.optimaloutliers.com)
World Models I’m Currently Building
temporary
Dec 15, 2024, 4:29 PM
5
points
1
comment
1
min read
LW
link
(samuelshadrach.com)
Dress Up For Secular Solstice
Gordon H.S.
Dec 15, 2024, 4:28 PM
33
points
13
comments
7
min read
LW
link
Remap your caps lock key
bilalchughtai
Dec 15, 2024, 2:03 PM
80
points
18
comments
1
min read
LW
link
Effective Evil’s AI Misalignment Plan
lsusr
Dec 15, 2024, 7:39 AM
83
points
9
comments
3
min read
LW
link
Write Good Enough Code, Quickly
Oliver Daniels
Dec 15, 2024, 4:45 AM
19
points
10
comments
8
min read
LW
link
How to Edit an Essay into a Solstice Speech?
Czynski
Dec 15, 2024, 4:30 AM
5
points
1
comment
1
min read
LW
link
(thepdv.wordpress.com)
How Your Physiology Affects the Mind’s Projection Fallacy
YanLyutnev
Dec 14, 2024, 9:10 PM
−1
points
0
comments
6
min read
LW
link
Introducing the Evidence Color Wheel
Larry Lee
Dec 14, 2024, 4:08 PM
6
points
0
comments
3
min read
LW
link
An Illustrated Summary of “Robust Agents Learn Causal World Model”
Dalcy
Dec 14, 2024, 3:02 PM
67
points
2
comments
10
min read
LW
link
Best-of-N Jailbreaking
John Hughes
,
saraprice
,
Aengus Lynch
,
Rylan Schaeffer
,
Fazl
,
Henry Sleight
,
Ethan Perez
and
mrinank_sharma
Dec 14, 2024, 4:58 AM
78
points
5
comments
2
min read
LW
link
(arxiv.org)
D&D.Sci Dungeonbuilding: the Dungeon Tournament
aphyer
Dec 14, 2024, 4:30 AM
49
points
16
comments
3
min read
LW
link
Creating Interpretable Latent Spaces with Gradient Routing
Jacob G-W
Dec 14, 2024, 4:00 AM
26
points
6
comments
2
min read
LW
link
(jacobgw.com)
Probability of death by suicide by a 26 year old
John Wiseman
Dec 14, 2024, 3:33 AM
−25
points
4
comments
1
min read
LW
link
Matryoshka Sparse Autoencoders
Noa Nabeshima
Dec 14, 2024, 2:52 AM
98
points
15
comments
11
min read
LW
link
[Question]
What is MIRI currently doing?
Roko
Dec 14, 2024, 2:39 AM
32
points
14
comments
1
min read
LW
link
The o1 System Card Is Not About o1
Zvi
Dec 13, 2024, 8:30 PM
116
points
5
comments
16
min read
LW
link
(thezvi.wordpress.com)
Arch-anarchy and The Fable of the Dragon-Tyrant
Peter lawless
Dec 13, 2024, 8:15 PM
−10
points
0
comments
1
min read
LW
link
Communications in Hard Mode (My new job at MIRI)
tanagrabeast
Dec 13, 2024, 8:13 PM
206
points
25
comments
5
min read
LW
link
First Thoughts on Detachmentism
Jacob Peterson
Dec 13, 2024, 1:19 AM
−11
points
5
comments
9
min read
LW
link
How to Build Heaven: A Constrained Boltzmann Brain Generator
High Tides
Dec 13, 2024, 1:04 AM
−8
points
3
comments
5
min read
LW
link
Representing Irrationality in Game Theory
Larry Lee
Dec 13, 2024, 12:50 AM
−1
points
3
comments
11
min read
LW
link
“Charity” as a conflationary alliance term
Jan_Kulveit
Dec 12, 2024, 9:49 PM
35
points
2
comments
5
min read
LW
link
Just one more exposure bro
Chipmonk
Dec 12, 2024, 9:37 PM
52
points
6
comments
2
min read
LW
link
(chrislakin.blog)
The Dangers of Mirrored Life
Niko_McCarty
and
fin
Dec 12, 2024, 8:58 PM
119
points
9
comments
29
min read
LW
link
(www.asimov.press)
Effective Networking as Sending Hard to Fake Signals
vaishnav92
Dec 12, 2024, 8:32 PM
26
points
2
comments
7
min read
LW
link
(www.optimaloutliers.com)
Mini PAPR Review
jefftk
Dec 12, 2024, 7:10 PM
10
points
0
comments
2
min read
LW
link
(www.jefftk.com)
Biological risk from the mirror world
jasoncrawford
Dec 12, 2024, 7:07 PM
334
points
38
comments
7
min read
LW
link
(newsletter.rootsofprogress.org)
Naturalistic dualism
Arturo Macias
Dec 12, 2024, 4:19 PM
−4
points
0
comments
4
min read
LW
link
AI #94: Not Now, Google
Zvi
Dec 12, 2024, 3:40 PM
49
points
3
comments
64
min read
LW
link
(thezvi.wordpress.com)
Consciousness, Intelligence, and AI – Some Quick Notes [call it a mini-ramble]
Bill Benzon
Dec 12, 2024, 3:04 PM
−3
points
0
comments
4
min read
LW
link
The Dissolution of AI Safety
Roko
Dec 12, 2024, 10:34 AM
8
points
44
comments
1
min read
LW
link
(www.transhumanaxiology.com)
Is Optimization Correct?
Yoshinori Okamoto
Dec 12, 2024, 10:27 AM
−9
points
0
comments
2
min read
LW
link
AXRP Episode 38.3 - Erik Jenner on Learned Look-Ahead
DanielFilan
Dec 12, 2024, 5:40 AM
20
points
0
comments
16
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel