Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Anti MMAcevedo Protocol
Logan Zoellner
Apr 16, 2024, 10:32 PM
1
point
1
comment
8
min read
LW
link
Transformers Represent Belief State Geometry in their Residual Stream
Adam Shai
Apr 16, 2024, 9:16 PM
419
points
100
comments
12
min read
LW
link
Tinker
Richard_Ngo
Apr 16, 2024, 6:26 PM
38
points
0
comments
1
min read
LW
link
(press.asimov.com)
Paul Christiano named as US AI Safety Institute Head of AI Safety
Joel Burget
Apr 16, 2024, 4:22 PM
256
points
58
comments
1
min read
LW
link
(www.commerce.gov)
Creating unrestricted AI Agents with Command R+
Simon Lermen
Apr 16, 2024, 2:52 PM
77
points
13
comments
5
min read
LW
link
What should the EA community learn from the FTX / SBF disaster? An in-depth discussion with Will MacAskill on the Clearer Thinking podcast
spencerg
Apr 16, 2024, 1:11 PM
20
points
0
comments
LW
link
(podcast.clearerthinking.org)
{Book Summary} The Art of Gathering
Tristan Williams
Apr 16, 2024, 10:48 AM
28
points
0
comments
LW
link
Essay competition on the Automation of Wisdom and Philosophy — $25k in prizes
owencb
and
AI Impacts
Apr 16, 2024, 10:10 AM
82
points
12
comments
8
min read
LW
link
(blog.aiimpacts.org)
Announcing SPAR Summer 2024!
laurenmarie12
Apr 16, 2024, 8:30 AM
30
points
2
comments
1
min read
LW
link
The argument for near-term human disempowerment through AI
Chris_Leong
Apr 16, 2024, 4:50 AM
21
points
2
comments
1
min read
LW
link
(link.springer.com)
My experience using financial commitments to overcome akrasia
William Howard
Apr 15, 2024, 10:57 PM
137
points
33
comments
18
min read
LW
link
An evaluation of circuit evaluation metrics
Iván Arcuschin
,
Niels uit de Bos
and
Adrià Garriga-alonso
Apr 15, 2024, 7:38 PM
18
points
0
comments
4
min read
LW
link
Experiments with an alternative method to promote sparsity in sparse autoencoders
Eoin Farrell
Apr 15, 2024, 6:21 PM
29
points
7
comments
12
min read
LW
link
Effectively Handling Disagreements—Introducing a New Workshop
Camille Berger
Apr 15, 2024, 4:33 PM
37
points
2
comments
7
min read
LW
link
Four Local Gigs
jefftk
Apr 15, 2024, 4:00 PM
8
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Taking into account preferences of past selves
Jacob G-W
Apr 15, 2024, 1:15 PM
14
points
9
comments
7
min read
LW
link
Monthly Roundup #17: April 2024
Zvi
Apr 15, 2024, 12:10 PM
54
points
4
comments
76
min read
LW
link
(thezvi.wordpress.com)
Reconsider the anti-cavity bacteria if you are Asian
Lao Mein
Apr 15, 2024, 7:02 AM
170
points
43
comments
4
min read
LW
link
Anthropic AI made the right call
bhauth
Apr 15, 2024, 12:39 AM
22
points
20
comments
1
min read
LW
link
May 2024 Newton meetup???
duck_master
Apr 14, 2024, 10:28 PM
2
points
0
comments
1
min read
LW
link
Clipboard Filtering
jefftk
Apr 14, 2024, 8:50 PM
25
points
1
comment
1
min read
LW
link
(www.jefftk.com)
A High Decoupling Failure
Maxwell Tabarrok
Apr 14, 2024, 7:46 PM
37
points
5
comments
3
min read
LW
link
(www.maximum-progress.com)
ACX Zwolle meetup
Shaedys
Apr 14, 2024, 1:09 PM
7
points
0
comments
1
min read
LW
link
A quick experiment on LMs’ inductive biases in performing search
Alex Mallen
Apr 14, 2024, 3:41 AM
32
points
2
comments
4
min read
LW
link
UDT1.01 Essential Miscellanea (4/10)
Diffractor
Apr 14, 2024, 2:23 AM
19
points
0
comments
10
min read
LW
link
[Cosmology Talks] New Probability Axioms Could Fix Cosmology’s Multiverse (Partially) - Sylvia Wenmackers
mako yass
Apr 14, 2024, 1:26 AM
18
points
2
comments
1
min read
LW
link
(www.youtube.com)
Speedrun ruiner research idea
lemonhope
Apr 13, 2024, 11:42 PM
2
points
11
comments
2
min read
LW
link
Text Posts from the Kids Group: 2020
jefftk
Apr 13, 2024, 10:30 PM
69
points
3
comments
19
min read
LW
link
(www.jefftk.com)
[Question]
What convincing warning shot could help prevent extinction from AI?
Charbel-Raphaël
and
cozyfractal
Apr 13, 2024, 6:09 PM
108
points
22
comments
2
min read
LW
link
My experience at ML4Good AI Safety Bootcamp
TheManxLoiner
Apr 13, 2024, 10:55 AM
21
points
1
comment
5
min read
LW
link
Consequentialism is a compass, not a judge
Neil
Apr 13, 2024, 10:47 AM
26
points
6
comments
2
min read
LW
link
Carl Sagan, nuking the moon, and not nuking the moon
eukaryote
Apr 13, 2024, 4:08 AM
104
points
8
comments
6
min read
LW
link
(eukaryotewritesblog.com)
[Question]
Barcoding LLM Training Data Subsets. Anyone trying this for interpretability?
right..enough?
Apr 13, 2024, 3:09 AM
7
points
0
comments
7
min read
LW
link
Prompts for Big-Picture Planning
Raemon
Apr 13, 2024, 3:04 AM
72
points
1
comment
3
min read
LW
link
Claude wants to be conscious
Joe Kwon
Apr 13, 2024, 1:40 AM
2
points
8
comments
6
min read
LW
link
Things Solenoid Narrates
Solenoid_Entity
Apr 12, 2024, 11:57 PM
45
points
2
comments
2
min read
LW
link
MIRI’s April 2024 Newsletter
Harlan
Apr 12, 2024, 11:38 PM
95
points
0
comments
3
min read
LW
link
(intelligence.org)
Poker, Beef Wellington, and Mount Stupid
boghan
Apr 12, 2024, 6:06 PM
10
points
2
comments
7
min read
LW
link
Forecasting
A*
Apr 12, 2024, 5:55 PM
4
points
0
comments
1
min read
LW
link
Generalized Stat Mech: The Boltzmann Approach
David Lorell
and
johnswentworth
Apr 12, 2024, 5:47 PM
71
points
7
comments
20
min read
LW
link
AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AI
Corin Katzke
,
Alexa Pan
and
Dan H
Apr 12, 2024, 4:10 PM
13
points
0
comments
9
min read
LW
link
(newsletter.safe.ai)
“How the Gaza Health Ministry Fakes Casualty Numbers”
CronoDAS
Apr 12, 2024, 5:57 AM
−11
points
9
comments
1
min read
LW
link
(www.tabletmag.com)
UDT1.01: Plannable and Unplanned Observations (3/10)
Diffractor
Apr 12, 2024, 5:24 AM
31
points
0
comments
7
min read
LW
link
Report: Evaluating an AI Chip Registration Policy
Deric Cheng
Apr 12, 2024, 4:39 AM
25
points
0
comments
5
min read
LW
link
(www.convergenceanalysis.org)
Interference Issues
jefftk
Apr 12, 2024, 2:30 AM
17
points
1
comment
3
min read
LW
link
(www.jefftk.com)
A D&D.Sci Dodecalogue
abstractapplic
Apr 12, 2024, 1:10 AM
56
points
0
comments
3
min read
LW
link
[Question]
Upcoming unambiguously good tech possibilities? (Like eg indoor plumbing)
lemonhope
Apr 11, 2024, 11:14 PM
9
points
6
comments
1
min read
LW
link
Leave No Context Behind—A Comment
Gunnar_Zarncke
Apr 11, 2024, 10:50 PM
18
points
0
comments
2
min read
LW
link
AXRP Episode 27 - AI Control with Buck Shlegeris and Ryan Greenblatt
DanielFilan
Apr 11, 2024, 9:30 PM
69
points
10
comments
107
min read
LW
link
ChatGPT defines 10 concrete terms: generically, for 5- and 11-year-olds, and for a scientist
Bill Benzon
Apr 11, 2024, 8:27 PM
3
points
9
comments
6
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel