[Question]
Examples of Low Status Fun
niplav
Oct 10, 2023, 11:19 PM
18
points
17
comments
1
min read
LW
link
A New Model for Compute Center Verification
Damin Curtis
Oct 10, 2023, 7:22 PM
8
points
0
comments
5
min read
LW
link
Announcing MIRI’s new CEO and leadership team
Gretta Duleba
Oct 10, 2023, 7:22 PM
222
points
52
comments
3
min read
LW
link
18 Heterodox lenses to look the world through
Shaurya Gupta
Oct 10, 2023, 6:33 PM
−1
points
2
comments
5
min read
LW
link
Documenting Journey Into AI Safety
jacobhaimes
Oct 10, 2023, 6:30 PM
17
points
4
comments
6
min read
LW
link
Looking for AI Art Collaborators!
beatrice@foresight.org
Oct 10, 2023, 6:24 PM
1
point
0
comments
1
min read
LW
link
Childhood Roundup #3
Zvi
Oct 10, 2023, 2:30 PM
49
points
3
comments
30
min read
LW
link
(thezvi.wordpress.com)
My simple model for Alignment vs Capability
ryan_b
Oct 10, 2023, 12:07 PM
7
points
0
comments
7
min read
LW
link
Next year in Jerusalem: The brilliant ideas and radiant legacy of Miriam Lipschutz Yevick [in relation to current AI debates]
Bill Benzon
Oct 10, 2023, 9:06 AM
1
point
0
comments
1
min read
LW
link
(3quarksdaily.com)
I’m a Former Israeli Officer. AMA
Yovel Rom
Oct 10, 2023, 8:33 AM
78
points
70
comments
1
min read
LW
link
Become a PIBBSS Research Affiliate
Nora_Ammann
and
DusanDNesic
Oct 10, 2023, 7:41 AM
24
points
6
comments
6
min read
LW
link
My 1st month at a “neurodivergent gifted school” called Minerva University
exanova
Oct 10, 2023, 3:34 AM
4
points
1
comment
1
min read
LW
link
(inawe.substack.com)
Epistemic Motif of Abstract-Concrete Cycles & Domain Expansion
Dalcy
Oct 10, 2023, 3:28 AM
26
points
2
comments
3
min read
LW
link
Simple Terminal Colors
jefftk
Oct 10, 2023, 12:40 AM
11
points
1
comment
1
min read
LW
link
(www.jefftk.com)
The Handbook of Rationality (2021, MIT press) is now open access
romeostevensit
Oct 10, 2023, 12:30 AM
48
points
4
comments
1
min read
LW
link
Non-superintelligent paperclip maximizers are normal
jessicata
Oct 10, 2023, 12:29 AM
67
points
4
comments
9
min read
LW
link
(unstableontology.com)
The Witching Hour
Richard_Ngo
Oct 10, 2023, 12:19 AM
113
points
1
comment
9
min read
LW
link
(www.narrativeark.xyz)
One: a story
Richard_Ngo
Oct 10, 2023, 12:18 AM
30
points
0
comments
4
min read
LW
link
(www.narrativeark.xyz)
Truthseeking when your disagreements lie in moral philosophy
Elizabeth
and
Tristan Williams
Oct 10, 2023, 12:00 AM
99
points
4
comments
4
min read
LW
link
(acesounderglass.com)
NYT on the Manifest forecasting conference
Austin Chen
Oct 9, 2023, 9:40 PM
45
points
14
comments
LW
link
(www.nytimes.com)
Forecasting and prediction markets
CarlJ
Oct 9, 2023, 8:43 PM
3
points
0
comments
1
min read
LW
link
Comparing Two Forecasters in an Ideal World
nikos
Oct 9, 2023, 7:52 PM
5
points
0
comments
6
min read
LW
link
The case for aftermarket blind spot mirrors
Brendan Long
Oct 9, 2023, 7:30 PM
59
points
14
comments
2
min read
LW
link
(www.brendanlong.com)
New contractor role: Web security task force contractor for AI safety announcements
Ethan Ashkie
and
Andrew_Critch
Oct 9, 2023, 6:36 PM
11
points
0
comments
2
min read
LW
link
(survivalandflourishing.com)
[Question]
Anyone working on D. Amodei’s Bartlett show transcript?
Leopard
Oct 9, 2023, 6:17 PM
10
points
0
comments
1
min read
LW
link
Knowledge Base 3: Shopping advisor and other uses of knowledge base about products
iwis
Oct 9, 2023, 11:53 AM
0
points
0
comments
4
min read
LW
link
Knowledge Base 2: The structure and the method of building
iwis
Oct 9, 2023, 11:53 AM
2
points
4
comments
7
min read
LW
link
We don’t understand what happened with culture enough
Jan_Kulveit
Oct 9, 2023, 9:54 AM
87
points
22
comments
6
min read
LW
link
1
review
Leveraging Bayes’ Theorem to Supercharge Memory Techniques
disoha
Oct 9, 2023, 3:34 AM
−15
points
1
comment
4
min read
LW
link
Paper: Identifying the Risks of LM Agents with an LM-Emulated Sandbox—University of Toronto 2023 - Benchmark consisting of 36 high-stakes tools and 144 test cases!
Singularian2501
Oct 9, 2023, 12:00 AM
6
points
0
comments
1
min read
LW
link
AI Alignment Breakthroughs this week (10/08/23)
Logan Zoellner
Oct 8, 2023, 11:30 PM
30
points
14
comments
6
min read
LW
link
“The Heart of Gaming is the Power Fantasy”, and Cohabitive Games
Raemon
Oct 8, 2023, 9:02 PM
81
points
49
comments
4
min read
LW
link
(bottomfeeder.substack.com)
FAQ: What the heck is goal agnosticism?
porby
Oct 8, 2023, 7:11 PM
66
points
38
comments
28
min read
LW
link
Time is homogeneous sequentially-composable determination
TsviBT
Oct 8, 2023, 2:58 PM
15
points
0
comments
21
min read
LW
link
Linkpost: Are Emergent Abilities in Large Language Models just In-Context Learning?
Erich_Grunewald
Oct 8, 2023, 12:14 PM
12
points
7
comments
2
min read
LW
link
(arxiv.org)
Bird-eye view visualization of LLM activations
Sergii
Oct 8, 2023, 12:12 PM
11
points
2
comments
1
min read
LW
link
(grgv.xyz)
Perspective Based Reasoning Could Absolve CDT
dadadarren
Oct 8, 2023, 11:22 AM
4
points
5
comments
5
min read
LW
link
The Gradient – The Artificiality of Alignment
mic
Oct 8, 2023, 4:06 AM
12
points
1
comment
5
min read
LW
link
(thegradient.pub)
Comparing Anthropic’s Dictionary Learning to Ours
Robert_AIZI
Oct 7, 2023, 11:30 PM
137
points
8
comments
4
min read
LW
link
A thought about the constraints of debtlessness in online communities
mako yass
Oct 7, 2023, 9:26 PM
58
points
23
comments
1
min read
LW
link
Arguments for utilitarianism are impossibility arguments under unbounded prospects
MichaelStJules
Oct 7, 2023, 9:08 PM
7
points
7
comments
21
min read
LW
link
Sam Altman’s sister claims Sam sexually abused her—Part 1: Introduction, outline, author’s notes
pythagoras5015
Oct 7, 2023, 9:06 PM
95
points
108
comments
8
min read
LW
link
Griffin Island
jefftk
Oct 7, 2023, 6:40 PM
14
points
3
comments
1
min read
LW
link
(www.jefftk.com)
Every Mention of EA in “Going Infinite”
KirstenH
Oct 7, 2023, 2:42 PM
48
points
0
comments
8
min read
LW
link
(open.substack.com)
Fixing Insider Threats in the AI Supply Chain
Madhav Malhotra
Oct 7, 2023, 1:19 PM
20
points
2
comments
5
min read
LW
link
Contra Nora Belrose on Orthogonality Thesis Being Trivial
tailcalled
Oct 7, 2023, 11:47 AM
18
points
21
comments
1
min read
LW
link
Related Discussion from Thomas Kwa’s MIRI Research Experience
Raemon
Oct 7, 2023, 6:25 AM
71
points
140
comments
1
min read
LW
link
[Question]
Current State of Probabilistic Logic
lunatic_at_large
Oct 7, 2023, 5:06 AM
3
points
2
comments
1
min read
LW
link
On the Relationship Between Variability and the Evolutionary Outcomes of Systems in Nature
Artyom Shaposhnikov
Oct 7, 2023, 3:06 AM
2
points
0
comments
1
min read
LW
link
Announcing Dialogues
Ben Pace
Oct 7, 2023, 2:57 AM
155
points
59
comments
4
min read
LW
link