how 2 tell if ur input is out of distribution given only model weights
dkirmani
Aug 5, 2023, 10:45 PM
48
points
10
comments
1
min read
LW
link
Summary of Improving Global Decision Making (around AI)
Will_Pearson
Aug 5, 2023, 6:46 PM
−7
points
0
comments
1
min read
LW
link
Ground-Truth Label Imbalance Impairs the Performance of Contrast-Consistent Search (and Other Contrast-Pair-Based Unsupervised Methods)
Tom Angsten
and
Ami Hays
Aug 5, 2023, 5:55 PM
6
points
2
comments
7
min read
LW
link
(drive.google.com)
Seattle Astral Codex Ten Monthly Social
a7x
Aug 5, 2023, 5:55 PM
1
point
0
comments
1
min read
LW
link
AISafety.info’s Writing & Editing Hackathon
smallsilo
Aug 5, 2023, 5:14 PM
2
points
0
comments
1
min read
LW
link
Join AISafety.info’s Writing & Editing Hackathon (Aug 25-28) (Prizes to be won!)
smallsilo
Aug 5, 2023, 2:08 PM
19
points
3
comments
1
min read
LW
link
(forum.effectivealtruism.org)
Stomach Ulcers and Dental Cavities
Metacelsus
Aug 5, 2023, 2:08 PM
57
points
7
comments
1
min read
LW
link
(denovo.substack.com)
video games > IQ tests
bhauth
Aug 5, 2023, 1:27 PM
35
points
46
comments
3
min read
LW
link
[Linkpost] Applicability of scaling laws to vision encoding models
Bogdan Ionut Cirstea
Aug 5, 2023, 11:10 AM
11
points
2
comments
1
min read
LW
link
A Naive Proposal for Constructing Interpretable AI
Chris_Leong
Aug 5, 2023, 10:32 AM
18
points
6
comments
2
min read
LW
link
ACX Paris Meetup—August 11 2023
PoignardAzur
Aug 5, 2023, 9:44 AM
2
points
0
comments
1
min read
LW
link
Meet Hyperion on Sunday Aug 6?
duck_master
Aug 5, 2023, 4:36 AM
1
point
0
comments
1
min read
LW
link
[Question]
What are the best published papers from outside the alignment community that are relevant to Agent Foundations?
Stephen Fowler
Aug 5, 2023, 3:02 AM
20
points
4
comments
1
min read
LW
link
Announcing Squiggle Hub
ozziegooen
and
Slava Matyukhin
Aug 5, 2023, 1:00 AM
49
points
4
comments
5
min read
LW
link
(forum.effectivealtruism.org)
Read More Books but Pretend to Read Even More
Arjun Panickssery
Aug 5, 2023, 12:07 AM
26
points
12
comments
4
min read
LW
link
(arjunpanickssery.substack.com)
The Sinews of Sudan’s Latest War
Tim Liptrot
Aug 4, 2023, 6:17 PM
43
points
12
comments
12
min read
LW
link
Private notes on LW?
Raemon
Aug 4, 2023, 5:35 PM
61
points
33
comments
1
min read
LW
link
When training AI, we should escalate the frequency of capability tests
Hauke Hillebrandt
Aug 4, 2023, 4:07 PM
2
points
0
comments
1
min read
LW
link
Manifund: What we’re funding (weeks 2-4)
Austin Chen
Aug 4, 2023, 4:00 PM
44
points
2
comments
LW
link
(manifund.substack.com)
[Linkpost] Multimodal Neurons in Pretrained Text-Only Transformers
Bogdan Ionut Cirstea
Aug 4, 2023, 3:29 PM
11
points
0
comments
1
min read
LW
link
Apollo Research is hiring evals and interpretability engineers & scientists
Marius Hobbhahn
Aug 4, 2023, 10:54 AM
25
points
0
comments
2
min read
LW
link
[Question]
Has anyone tried creating a YouTube or TikTok series covering the sequences?
Max Rossi
Aug 4, 2023, 12:10 AM
4
points
4
comments
1
min read
LW
link
[Question]
Is there any metric measuring ~”proportion of people creating extra value”?
Amal
Aug 3, 2023, 10:54 PM
7
points
3
comments
1
min read
LW
link
[Question]
Hypothetical: what would you do?
JNS
Aug 3, 2023, 10:39 PM
4
points
2
comments
1
min read
LW
link
[Linkpost] Deception Abilities Emerged in Large Language Models
Bogdan Ionut Cirstea
Aug 3, 2023, 5:28 PM
12
points
0
comments
1
min read
LW
link
Embedding Ethical Priors into AI Systems: A Bayesian Approach
Justausername
Aug 3, 2023, 3:31 PM
−5
points
3
comments
21
min read
LW
link
Password-locked models: a stress case for capabilities evaluation
Fabien Roger
Aug 3, 2023, 2:53 PM
156
points
14
comments
6
min read
LW
link
AI #23: Fundamental Problems with RLHF
Zvi
Aug 3, 2023, 12:50 PM
59
points
9
comments
41
min read
LW
link
(thezvi.wordpress.com)
Bad Imitation Instruments
jefftk
Aug 3, 2023, 2:30 AM
21
points
1
comment
1
min read
LW
link
(www.jefftk.com)
Kolmogorov’s theory of Algorithmic Probability
Aidan Rocke
Aug 3, 2023, 12:58 AM
5
points
2
comments
2
min read
LW
link
(keplerlounge.com)
Work culture creep
CrimsonChin
Aug 3, 2023, 12:38 AM
32
points
15
comments
8
min read
LW
link
[Question]
Boxing
Zach Stein-Perlman
Aug 2, 2023, 11:38 PM
6
points
1
comment
1
min read
LW
link
External rationality vs. internal rationality
metachirality
Aug 2, 2023, 11:29 PM
7
points
0
comments
1
min read
LW
link
When performing a dimensionality reduction on tensors, the trace is often zero.
Joseph Van Name
Aug 2, 2023, 9:06 PM
7
points
1
comment
3
min read
LW
link
Progress links digest, 2023-08-02: Superconductor edition
jasoncrawford
Aug 2, 2023, 8:27 PM
13
points
0
comments
3
min read
LW
link
(rootsofprogress.org)
[Question]
What works for ADHD and/or related things?
TeaTieAndHat
Aug 2, 2023, 6:37 PM
7
points
13
comments
1
min read
LW
link
[Question]
Would you pay for a search engine limited to rationalist sites?
Conor
Aug 2, 2023, 6:06 PM
4
points
19
comments
1
min read
LW
link
The Roots of Progress Blog-Building Intensive: advice for applicants, request for support
jasoncrawford
Aug 2, 2023, 3:37 PM
9
points
0
comments
1
min read
LW
link
(rootsofprogress.org)
3 levels of threat obfuscation
HoldenKarnofsky
Aug 2, 2023, 2:58 PM
69
points
14
comments
7
min read
LW
link
ChatGPT for translation
Varshul Gupta
Aug 2, 2023, 11:57 AM
1
point
0
comments
3
min read
LW
link
(dubverseblack.substack.com)
Long-Term Future Fund: April 2023 grant recommendations
abergal
,
calebp99
,
Linch
,
habryka
,
Thomas Larsen
and
Vaniver
Aug 2, 2023, 7:54 AM
81
points
3
comments
50
min read
LW
link
[Question]
Could we breed/engineer intelligent parrots?
lemonhope
Aug 2, 2023, 7:32 AM
9
points
18
comments
1
min read
LW
link
Anthropical Motte and Bailey in two versions of Sleeping Beauty
Ape in the coat
2 Aug 2023 7:08 UTC
32
points
56
comments
6
min read
LW
link
solar-thermal and techno-economic analysis
bhauth
2 Aug 2023 6:22 UTC
21
points
8
comments
5
min read
LW
link
(www.bhauth.com)
South Bay ACX/SSC Meetup @ Whole Foods
allisona
2 Aug 2023 3:44 UTC
1
point
0
comments
1
min read
LW
link
“Is There Anything That’s Worth More”
Zack_M_Davis
2 Aug 2023 3:28 UTC
64
points
6
comments
1
min read
LW
link
Bay Winter Solstice: call for speech pitches!
tcheasdfjkl
2 Aug 2023 3:24 UTC
9
points
0
comments
1
min read
LW
link
(docs.google.com)
[Question]
What is ontology?
Adam Zerner
2 Aug 2023 0:54 UTC
28
points
19
comments
1
min read
LW
link
My current LK99 questions
Eliezer Yudkowsky
1 Aug 2023 22:48 UTC
206
points
38
comments
5
min read
LW
link
Spiral Staircase
Michael Samoilov
1 Aug 2023 21:51 UTC
19
points
2
comments
2
min read
LW
link