Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
Page
1
Leading The Parade
johnswentworth
Jan 31, 2024, 10:39 PM
148
points
31
comments
9
min read
LW
link
Proposal for an AI Safety Prize
sweenesm
Jan 31, 2024, 6:35 PM
3
points
0
comments
2
min read
LW
link
Literally Everything is Infinite
Spiral
Jan 31, 2024, 6:31 PM
−9
points
8
comments
5
min read
LW
link
What fuels your ambition?
Cissy
Jan 31, 2024, 6:30 PM
29
points
1
comment
5
min read
LW
link
(www.moremyself.xyz)
“Genlangs” and Zipf’s Law: Do languages generated by ChatGPT statistically look human?
Justin-Diamond
Jan 31, 2024, 6:30 PM
2
points
2
comments
1
min read
LW
link
(arxiv.org)
AI, Intellectual Property, and the Techno-Optimist Revolution
Justin-Diamond
Jan 31, 2024, 6:30 PM
1
point
0
comments
1
min read
LW
link
(www.researchgate.net)
My Alignment “Plan”: Avoid Strong Optimisation and Align Economy
VojtaKovarik
Jan 31, 2024, 5:03 PM
24
points
9
comments
7
min read
LW
link
Where freedom comes from
Logan Kieller
Jan 31, 2024, 4:53 PM
−5
points
1
comment
3
min read
LW
link
(logankieller.substack.com)
Per protocol analysis as medical malpractice
braces
Jan 31, 2024, 4:22 PM
53
points
8
comments
1
min read
LW
link
Adam Smith Meets AI Doomers
James_Miller
Jan 31, 2024, 3:53 PM
34
points
10
comments
5
min read
LW
link
Ten Modes of Culture War Discourse
jchan
Jan 31, 2024, 1:58 PM
54
points
15
comments
15
min read
LW
link
Without Fundamental Advances, Rebellion and Coup d’État are the Inevitable Outcomes of Dictators & Monarchs Trying to Control Large, Capable Countries
Roko
Jan 31, 2024, 10:14 AM
27
points
34
comments
1
min read
LW
link
Explaining Impact Markets
Saul Munn
Jan 31, 2024, 9:51 AM
95
points
2
comments
3
min read
LW
link
(www.brasstacks.blog)
Exploring OpenAI’s Latent Directions: Tests, Observations, and Poking Around
Johnny Lin
Jan 31, 2024, 6:01 AM
26
points
4
comments
14
min read
LW
link
Clip keys together with tiny carabiners
Brendan Long
Jan 31, 2024, 4:26 AM
11
points
5
comments
1
min read
LW
link
The problem with proportional extrapolation
pathos_bot
Jan 30, 2024, 11:40 PM
8
points
0
comments
1
min read
LW
link
Counterfactual Mechanism Networks
StrivingForLegibility
Jan 30, 2024, 8:30 PM
4
points
0
comments
5
min read
LW
link
Control vs Selection: Civilisation is best at control, but navigating AGI requires selection
VojtaKovarik
Jan 30, 2024, 7:06 PM
7
points
1
comment
1
min read
LW
link
AI governance frames
NathanBarnard
Jan 30, 2024, 6:18 PM
3
points
0
comments
3
min read
LW
link
Deciding What Project/Org to Start: A Guide to Prioritization Research
Alexandra Bos
Jan 30, 2024, 6:13 PM
8
points
0
comments
LW
link
on neodymium magnets
bhauth
Jan 30, 2024, 3:58 PM
47
points
6
comments
4
min read
LW
link
(www.bhauth.com)
[Question]
Can we create self-improving AIs that perfect their own ethics?
Gabi QUENE
Jan 30, 2024, 2:45 PM
1
point
10
comments
1
min read
LW
link
Childhood and Education Roundup #4
Zvi
Jan 30, 2024, 1:50 PM
44
points
10
comments
24
min read
LW
link
(thezvi.wordpress.com)
Last call for submissions for TAIS 2024!
Blaine
Jan 30, 2024, 12:08 PM
4
points
0
comments
1
min read
LW
link
(tais2024.cc)
[Question]
Has anyone actually changed their mind regarding Sleeping Beauty problem?
Ape in the coat
Jan 30, 2024, 8:34 AM
15
points
50
comments
1
min read
LW
link
San Fernando Valley Rationality: February 15, 2024
Thomas Broadley
Jan 30, 2024, 4:40 AM
3
points
0
comments
1
min read
LW
link
The case for more ambitious language model evals
Jozdien
Jan 30, 2024, 12:01 AM
117
points
30
comments
5
min read
LW
link
A short ‘derivation’ of Watanabe’s Free Energy Formula
Wuschel Schulz
Jan 29, 2024, 11:41 PM
13
points
6
comments
7
min read
LW
link
How important is AI hacking as LLMs advance?
Artyom Karpov
Jan 29, 2024, 6:41 PM
1
point
0
comments
6
min read
LW
link
LLM Psychometrics: A Speculative Approach to AI Safety
pskl
Jan 29, 2024, 6:38 PM
3
points
4
comments
1
min read
LW
link
(pascal.cc)
[Question]
How to write better?
TeaTieAndHat
Jan 29, 2024, 5:02 PM
8
points
24
comments
1
min read
LW
link
Processor clock speeds are not how fast AIs think
Ege Erdil
Jan 29, 2024, 2:39 PM
135
points
55
comments
2
min read
LW
link
Natural selection for ingame character build optimisation
Kongo Landwalker
Jan 29, 2024, 11:34 AM
8
points
5
comments
2
min read
LW
link
Analogy Bank for AI Safety
utilistrutil
Jan 29, 2024, 2:35 AM
23
points
0
comments
8
min read
LW
link
Minneapolis-St Paul ACX Article Club: Meditation and LSD
25Hour
Jan 29, 2024, 1:24 AM
3
points
0
comments
1
min read
LW
link
Simple distribution approximation: When sampled 100 times, can language models yield 80% A and 20% B?
Teun van der Weij
,
Felix Hofstätter
and
Francis Rhys Ward
Jan 29, 2024, 12:24 AM
39
points
5
comments
4
min read
LW
link
Why I take short timelines seriously
NicholasKees
Jan 28, 2024, 10:27 PM
122
points
29
comments
4
min read
LW
link
Win Friends and Influence People Ch. 2: The Bombshell
gull
Jan 28, 2024, 9:40 PM
37
points
13
comments
17
min read
LW
link
(www.google.com)
Riga ACX February 2024 Meetup: 2023 in Review
Anastasia
Jan 28, 2024, 9:36 PM
4
points
0
comments
1
min read
LW
link
Things You’re Allowed to Do: At the Dentist
rbinnn
Jan 28, 2024, 6:39 PM
39
points
16
comments
1
min read
LW
link
(metavee.github.io)
[Question]
What exactly did that great AI future involve again?
lemonhope
Jan 28, 2024, 10:10 AM
9
points
27
comments
1
min read
LW
link
Palworld development blog post
bhauth
Jan 28, 2024, 5:56 AM
82
points
12
comments
1
min read
LW
link
(note.com)
Virtually Rational—VRChat Meetup
Tomás B.
and
the gears to ascension
Jan 28, 2024, 5:52 AM
25
points
3
comments
1
min read
LW
link
[Stanford Daily] Table Talk
sudo
Jan 28, 2024, 3:15 AM
6
points
1
comment
9
min read
LW
link
(stanforddaily.com)
AI Law-a-Thon
Iknownothing
Jan 28, 2024, 2:30 AM
5
points
3
comments
1
min read
LW
link
Chapter 1 of How to Win Friends and Influence People
gull
Jan 28, 2024, 12:32 AM
51
points
5
comments
17
min read
LW
link
(www.google.com)
Epistemic Hell
rogersbacon
Jan 27, 2024, 5:13 PM
71
points
20
comments
14
min read
LW
link
David Burns Thinks Psychotherapy Is a Learnable Skill. Git Gud.
Morpheus
Jan 27, 2024, 1:21 PM
28
points
20
comments
11
min read
LW
link
(podcast.clearerthinking.org)
Aligned AI is dual use technology
lc
Jan 27, 2024, 6:50 AM
58
points
31
comments
2
min read
LW
link
Questions I’d Want to Ask an AGI+ to Test Its Understanding of Ethics
sweenesm
Jan 26, 2024, 11:40 PM
14
points
6
comments
4
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel