Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
2
Sparks of Consciousness
Charlie Sanders
Nov 13, 2024, 4:58 AM
2
points
0
comments
3
min read
LW
link
(www.dailymicrofiction.com)
Contra Musician Gender II
jefftk
Nov 13, 2024, 3:30 AM
9
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Flipping Out: The Cosmic Coinflip Thought Experiment Is Bad Philosophy
Joe Rogero
Nov 12, 2024, 11:55 PM
34
points
17
comments
4
min read
LW
link
Incentive design and capability elicitation
Joe Carlsmith
Nov 12, 2024, 8:56 PM
31
points
0
comments
12
min read
LW
link
The Humanitarian Economy
Kyle Furlong
Nov 12, 2024, 6:25 PM
1
point
14
comments
6
min read
LW
link
Current Attitudes Toward AI Provide Little Data Relevant to Attitudes Toward AGI
Seth Herd
Nov 12, 2024, 6:23 PM
19
points
2
comments
4
min read
LW
link
Basics of Handling Disagreements with People
Camille Berger
Nov 12, 2024, 5:55 PM
34
points
4
comments
6
min read
LW
link
Registrations Open for 2024 NYC Secular Solstice & Megameetup
Joe Rogero
and
Screwtape
Nov 12, 2024, 5:50 PM
13
points
0
comments
1
min read
LW
link
2024 NYC Secular Solstice & Megameetup
Joe Rogero
and
Screwtape
Nov 12, 2024, 5:46 PM
18
points
0
comments
1
min read
LW
link
2025 Q1 Pivotal Research Fellowship (Technical & Policy)
Tobias H
and
tilmanr
Nov 12, 2024, 10:56 AM
7
points
0
comments
2
min read
LW
link
Theories With Mentalistic Atoms Are As Validly Called Theories As Theories With Only Non-Mentalistic Atoms
Lorec
Nov 12, 2024, 6:45 AM
5
points
5
comments
8
min read
LW
link
The lying p value
kqr
Nov 12, 2024, 6:12 AM
7
points
7
comments
1
min read
LW
link
(entropicthoughts.com)
Modeling AI-driven occupational change over the next 10 years and beyond
2120eth
Nov 12, 2024, 4:58 AM
1
point
0
comments
2
min read
LW
link
How to Live Well: My Philosophy of Life
Philosofer123
Nov 12, 2024, 4:05 AM
−5
points
2
comments
1
min read
LW
link
The Packaging and the Payload
Screwtape
Nov 12, 2024, 3:07 AM
76
points
1
comment
5
min read
LW
link
Consider tabooing “I think”
Adam Zerner
Nov 12, 2024, 2:00 AM
8
points
2
comments
7
min read
LW
link
Festival Stats 2024
jefftk
Nov 12, 2024, 2:00 AM
10
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Not all biases are equal—a study of sycophancy and bias in fine-tuned LLMs
jakub_krys
Nov 11, 2024, 11:11 PM
8
points
0
comments
7
min read
LW
link
AI Craftsmanship
abramdemski
Nov 11, 2024, 10:17 PM
66
points
7
comments
4
min read
LW
link
Electric Grid Cyberattack: An AI-Informed Threat Model
moonlightmaze
Nov 11, 2024, 9:34 PM
22
points
0
comments
29
min read
LW
link
o1 is a bad idea
abramdemski
Nov 11, 2024, 9:20 PM
162
points
39
comments
2
min read
LW
link
Inferential Game: The Foraging (Ex-)Bandit
abstractapplic
Nov 11, 2024, 4:59 PM
27
points
4
comments
1
min read
LW
link
The Evals Gap
Marius Hobbhahn
Nov 11, 2024, 4:42 PM
55
points
7
comments
7
min read
LW
link
(www.apolloresearch.ai)
Summary: “Imagining and building wise machines: The centrality of AI metacognition” by Johnson, Karimi, Bengio, et al.
Chris_Leong
Nov 11, 2024, 4:13 PM
29
points
8
comments
8
min read
LW
link
(arxiv.org)
The Online Sports Gambling Experiment Has Failed
Zvi
Nov 11, 2024, 2:30 PM
287
points
59
comments
11
min read
LW
link
(thezvi.wordpress.com)
How I Learned That You Should Push Children Into Ponds
Bentham's Bulldog
Nov 11, 2024, 2:20 PM
−3
points
3
comments
4
min read
LW
link
The new ruling philosophy regarding AI
Mitchell_Porter
Nov 11, 2024, 1:28 PM
29
points
0
comments
5
min read
LW
link
What Ketamine Therapy Is Like
Sable
Nov 11, 2024, 11:09 AM
49
points
8
comments
6
min read
LW
link
(affablyevil.substack.com)
Spherical cow
dkl9
Nov 11, 2024, 3:10 AM
7
points
0
comments
1
min read
LW
link
(dkl9.net)
[Question]
how to truly feel my beliefs?
KvmanThinking
Nov 11, 2024, 12:04 AM
6
points
6
comments
1
min read
LW
link
Bay Winter Solstice 2024: song leading auditions
tcheasdfjkl
Nov 10, 2024, 11:59 PM
28
points
0
comments
1
min read
LW
link
[Question]
A Coordination Cookbook?
azergante
Nov 10, 2024, 11:20 PM
2
points
0
comments
1
min read
LW
link
Towards a Clever Hans Test: Unmasking Sentience Biases in Chatbot Interactions
glykokalyx
Nov 10, 2024, 10:34 PM
4
points
0
comments
1
min read
LW
link
Urbit New England Meetup
Conquerer Cohen
Nov 10, 2024, 5:56 PM
−4
points
0
comments
1
min read
LW
link
Personal AI Planning
jefftk
Nov 10, 2024, 2:00 PM
68
points
11
comments
2
min read
LW
link
(www.jefftk.com)
AI alignment via civilizational cognitive updates
AtillaYasar
Nov 10, 2024, 9:33 AM
1
point
10
comments
6
min read
LW
link
[Question]
How should vegans think about Methionine needs?
ChristianKl
Nov 10, 2024, 9:28 AM
32
points
3
comments
1
min read
LW
link
Is P(Doom) Meaningful? Bayesian vs. Popperian Epistemology Debate
Liron
Nov 9, 2024, 11:39 PM
5
points
1
comment
124
min read
LW
link
(www.youtube.com)
Bellevue Library Meetup—Nov 23
Cedar
Nov 9, 2024, 11:05 PM
5
points
3
comments
1
min read
LW
link
LifeKeeper Diaries: Exploring Misaligned AI Through Interactive Fiction
Tristan Tran
,
stijn
and
Mose Wintner
Nov 9, 2024, 8:58 PM
15
points
5
comments
2
min read
LW
link
[Question]
Poll: what’s your impression of altruism?
David Gross
Nov 9, 2024, 8:28 PM
2
points
4
comments
1
min read
LW
link
Chaos Theory in Ecology
Elizabeth
Nov 9, 2024, 5:50 PM
15
points
4
comments
20
min read
LW
link
(acesounderglass.com)
Some Comments on Recent AI Safety Developments
testingthewaters
Nov 9, 2024, 4:44 PM
4
points
0
comments
8
min read
LW
link
Formalize the Hashiness Model of AGI Uncontainability
Remmelt
Nov 9, 2024, 4:10 PM
3
points
0
comments
5
min read
LW
link
(docs.google.com)
Agenda Manipulation
Pazzaz
Nov 9, 2024, 2:13 PM
2
points
0
comments
3
min read
LW
link
Force Sequential Output with SCP?
jefftk
Nov 9, 2024, 12:40 PM
9
points
4
comments
1
min read
LW
link
(www.jefftk.com)
Anthropic teams up with Palantir and AWS to sell AI to defense customers
Matrice Jacobine
Nov 9, 2024, 11:50 AM
9
points
0
comments
2
min read
LW
link
(techcrunch.com)
GPT-4o Can In Some Cases Solve Moderately Complicated Captchas
dirk
Nov 9, 2024, 4:04 AM
12
points
2
comments
1
min read
LW
link
Stone Age Herbalist’s notes on ant warfare and slavery
trevor
Nov 9, 2024, 2:40 AM
32
points
0
comments
3
min read
LW
link
(x.com)
LLMs Look Increasingly Like General Reasoners
eggsyntax
Nov 8, 2024, 11:47 PM
94
points
45
comments
3
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel