GPT-8 may not be ASI
rvzlxax409
Nov 4, 2024, 7:31 PM
−2
points
1
comment
3
min read
LW
link
AI timelines don’t account for base rate of tech progress
rvzlxax409
Nov 4, 2024, 7:31 PM
−10
points
2
comments
1
min read
LW
link
Update on the Mysterious Trump Buyers on Polymarket
Annapurna
Nov 4, 2024, 7:22 PM
19
points
9
comments
1
min read
LW
link
(jorgevelez.substack.com)
[Intuitive self-models] 8. Rooting Out Free Will Intuitions
Steven Byrnes
Nov 4, 2024, 6:16 PM
70
points
19
comments
24
min read
LW
link
Option control
Joe Carlsmith
Nov 4, 2024, 5:54 PM
28
points
0
comments
54
min read
LW
link
[Question]
Noticing the World
EvolutionByDesign
Nov 4, 2024, 4:41 PM
4
points
1
comment
1
min read
LW
link
The current state of RSPs
Zach Stein-Perlman
Nov 4, 2024, 4:00 PM
23
points
2
comments
9
min read
LW
link
[Question]
Does the “ancient wisdom” argument have any validity? If a particular teaching or tradition is old, to what extent does this make it more trustworthy?
SpectrumDT
Nov 4, 2024, 3:20 PM
18
points
49
comments
1
min read
LW
link
A brief history of the automated corporation
owencb
Nov 4, 2024, 2:35 PM
26
points
1
comment
5
min read
LW
link
(strangecities.substack.com)
Abstractions are not Natural
Alfred Harwood
Nov 4, 2024, 11:10 AM
25
points
21
comments
11
min read
LW
link
[Linkpost] Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms
Gunnar_Zarncke
Nov 4, 2024, 10:15 AM
13
points
0
comments
1
min read
LW
link
(arxiv.org)
Context-dependent consequentialism
Jeremy Gillen and mattmacdermott
Nov 4, 2024, 9:29 AM
31
points
6
comments
27
min read
LW
link
Survival without dignity
L Rudolf L
Nov 4, 2024, 2:29 AM
369
points
29
comments
15
min read
LW
link
(nosetgauge.substack.com)
Drug development costs can range over two orders of magnitude
rossry
Nov 3, 2024, 11:13 PM
38
points
0
comments
11
min read
LW
link
Redefining Tolerance: Beyond Popper’s Paradox
mindprison
Nov 3, 2024, 10:23 PM
−1
points
0
comments
3
min read
LW
link
Goal: Understand Intelligence
Johannes C. Mayer
Nov 3, 2024, 9:20 PM
14
points
19
comments
1
min read
LW
link
Current safety training techniques do not fully transfer to the agent setting
Simon Lermen and Govind Pimpale
Nov 3, 2024, 7:24 PM
158
points
9
comments
5
min read
LW
link
Why our politicians aren’t Median
Yair Halberstadt
Nov 3, 2024, 2:03 PM
62
points
15
comments
3
min read
LW
link
Human Biodiversity (Part 4: Astral Codex Ten)
Evan_Gaensbauer
Nov 3, 2024, 4:20 AM
−13
points
6
comments
LW
link
(reflectivealtruism.com)
Understanding incomparability versus incommensurability in relation to RLHF
artemiocobb
Nov 2, 2024, 10:57 PM
1
point
1
comment
2
min read
LW
link
electric turbofans
bhauth
Nov 2, 2024, 10:50 PM
63
points
2
comments
5
min read
LW
link
(bhauth.com)
Reality as Category-Theoretic State Machines: A Mathematical Framework
Wenitte Apiou
Nov 2, 2024, 9:04 PM
−8
points
0
comments
2
min read
LW
link
The Median Researcher Problem
johnswentworth
Nov 2, 2024, 8:16 PM
157
points
70
comments
1
min read
LW
link
Testing “True” Language Understanding in LLMs: A Simple Proposal
MtryaSam
Nov 2, 2024, 7:12 PM
9
points
2
comments
2
min read
LW
link
Testing “True” Language Understanding in LLMs: A Simple Proposal
MtryaSam
Nov 2, 2024, 7:12 PM
−3
points
0
comments
2
min read
LW
link
Fragile, Robust, and Antifragile Preference Satisfaction
adamShimi
Nov 2, 2024, 5:25 PM
19
points
0
comments
5
min read
LW
link
(formethods.substack.com)
Higher Order Signs, Hallucination and Schizophrenia
Nicolas Villarreal
Nov 2, 2024, 4:33 PM
3
points
0
comments
13
min read
LW
link
(nicolasdvillarreal.substack.com)
[Question]
Is OpenAI net negative for AI Safety?
Lysandre Terrisse
Nov 2, 2024, 4:18 PM
4
points
0
comments
1
min read
LW
link
Two arguments against longtermist thought experiments
momom2
Nov 2, 2024, 10:22 AM
15
points
5
comments
3
min read
LW
link
Both-Sidesism—When Fair & Balanced Goes Wrong
James Stephen Brown
Nov 2, 2024, 3:04 AM
3
points
15
comments
6
min read
LW
link
(nonzerosum.games)
What can we learn from insecure domains?
Logan Zoellner
Nov 1, 2024, 11:53 PM
14
points
21
comments
1
min read
LW
link
Science advances one funeral at a time
Cameron Berg, Judd Rosenblatt, Diogo de Lucena and AE Studio
Nov 1, 2024, 11:06 PM
100
points
9
comments
2
min read
LW
link
The Cartesian Crisis
mindprison
Nov 1, 2024, 11:02 PM
−5
points
2
comments
2
min read
LW
link
Composition Circuits in Vision Transformers (Hypothesis)
phenomanon
Nov 1, 2024, 10:16 PM
1
point
0
comments
3
min read
LW
link
SAE Probing: What is it good for?
Subhash Kantamneni, Josh Engels, Senthooran Rajamanoharan and Neel Nanda
Nov 1, 2024, 7:23 PM
33
points
0
comments
11
min read
LW
link
[Question]
Set Theory Multiverse vs Mathematical Truth—Philosophical Discussion
Wenitte Apiou
Nov 1, 2024, 6:56 PM
8
points
25
comments
1
min read
LW
link
Educational CAI: Aligning a Language Model with Pedagogical Theories
Bharath Puranam
Nov 1, 2024, 6:55 PM
5
points
1
comment
13
min read
LW
link
Prediction markets and Taxes
Edmund Nelson
Nov 1, 2024, 5:39 PM
11
points
8
comments
1
min read
LW
link
Dentistry, Oral Surgeons, and the Inefficiency of Small Markets
GeneSmith
Nov 1, 2024, 5:26 PM
86
points
16
comments
5
min read
LW
link
Live Machinery: An Interface Design Philosophy for Wholesome AI Futures
Sahil
Nov 1, 2024, 5:24 PM
48
points
3
comments
35
min read
LW
link
Seeking Collaborators
abramdemski
Nov 1, 2024, 5:13 PM
62
points
15
comments
7
min read
LW
link
Complete Feedback
abramdemski
Nov 1, 2024, 4:58 PM
25
points
8
comments
3
min read
LW
link
Levers for Biological Progress—A Response to “Machines of Loving Grace”
Niko_McCarty
Nov 1, 2024, 4:35 PM
15
points
0
comments
20
min read
LW
link
(www.asimov.press)
2024 Unofficial LW Community Census, Request for Comments
Screwtape
Nov 1, 2024, 4:34 PM
23
points
32
comments
3
min read
LW
link
[Question]
When engaging with a large amount of resources during a literature review, how do you prevent yourself from becoming overwhelmed?
corruptedCatapillar
Nov 1, 2024, 7:29 AM
25
points
2
comments
3
min read
LW
link
(draft) Cyborg software should be open (?)
AtillaYasar
Nov 1, 2024, 7:24 AM
4
points
5
comments
3
min read
LW
link
Another UFO Bet
codyz
Nov 1, 2024, 1:55 AM
9
points
11
comments
1
min read
LW
link
Trading Candy
jefftk
Nov 1, 2024, 1:10 AM
28
points
4
comments
1
min read
LW
link
(www.jefftk.com)
JargonBot Beta Test
Raemon
Nov 1, 2024, 1:05 AM
84
points
55
comments
6
min read
LW
link
GPT-4o Guardrails Gone: Data Poisoning & Jailbreak-Tuning
ChengCheng, Brendan Murphy, AdamGleave and Kellin Pelrine
Nov 1, 2024, 12:10 AM
18
points
0
comments
6
min read
LW
link
(far.ai)