Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
DeepSeek beats o1-preview on math, ties on coding; will release weights
Zach Stein-Perlman
Nov 20, 2024, 11:50 PM
113
points
26
comments
1
min read
LW
link
Expected Utility, Geometric Utility, and Other Equivalent Representations
StrivingForLegibility
Nov 20, 2024, 11:28 PM
10
points
0
comments
11
min read
LW
link
[Question]
Green thumb
Pug stanky
Nov 20, 2024, 9:52 PM
−12
points
1
comment
2
min read
LW
link
Cost, Not Sacrifice
Joe Rogero
Nov 20, 2024, 9:32 PM
75
points
13
comments
LW
link
(subatomicarticles.com)
China Hawks are Manufacturing an AI Arms Race
garrison
Nov 20, 2024, 6:17 PM
144
points
44
comments
LW
link
(garrisonlovely.substack.com)
Why I Think All The Species Of Significantly Debated Consciousness Are Conscious And Suffer Intensely
omnizoid
Nov 20, 2024, 4:48 PM
25
points
5
comments
33
min read
LW
link
aspirational leadership
dhruvmethi
Nov 20, 2024, 4:07 PM
2
points
0
comments
7
min read
LW
link
Zvi’s Thoughts on His 2nd Round of SFF
Zvi
Nov 20, 2024, 1:40 PM
91
points
2
comments
10
min read
LW
link
(thezvi.wordpress.com)
A Little Depth Goes a Long Way: the Expressive Power of Log-Depth Transformers
Bogdan Ionut Cirstea
Nov 20, 2024, 11:48 AM
16
points
0
comments
1
min read
LW
link
(openreview.net)
[Question]
What changes should happen in the HHS?
ChristianKl
Nov 20, 2024, 11:04 AM
0
points
19
comments
1
min read
LW
link
[Question]
What are the good rationality films?
Ben Pace
Nov 20, 2024, 6:04 AM
83
points
54
comments
1
min read
LW
link
Valence Need Not Be Bounded; Utility Need Not Synthesize
Lorec
Nov 20, 2024, 1:37 AM
8
points
0
comments
6
min read
LW
link
Value/Utility: A History
Lorec
Nov 19, 2024, 11:01 PM
9
points
0
comments
10
min read
LW
link
Why Don’t We Just… Shoggoth+Face+Paraphraser?
Daniel Kokotajlo
and
abramdemski
Nov 19, 2024, 8:53 PM
145
points
58
comments
14
min read
LW
link
Every niche event should also be a meetup
DMMF
Nov 19, 2024, 8:47 PM
18
points
0
comments
3
min read
LW
link
(danfrank.ca)
U.S.-China Economic and Security Review Commission pushes Manhattan Project-style AI initiative
worse
Nov 19, 2024, 6:42 PM
56
points
7
comments
1
min read
LW
link
Intrinsic Power-Seeking: AI Might Seek Power for Power’s Sake
TurnTrout
Nov 19, 2024, 6:36 PM
40
points
5
comments
1
min read
LW
link
(turntrout.com)
Evolution’s selection target depends on your weighting
tailcalled
Nov 19, 2024, 6:24 PM
23
points
22
comments
1
min read
LW
link
AISN #44: The Trump Circle on AI Safety Plus, Chinese researchers used Llama to create a military tool for the PLA, a Google AI system discovered a zero-day cybersecurity vulnerability, and Complex Systems
Corin Katzke
,
Julius
,
andrewz
and
Dan H
Nov 19, 2024, 4:36 PM
9
points
0
comments
5
min read
LW
link
(newsletter.safe.ai)
Jakarta ACX December 2024 Meetup
Aud
Nov 19, 2024, 3:01 PM
1
point
0
comments
1
min read
LW
link
Visualizing small Attention-only Transformers
WCargo
Nov 19, 2024, 9:37 AM
4
points
0
comments
8
min read
LW
link
Americans are fat and sick—and it’s their fault…right?
Declan Molony
Nov 19, 2024, 6:41 AM
11
points
6
comments
7
min read
LW
link
Announcing the CLR Foundations Course and CLR S-Risk Seminars
JamesFaville
Nov 19, 2024, 1:18 AM
18
points
0
comments
LW
link
No Electricity in Manchuria
winstonBosan
Nov 19, 2024, 1:11 AM
25
points
0
comments
5
min read
LW
link
Looking back on the Future of Humanity Institute—Asterisk
jakeeaton
Nov 19, 2024, 12:44 AM
48
points
0
comments
1
min read
LW
link
Don’t Dismiss on Epistemics
ggex
Nov 19, 2024, 12:44 AM
8
points
3
comments
2
min read
LW
link
Training AI agents to solve hard problems could lead to Scheming
Marius Hobbhahn
and
AlexMeinke
Nov 19, 2024, 12:10 AM
61
points
12
comments
28
min read
LW
link
Proactive ‘If-Then’ Safety Cases
Nathan Helm-Burger
Nov 18, 2024, 9:16 PM
10
points
0
comments
4
min read
LW
link
[Question]
Will Orion/Gemini 2/Llama-4 outperform o1
LuigiPagani
Nov 18, 2024, 9:15 PM
2
points
3
comments
1
min read
LW
link
How to use bright light to improve your life.
Nat Martin
Nov 18, 2024, 7:32 PM
40
points
10
comments
10
min read
LW
link
Social events with plausible deniability
Chipmonk
Nov 18, 2024, 6:25 PM
25
points
24
comments
1
min read
LW
link
(chrislakin.blog)
How likely is brain preservation to work?
Andy_McKenzie
Nov 18, 2024, 4:58 PM
26
points
3
comments
6
min read
LW
link
Why imperfect adversarial robustness doesn’t doom AI control
Buck
and
Claude+
Nov 18, 2024, 4:05 PM
62
points
25
comments
2
min read
LW
link
Ethical Implications of the Quantum Multiverse
Jonah Wilberg
Nov 18, 2024, 4:00 PM
7
points
22
comments
6
min read
LW
link
Reducing x-risk might be actively harmful
MountainPath
Nov 18, 2024, 2:25 PM
5
points
5
comments
1
min read
LW
link
Monthly Roundup #24: November 2024
Zvi
Nov 18, 2024, 1:20 PM
44
points
14
comments
50
min read
LW
link
(thezvi.wordpress.com)
A Straightforward Explanation of the Good Regulator Theorem
Alfred Harwood
Nov 18, 2024, 12:45 PM
36
points
3
comments
14
min read
LW
link
The Choice Transition
owencb
and
Raymond Douglas
Nov 18, 2024, 12:30 PM
50
points
4
comments
15
min read
LW
link
(strangecities.substack.com)
Chat Bankman-Fried: an Exploration of LLM Alignment in Finance
claudia.biancotti
Nov 18, 2024, 9:38 AM
26
points
4
comments
1
min read
LW
link
Proposal to increase fertility: University parent clubs
Fluffnutt
Nov 18, 2024, 4:21 AM
17
points
3
comments
1
min read
LW
link
A small improvement to Wikipedia page on Pareto Efficiency
Edwin Evans
Nov 18, 2024, 2:13 AM
8
points
0
comments
1
min read
LW
link
[Question]
Why is Gemini telling the user to die?
Burny
Nov 18, 2024, 1:44 AM
13
points
1
comment
1
min read
LW
link
“It’s a 10% chance which I did 10 times, so it should be 100%”
egor.timatkov
Nov 18, 2024, 1:14 AM
154
points
59
comments
2
min read
LW
link
The Catastrophe of Shiny Objects
mindprison
Nov 18, 2024, 12:24 AM
−12
points
0
comments
3
min read
LW
link
Do Deep Neural Networks Have Brain-like Representations?: A Summary of Disagreements
Joseph Emerson
Nov 18, 2024, 12:07 AM
9
points
0
comments
26
min read
LW
link
Truth Terminal: A reconstruction of events
crvr.fr
and
MTorrents
Nov 17, 2024, 11:51 PM
3
points
1
comment
7
min read
LW
link
Which AI Safety Benchmark Do We Need Most in 2025?
Loïc Cabannes
and
William Ludington
Nov 17, 2024, 11:50 PM
2
points
2
comments
8
min read
LW
link
“The Solomonoff Prior is Malign” is a special case of a simpler argument
David Matolcsi
Nov 17, 2024, 9:32 PM
130
points
44
comments
12
min read
LW
link
Chess As The Model Game
criticalpoints
Nov 17, 2024, 7:45 PM
19
points
0
comments
8
min read
LW
link
(eregis.github.io)
The grass is always greener in the environment that shaped your values
Karl Faulks
Nov 17, 2024, 6:00 PM
8
points
0
comments
3
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel