Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Caring only about your child may increase human suffering
Cipolla
Aug 8, 2024, 11:47 PM
−15
points
2
comments
8
min read
LW
link
The Hessian rank bounds the learning coefficient
Lucius Bushnaq
Aug 8, 2024, 8:55 PM
68
points
10
comments
4
min read
LW
link
GPT-4o System Card
Zach Stein-Perlman
Aug 8, 2024, 8:30 PM
68
points
11
comments
2
min read
LW
link
(openai.com)
Parasites (not a metaphor)
lemonhope
Aug 8, 2024, 8:07 PM
133
points
19
comments
1
min read
LW
link
Some Unorthodox Ways To Achieve High GDP Growth
johnswentworth
and
David Lorell
Aug 8, 2024, 6:58 PM
57
points
6
comments
6
min read
LW
link
You can remove GPT2’s LayerNorm by fine-tuning for an hour
StefanHex
Aug 8, 2024, 6:33 PM
166
points
11
comments
8
min read
LW
link
Leaving MIRI, Seeking Funding
abramdemski
Aug 8, 2024, 6:32 PM
264
points
19
comments
2
min read
LW
link
[Question]
Does VETLM solve AI superalignment?
Oleg Trott
Aug 8, 2024, 6:22 PM
−1
points
10
comments
1
min read
LW
link
Toy Models of Superposition: what about BitNets?
Alejandro Tlaie
Aug 8, 2024, 4:29 PM
5
points
1
comment
5
min read
LW
link
[LDSL#1] Performance optimization as a metaphor for life
tailcalled
Aug 8, 2024, 4:16 PM
32
points
6
comments
5
min read
LW
link
Four Randomized Control Trials In Economics
Maxwell Tabarrok
Aug 8, 2024, 3:59 PM
20
points
1
comment
4
min read
LW
link
(www.maximum-progress.com)
Cheap Whiteboards!
Johannes C. Mayer
Aug 8, 2024, 1:52 PM
27
points
2
comments
1
min read
LW
link
AI #76: Six Shorts Stories About OpenAI
Zvi
Aug 8, 2024, 1:50 PM
53
points
10
comments
48
min read
LW
link
(thezvi.wordpress.com)
[Question]
What the cost difference in processing input vs. output tokens with LLMs?
kotrfa
Aug 8, 2024, 10:43 AM
3
points
10
comments
1
min read
LW
link
Meno’s Paradox
Hudjefa
Aug 8, 2024, 5:59 AM
0
points
10
comments
1
min read
LW
link
Case Story: Lack of Consumer Protection Procedures AI Manipulation and the Threat of Fund Concentration in Crypto Seeking Assistance to Fund a Civil Case to Establish Facts and Protect Vulnerable Consumers from Damage Caused by Automated Systems
Petr 'Margot' Andreev
Aug 8, 2024, 5:55 AM
−9
points
0
comments
9
min read
LW
link
Motivation Theory
Zero Contradictions
Aug 8, 2024, 5:05 AM
3
points
0
comments
1
min read
LW
link
(thewaywardaxolotl.blogspot.com)
It’s time for a self-reproducing machine
Carl Feynman
Aug 7, 2024, 9:52 PM
102
points
70
comments
9
min read
LW
link
[LDSL#0] Some epistemological conundrums
tailcalled
Aug 7, 2024, 7:52 PM
55
points
11
comments
10
min read
LW
link
Help us seed AI Safety Brussels
gergogaspar
and
ENAIS
Aug 7, 2024, 6:32 AM
3
points
2
comments
3
min read
LW
link
Adaptive Coherence
Zero Contradictions
Aug 7, 2024, 6:17 AM
2
points
0
comments
2
min read
LW
link
(thewaywardaxolotl.blogspot.com)
Individual Utilities Shift Continuously as Geometric Weights Shift
StrivingForLegibility
Aug 7, 2024, 1:41 AM
2
points
0
comments
17
min read
LW
link
Gradient Ascenders Reach the Harsanyi Hyperplane
StrivingForLegibility
Aug 7, 2024, 1:40 AM
4
points
0
comments
6
min read
LW
link
Deriving the Geometric Utilitarian Weights
StrivingForLegibility
Aug 7, 2024, 1:39 AM
2
points
0
comments
11
min read
LW
link
Proving the Geometric Utilitarian Theorem
StrivingForLegibility
Aug 7, 2024, 1:39 AM
25
points
0
comments
8
min read
LW
link
The Geometric Importance of Side Payments
StrivingForLegibility
Aug 7, 2024, 1:38 AM
8
points
4
comments
3
min read
LW
link
Attention-Feature Tables in Gemma 2 Residual Streams
J Bostock
Aug 6, 2024, 10:56 PM
2
points
0
comments
14
min read
LW
link
[Question]
What are the strategic implications if aliens and Earth civilizations produce similar utilities?
Maxime Riché
Aug 6, 2024, 9:16 PM
4
points
1
comment
1
min read
LW
link
WTH is Cerebrolysin, actually?
gsfitzgerald
and
delton137
Aug 6, 2024, 8:40 PM
181
points
23
comments
17
min read
LW
link
The Pragmatic Side of Cryptographically Boxing AI
Bart Jaworski
Aug 6, 2024, 5:46 PM
6
points
0
comments
9
min read
LW
link
Inference-Only Debate Experiments Using Math Problems
Arjun Panickssery
,
Abhimanyu Pallavi Sudhir
and
JacksonKaunismaa
Aug 6, 2024, 5:44 PM
31
points
0
comments
2
min read
LW
link
[Question]
Is an AI religion justified?
p4rziv4l
Aug 6, 2024, 3:42 PM
−35
points
11
comments
1
min read
LW
link
Startup Roundup #2
Zvi
Aug 6, 2024, 1:30 PM
45
points
0
comments
32
min read
LW
link
(thezvi.wordpress.com)
Mechanistic Anomaly Detection Research Update
Nora Belrose
and
David Johnston
Aug 6, 2024, 10:33 AM
11
points
0
comments
1
min read
LW
link
(blog.eleuther.ai)
Reasoning is not search—a chess example
p.b.
Aug 6, 2024, 9:29 AM
4
points
3
comments
2
min read
LW
link
Broadly human level, cognitively complete AGI
p.b.
Aug 6, 2024, 9:26 AM
9
points
0
comments
1
min read
LW
link
Does Evolutionary Theory Imply Genetic Tribalism?
Zero Contradictions
Aug 6, 2024, 5:43 AM
0
points
1
comment
1
min read
LW
link
(thewaywardaxolotl.blogspot.com)
How I Learned To Stop Trusting Prediction Markets and Love the Arbitrage
orthonormal
Aug 6, 2024, 2:32 AM
200
points
30
comments
3
min read
LW
link
John Schulman leaves OpenAI for Anthropic [and then left Anthropic again for Thinking Machines]
Sodium
Aug 6, 2024, 1:23 AM
57
points
0
comments
1
min read
LW
link
Self-explaining SAE features
Dmitrii Kharlapenko
,
neverix
,
Neel Nanda
and
Arthur Conmy
Aug 5, 2024, 10:20 PM
62
points
13
comments
10
min read
LW
link
Value fragility and AI takeover
Joe Carlsmith
Aug 5, 2024, 9:28 PM
76
points
5
comments
30
min read
LW
link
Excursions into Sparse Autoencoders: What is monosemanticity?
Jakub Smékal
Aug 5, 2024, 7:22 PM
2
points
0
comments
10
min read
LW
link
Madrid—ACX Meetups Everywhere Fall 2024
Pablo Villalobos
Aug 5, 2024, 6:36 PM
4
points
0
comments
1
min read
LW
link
LLMs stifle creativity, eliminate opportunities for serendipitous discovery and disrupt intergenerational transfer of wisdom
Ghdz
5 Aug 2024 18:27 UTC
6
points
2
comments
7
min read
LW
link
Circular Reasoning
abramdemski
5 Aug 2024 18:10 UTC
91
points
40
comments
8
min read
LW
link
Fear of centralized power vs. fear of misaligned AGI: Vitalik Buterin on 80,000 Hours
Seth Herd
5 Aug 2024 15:38 UTC
66
points
22
comments
5
min read
LW
link
Four Phases of AGI
Gabe M
5 Aug 2024 13:15 UTC
13
points
3
comments
13
min read
LW
link
AI Safety at the Frontier: Paper Highlights, July ’24
gasteigerjo
5 Aug 2024 13:00 UTC
8
points
0
comments
7
min read
LW
link
(aisafetyfrontier.substack.com)
Game Theory and Society
Zero Contradictions
5 Aug 2024 4:27 UTC
4
points
0
comments
1
min read
LW
link
(thewaywardaxolotl.blogspot.com)
Near-mode thinking on AI
Olli Järviniemi
4 Aug 2024 20:47 UTC
128
points
9
comments
5
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel