Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
[Question]
What are some posthumanist/more-than-human approaches to definitions of intelligence and agency? Particularly in application to AI research.
Eli Hiton
Apr 9, 2024, 9:52 PM
1
point
0
comments
1
min read
LW
link
Ophiology (or, how the Mamba architecture works)
Danielle Ensign
,
SrGonao
and
Adrià Garriga-alonso
Apr 9, 2024, 7:31 PM
67
points
8
comments
10
min read
LW
link
Apply to LASR Labs: a London-based technical AI safety research programme
Erin Robertson
,
charlie_griffin
and
joehardie
Apr 9, 2024, 5:34 PM
45
points
1
comment
3
min read
LW
link
“Decentralized Autonomous Education”—Call for Reviewers (Seeds of Science)
rogersbacon
Apr 9, 2024, 2:39 PM
6
points
0
comments
1
min read
LW
link
D&D.Sci: The Mad Tyrant’s Pet Turtles [Evaluation and Ruleset]
abstractapplic
Apr 9, 2024, 2:01 PM
48
points
6
comments
3
min read
LW
link
Medical Roundup #2
Zvi
Apr 9, 2024, 1:40 PM
37
points
18
comments
16
min read
LW
link
(thezvi.wordpress.com)
[Closed] PIBBSS is hiring in a variety of roles (alignment research and incubation program)
Nora_Ammann
,
Lucas Teixeira
and
DusanDNesic
Apr 9, 2024, 8:12 AM
54
points
0
comments
3
min read
LW
link
Any evidence or reason to expect a multiverse / Everett branches?
lemonhope
Apr 9, 2024, 5:26 AM
9
points
125
comments
1
min read
LW
link
Fermenting Form
koratkar
Apr 9, 2024, 2:46 AM
19
points
2
comments
4
min read
LW
link
(careerscouting.substack.com)
[Question]
Non-ultimatum game problem
numpyNaN
Apr 8, 2024, 11:25 PM
9
points
4
comments
2
min read
LW
link
Pandemic Identification Simulator
jefftk
Apr 8, 2024, 7:00 PM
22
points
0
comments
1
min read
LW
link
(www.jefftk.com)
How We Picture Bayesian Agents
johnswentworth
and
David Lorell
Apr 8, 2024, 6:12 PM
70
points
14
comments
7
min read
LW
link
CEA seeks co-founder for AI safety group support spin-off
agucova
Apr 8, 2024, 3:42 PM
18
points
0
comments
LW
link
Investigating the role of agency in AI x-risk
Corin Katzke
Apr 8, 2024, 3:12 PM
10
points
0
comments
LW
link
(www.convergenceanalysis.org)
Measuring Learned Optimization in Small Transformer Models
J Bostock
Apr 8, 2024, 2:41 PM
22
points
0
comments
11
min read
LW
link
[Question]
Can singularity emerge from transformers?
MP
Apr 8, 2024, 2:26 PM
−3
points
1
comment
1
min read
LW
link
Gated Attention Blocks: Preliminary Progress toward Removing Attention Head Superposition
cmathw
,
Dennis Akar
and
Lee Sharkey
Apr 8, 2024, 11:14 AM
42
points
4
comments
15
min read
LW
link
Math-to-English Cheat Sheet
nahoj
Apr 8, 2024, 9:19 AM
54
points
5
comments
6
min read
LW
link
[Question]
What does it take to transfer the knowledge to action?
EL_File4138
Apr 8, 2024, 6:23 AM
3
points
7
comments
1
min read
LW
link
Normalizing Sparse Autoencoders
Fengyuan Hu
Apr 8, 2024, 6:17 AM
22
points
18
comments
13
min read
LW
link
A Dozen Ways to Get More Dakka
Davidmanheim
Apr 8, 2024, 4:45 AM
134
points
11
comments
3
min read
LW
link
[Crosspost] Introducing the Hypermanifest: Redefining AI’s Role in Human Connection and Interaction
simulacra.exe
Apr 7, 2024, 5:21 PM
4
points
0
comments
5
min read
LW
link
Applications Open: Elevate Your Mental Wellbeing with Rethink Wellbeing’s CBT Program
Inga G.
Apr 7, 2024, 2:03 PM
13
points
2
comments
LW
link
The Poker Theory of Poker Night
omark
Apr 7, 2024, 9:47 AM
29
points
13
comments
9
min read
LW
link
(www.codeandbugs.com)
Centrists are (probably) less biased
Kevin Dorst
Apr 7, 2024, 6:40 AM
1
point
2
comments
5
min read
LW
link
(kevindorst.substack.com)
on the dollar-yen exchange rate
bhauth
Apr 7, 2024, 4:49 AM
50
points
21
comments
10
min read
LW
link
(www.bhauth.com)
Conflict in Posthuman Literature
Martín Soto
Apr 6, 2024, 10:26 PM
40
points
1
comment
2
min read
LW
link
(twitter.com)
“Fractal Strategy” workshop report
Raemon
Apr 6, 2024, 9:26 PM
68
points
23
comments
10
min read
LW
link
The 2nd Demographic Transition
Maxwell Tabarrok
Apr 6, 2024, 2:10 PM
68
points
17
comments
4
min read
LW
link
(www.maximum-progress.com)
My intellectual journey to (dis)solve the hard problem of consciousness
Charbel-Raphaël
Apr 6, 2024, 9:32 AM
49
points
44
comments
30
min read
LW
link
Measuring Predictability of Persona Evaluations
Thee Ho
and
evhub
Apr 6, 2024, 8:46 AM
20
points
0
comments
7
min read
LW
link
Privacy and writing
Neil
Apr 6, 2024, 8:20 AM
20
points
1
comment
5
min read
LW
link
[Question]
How does the ever-increasing use of AI in the military for the direct purpose of murdering people affect your p(doom)?
Justausername
Apr 6, 2024, 6:31 AM
19
points
16
comments
1
min read
LW
link
Two tools for rethinking existential risk
Arepo
Apr 6, 2024, 2:55 AM
2
points
0
comments
25
min read
LW
link
Exploring Whole Brain Emulation
PeterMcCluskey
Apr 6, 2024, 2:38 AM
13
points
1
comment
2
min read
LW
link
(bayesianinvestor.com)
Koan: divining alien datastructures from RAM activations
TsviBT
Apr 5, 2024, 6:04 PM
49
points
10
comments
21
min read
LW
link
On the 2nd CWT with Jonathan Haidt
Zvi
Apr 5, 2024, 5:30 PM
27
points
3
comments
33
min read
LW
link
(thezvi.wordpress.com)
End-to-end hacking with language models
tchauvin
Apr 5, 2024, 3:06 PM
29
points
0
comments
8
min read
LW
link
Partial value takeover without world takeover
KatjaGrace
Apr 5, 2024, 6:20 AM
89
points
23
comments
3
min read
LW
link
(worldspiritsockpuppet.com)
On Complexity Science
Garrett Baker
Apr 5, 2024, 2:24 AM
51
points
19
comments
4
min read
LW
link
Using game theory to elect a centrist in the 2024 US Presidential Election
Ebenezer Dukakis
Apr 5, 2024, 12:46 AM
−3
points
0
comments
8
min read
LW
link
New report: A review of the empirical evidence for existential risk from AI via misaligned power-seeking
Harlan
and
rosehadshar
Apr 4, 2024, 11:41 PM
31
points
5
comments
1
min read
LW
link
(blog.aiimpacts.org)
Quick evidence review of bulking & cutting
jp
Apr 4, 2024, 9:43 PM
31
points
5
comments
4
min read
LW
link
LLMs for Alignment Research: a safety priority?
abramdemski
Apr 4, 2024, 8:03 PM
145
points
24
comments
11
min read
LW
link
On Leif Wenar’s Absurdly Unconvincing Critique Of Effective Altruism
omnizoid
Apr 4, 2024, 7:01 PM
8
points
2
comments
14
min read
LW
link
Run evals on base models too!
orthonormal
Apr 4, 2024, 6:43 PM
49
points
6
comments
1
min read
LW
link
Let’s Fund: Impact of our $1M crowdfunded grant to the Center for Clean Energy Innovation
Hauke Hillebrandt
Apr 4, 2024, 4:28 PM
5
points
0
comments
LW
link
(lets-fund.org)
The Buckling World Hypothesis—Visualising Vulnerable Worlds
Rosco-Hunter
Apr 4, 2024, 3:51 PM
−5
points
2
comments
4
min read
LW
link
Can AI Transform the Electorate into a Citizen’s Assembly?
Rosco-Hunter
Apr 4, 2024, 3:45 PM
−6
points
0
comments
4
min read
LW
link
AI Discrimination Requirements: A Regulatory Review
Deric Cheng
and
Elliot Mckernon
4 Apr 2024 15:43 UTC
7
points
0
comments
6
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel