Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
New
Hot
Active
Old
Page
1
[Linkpost] Towards a Theoretical Understanding of the ‘Reversal Curse’ via Training Dynamics
Bogdan Ionut Cirstea
11 May 2024 22:59 UTC
1
point
0
comments
1
min read
LW
link
(arxiv.org)
How To Do Patching Fast
Joseph Miller
11 May 2024 20:13 UTC
27
points
3
comments
4
min read
LW
link
Can we build a better Public Doublecrux?
Raemon
11 May 2024 19:21 UTC
51
points
4
comments
4
min read
LW
link
[Question]
How do I get better at D&D Sci?
FinalFormal2
11 May 2024 18:48 UTC
9
points
5
comments
1
min read
LW
link
[Question]
Resources for learning about poise / gracefulness?
David Gross
11 May 2024 18:30 UTC
14
points
0
comments
1
min read
LW
link
New intro textbook on AIXI
Alex_Altair
11 May 2024 18:18 UTC
39
points
3
comments
1
min read
LW
link
Questions are usually too cheap
Nathan Young
11 May 2024 13:00 UTC
46
points
8
comments
6
min read
LW
link
(nathanpmyoung.substack.com)
[Question]
Ethics and prospects of AI related jobs?
dr_s
11 May 2024 9:31 UTC
10
points
8
comments
1
min read
LW
link
Applying refusal-vector ablation to a Llama 3 70B agent
Simon Lermen
11 May 2024 0:08 UTC
39
points
6
comments
7
min read
LW
link
The Alignment Problem No One Is Talking About
James Stephen Brown
10 May 2024 18:34 UTC
10
points
2
comments
2
min read
LW
link
(nonzerosum.games)
Pascal’s Mugging and the Order of Quantification
Mascal's Pugging
10 May 2024 18:34 UTC
11
points
3
comments
2
min read
LW
link
Podcast with Yoshua Bengio on Why AI Labs are “Playing Dice with Humanity’s Future”
garrison
10 May 2024 17:23 UTC
41
points
0
comments
1
min read
LW
link
(garrisonlovely.substack.com)
(Geometrically) Maximal Lottery-Lotteries Are Probably Not Unique
Lorxus
10 May 2024 16:00 UTC
15
points
1
comment
14
min read
LW
link
What do you value ?
Akram Choudhary
10 May 2024 15:34 UTC
3
points
1
comment
2
min read
LW
link
[Question]
Do you know of lists of p(doom)s/AI forecasts/ AI quotes?
Nathan Young
10 May 2024 11:47 UTC
7
points
2
comments
1
min read
LW
link
AI and Chemical, Biological, Radiological, & Nuclear Hazards: A Regulatory Review
Elliot_Mckernon
and
Deric Cheng
10 May 2024 8:41 UTC
7
points
1
comment
9
min read
LW
link
shortest goddamn bayes guide ever
lukehmiles
10 May 2024 7:06 UTC
33
points
8
comments
1
min read
LW
link
Linear infra-Bayesian Bandits
Vanessa Kosoy
10 May 2024 6:41 UTC
29
points
2
comments
1
min read
LW
link
(arxiv.org)
Why Care About Natural Latents?
johnswentworth
and
David Lorell
9 May 2024 23:14 UTC
50
points
3
comments
5
min read
LW
link
What I learned from doing Quiz Bowl
Jacob G-W
9 May 2024 21:05 UTC
4
points
0
comments
6
min read
LW
link
(jacobgw.com)
Back to top
Next