Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
New
Hot
Active
Old
Page
1
Superbabies: Putting The Pieces Together
sarahconstantin
11 Jul 2024 20:40 UTC
200
points
35
comments
10
min read
LW
link
(sarahconstantin.substack.com)
Safety isn’t safety without a social model (or: dispelling the myth of per se technical safety)
Andrew_Critch
14 Jun 2024 0:16 UTC
324
points
34
comments
4
min read
LW
link
LLM Generality is a Timeline Crux
eggsyntax
24 Jun 2024 12:52 UTC
201
points
92
comments
7
min read
LW
link
Poker is a bad game for teaching epistemics. Figgie is a better one.
rossry
8 Jul 2024 6:05 UTC
96
points
46
comments
11
min read
LW
link
(blog.rossry.net)
My AI Model Delta Compared To Yudkowsky
johnswentworth
10 Jun 2024 16:12 UTC
272
points
100
comments
4
min read
LW
link
My hour of memoryless lucidity
Eric Neyman
4 May 2024 1:40 UTC
349
points
34
comments
5
min read
LW
link
(ericneyman.wordpress.com)
Loving a world you don’t trust
Joe Carlsmith
18 Jun 2024 19:31 UTC
126
points
13
comments
33
min read
LW
link
Transformers Represent Belief State Geometry in their Residual Stream
Adam Shai
16 Apr 2024 21:16 UTC
397
points
100
comments
12
min read
LW
link
Truthseeking is the ground in which other principles grow
Elizabeth
27 May 2024 1:09 UTC
207
points
14
comments
16
min read
LW
link
Thoughts on seed oil
dynomight
20 Apr 2024 12:29 UTC
341
points
122
comments
17
min read
LW
link
(dynomight.net)
The Best Tacit Knowledge Videos on Every Subject
Parker Conley
31 Mar 2024 17:14 UTC
347
points
138
comments
16
min read
LW
link
Failures in Kindness
silentbob
26 Mar 2024 21:30 UTC
354
points
48
comments
9
min read
LW
link
AI catastrophes and rogue deployments
Buck
3 Jun 2024 17:04 UTC
117
points
16
comments
8
min read
LW
link
EIS XIII: Reflections on Anthropic’s SAE Research Circa May 2024
scasper
21 May 2024 20:15 UTC
155
points
16
comments
3
min read
LW
link
The Standard Analogy
Zack_M_Davis
3 Jun 2024 17:15 UTC
113
points
25
comments
12
min read
LW
link
On Not Pulling The Ladder Up Behind You
Screwtape
26 Apr 2024 21:58 UTC
186
points
19
comments
9
min read
LW
link
Deep Honesty
Aletheophile
7 May 2024 20:31 UTC
150
points
25
comments
9
min read
LW
link
On green
Joe Carlsmith
21 Mar 2024 17:38 UTC
261
points
35
comments
31
min read
LW
link
My PhD thesis: Algorithmic Bayesian Epistemology
Eric Neyman
16 Mar 2024 22:56 UTC
252
points
14
comments
7
min read
LW
link
(arxiv.org)
There is way too much serendipity
Malmesbury
19 Jan 2024 19:37 UTC
357
points
56
comments
7
min read
LW
link
Back to top
Next