Superbabies: Putting The Pieces Together

sarahconstantin11 Jul 2024 20:40 UTC

200 points

35 comments10 min readLW link

(sarahconstantin.substack.com)

Safety isn’t safety without a social model (or: dispelling the myth of per se technical safety)

Andrew_Critch14 Jun 2024 0:16 UTC

324 points

34 comments4 min readLW link

LLM Generality is a Timeline Crux

eggsyntax24 Jun 2024 12:52 UTC

201 points

92 comments7 min readLW link

Poker is a bad game for teaching epistemics. Figgie is a better one.

rossry8 Jul 2024 6:05 UTC

96 points

46 comments11 min readLW link

(blog.rossry.net)

My AI Model Delta Compared To Yudkowsky

johnswentworth10 Jun 2024 16:12 UTC

272 points

100 comments4 min readLW link

My hour of memoryless lucidity

Eric Neyman4 May 2024 1:40 UTC

349 points

34 comments5 min readLW link

(ericneyman.wordpress.com)

Loving a world you don’t trust

Joe Carlsmith18 Jun 2024 19:31 UTC

126 points

13 comments33 min readLW link

Transformers Represent Belief State Geometry in their Residual Stream

Adam Shai16 Apr 2024 21:16 UTC

397 points

100 comments12 min readLW link

Truthseeking is the ground in which other principles grow

Elizabeth27 May 2024 1:09 UTC

207 points

14 comments16 min readLW link

Thoughts on seed oil

dynomight20 Apr 2024 12:29 UTC

341 points

122 comments17 min readLW link

(dynomight.net)

The Best Tacit Knowledge Videos on Every Subject

Parker Conley31 Mar 2024 17:14 UTC

347 points

138 comments16 min readLW link

Failures in Kindness

silentbob26 Mar 2024 21:30 UTC

354 points

48 comments9 min readLW link

AI catastrophes and rogue deployments

Buck3 Jun 2024 17:04 UTC

117 points

16 comments8 min readLW link

EIS XIII: Reflections on Anthropic’s SAE Research Circa May 2024

scasper21 May 2024 20:15 UTC

155 points

16 comments3 min readLW link

The Standard Analogy

Zack_M_Davis3 Jun 2024 17:15 UTC

113 points

25 comments12 min readLW link

On Not Pulling The Ladder Up Behind You

Screwtape26 Apr 2024 21:58 UTC

186 points

19 comments9 min readLW link

Deep Honesty

Aletheophile7 May 2024 20:31 UTC

150 points

25 comments9 min readLW link

On green

Joe Carlsmith21 Mar 2024 17:38 UTC

261 points

35 comments31 min readLW link

My PhD thesis: Algorithmic Bayesian Epistemology

Eric Neyman16 Mar 2024 22:56 UTC

252 points

14 comments7 min readLW link

(arxiv.org)

There is way too much serendipity

Malmesbury19 Jan 2024 19:37 UTC

357 points

56 comments7 min readLW link