Why Would Belief-States Have A Fractal Structure, And Why Would That Matter For Interpretability? An Explainer

18 Apr 2024 0:27 UTC
76 points
0 comments, 7 min read, LW link

The Mom Test: Summary and Thoughts

Adam Zerner, 18 Apr 2024 3:34 UTC
18 points
0 comments, 10 min read, LW link

Transformers Represent Belief State Geometry in their Residual Stream

Adam Shai, 16 Apr 2024 21:16 UTC
184 points
31 comments, 12 min read, LW link

Moving on from community living

Vika, 17 Apr 2024 17:02 UTC
41 points
6 comments, 3 min read, LW link
(vkrakovna.wordpress.com)

When is a mind me?

Rob Bensinger, 17 Apr 2024 5:56 UTC
72 points
24 comments, 15 min read, LW link

LLM Evaluators Recognize and Favor Their Own Generations

17 Apr 2024 21:09 UTC
15 points
0 comments, 3 min read, LW link
(tiny.cc)

Mid-conditional love

KatjaGrace, 17 Apr 2024 4:00 UTC
65 points
12 comments, 2 min read, LW link
(worldspiritsockpuppet.com)

AXRP Episode 28 - Suing Labs for AI Risk with Gabriel Weil

DanielFilan, 17 Apr 2024 21:42 UTC
10 points
0 comments, 65 min read, LW link

Childhood and Education Roundup #5

Zvi, 17 Apr 2024 13:00 UTC
28 points
2 comments, 25 min read, LW link
(thezvi.wordpress.com)

Reconsider the anti-cavity bacteria if you are Asian

Lao Mein, 15 Apr 2024 7:02 UTC
153 points
30 comments, 4 min read, LW link

Essay competition on the Automation of Wisdom and Philosophy — $25k in prizes

16 Apr 2024 10:10 UTC
76 points
4 comments, 8 min read, LW link
(blog.aiimpacts.org)

Claude 3 Opus can operate as a Turing machine

Gunnar_Zarncke, 17 Apr 2024 8:41 UTC
24 points
2 comments, 1 min read, LW link
(twitter.com)

Staged release

Zach Stein-Perlman, 17 Apr 2024 16:00 UTC
7 points
2 comments, 2 min read, LW link

Creating unrestricted AI Agents with Command R+

Simon Lermen, 16 Apr 2024 14:52 UTC
43 points
7 comments, 5 min read, LW link

My experience using financial commitments to overcome akrasia

William Howard, 15 Apr 2024 22:57 UTC
60 points
16 comments, 18 min read, LW link

An ethical framework to supersede Utilitarianism

metalcrow, 17 Apr 2024 17:18 UTC
3 points
4 comments, 4 min read, LW link

[Question] Discomfort Stacking

Lewis O’Brien, 17 Apr 2024 14:49 UTC
5 points
4 comments, 1 min read, LW link

Tinker

Richard_Ngo, 16 Apr 2024 18:26 UTC
25 points
0 comments, 1 min read, LW link
(press.asimov.com)

LessWrong’s (first) album: I Have Been A Good Bing

1 Apr 2024 7:33 UTC
507 points
152 comments, 11 min read, LW link

Should we maximize the Geometric Expectation of Utility?

A.H., 17 Apr 2024 10:37 UTC
6 points
11 comments, 9 min read, LW link