Deep Honesty

Aletheophile7 May 2024 20:31 UTC
165 points
26 comments9 min readLW link

Let’s De­sign A School, Part 2.2 School as Ed­u­ca­tion—The Cur­ricu­lum (Gen­eral)

Sable7 May 2024 19:22 UTC
25 points
3 comments12 min readLW link
(affablyevil.substack.com)

De­sign­ing for a sin­gle purpose

Itay Dreyfus7 May 2024 14:11 UTC
48 points
12 comments10 min readLW link
(productidentity.co)

Re­view­ing the Struc­ture of Cur­rent AI Regulations

7 May 2024 12:34 UTC
29 points
0 comments13 min readLW link

re­flec­tions on smileys and how to make so­ciety’s in­ter­pre­tive pri­ors more charitable

Emrik7 May 2024 11:20 UTC
17 points
0 comments1 min readLW link

Vir­tual Book Club on Nick Bostrom’s “Deep Utopia: Life and Mean­ing in a Solved World”

elte7 May 2024 9:57 UTC
5 points
0 comments1 min readLW link

Vir­tual Book Club on Nick Bostrom’s “Deep Utopia: Life and Mean­ing in a Solved World”

elte7 May 2024 9:55 UTC
1 point
0 comments1 min readLW link

[Question] What is a com­mu­nity that has changed their be­havi­our with­out strife?

Nathan Young7 May 2024 9:24 UTC
12 points
6 comments1 min readLW link

Men­tal Mas­tur­ba­tion and the In­tel­lec­tual Com­fort Zone

Declan Molony7 May 2024 5:47 UTC
40 points
2 comments2 min readLW link

AXRP Epi­sode 31 - Sin­gu­lar Learn­ing The­ory with Daniel Murfet

DanielFilan7 May 2024 3:50 UTC
72 points
4 comments71 min readLW link

How do open AI mod­els af­fect in­cen­tive to race?

jessicata7 May 2024 0:33 UTC
60 points
13 comments3 min readLW link
(unstablerontology.substack.com)

Rapid ca­pa­bil­ity gain around su­per­ge­nius level seems prob­a­ble even with­out in­tel­li­gence need­ing to im­prove intelligence

6 May 2024 17:09 UTC
48 points
17 comments4 min readLW link

Ob­ser­va­tions on Teach­ing for Four Weeks

ClareChiaraVincent6 May 2024 16:55 UTC
51 points
14 comments3 min readLW link

[Question] Orthog­o­nal­ity Th­e­sis bur­den of proof

Donatas Lučiūnas6 May 2024 16:21 UTC
−18 points
4 comments1 min readLW link

GDP per cap­ita in 2050

Hauke Hillebrandt6 May 2024 15:14 UTC
29 points
8 comments16 min readLW link
(hauke.substack.com)

an effec­tive ai safety initiative

Logan Zoellner6 May 2024 7:53 UTC
3 points
9 comments3 min readLW link

Un­cov­er­ing De­cep­tive Ten­den­cies in Lan­guage Models: A Si­mu­lated Com­pany AI Assistant

6 May 2024 7:07 UTC
95 points
13 comments1 min readLW link
(arxiv.org)

Biorisk is an Un­helpful Anal­ogy for AI Risk

Davidmanheim6 May 2024 6:20 UTC
4 points
17 comments3 min readLW link

Some Prob­lems with Or­di­nal Op­ti­miza­tion Frame

Mateusz Bagiński6 May 2024 5:28 UTC
9 points
0 comments7 min readLW link

Ac­ci­den­tal Elec­tronic Instrument

jefftk6 May 2024 2:10 UTC
15 points
6 comments2 min readLW link
(www.jefftk.com)

Ex­plain­ing a Math Magic Trick

Robert_AIZI5 May 2024 19:41 UTC
99 points
10 comments5 min readLW link

[Question] Does re­duc­ing the amount of RL for a given ca­pa­bil­ity level make AI safer?

Chris_Leong5 May 2024 17:04 UTC
43 points
22 comments1 min readLW link

Hay­mar­ket at Clos­ing Time

jefftk5 May 2024 2:40 UTC
15 points
2 comments2 min readLW link
(www.jefftk.com)

in­tro­duc­tion to can­cer vaccines

bhauth5 May 2024 1:06 UTC
113 points
19 comments5 min readLW link
(www.bhauth.com)

Some Ex­per­i­ments I’d Like Some­one To Try With An Amnestic

johnswentworth4 May 2024 22:04 UTC
47 points
33 comments3 min readLW link

In­tro­duc­ing AI-Pow­ered Au­dio­books of Ra­tional Fic­tion Classics

Askwho4 May 2024 17:32 UTC
67 points
14 comments1 min readLW link

S-Risks: Fates Worse Than Ex­tinc­tion

4 May 2024 15:30 UTC
53 points
2 comments6 min readLW link
(youtu.be)

Shan­non Val­lor’s “tech­nomoral virtues”

David Gross4 May 2024 14:48 UTC
15 points
1 comment5 min readLW link

Con­served Quan­tities (Stat Mech Part 2)

J Bostock4 May 2024 13:40 UTC
13 points
0 comments5 min readLW link

If you are as­sum­ing Soft­ware works well you are dead

Johannes C. Mayer4 May 2024 12:54 UTC
0 points
12 comments1 min readLW link

CCS on com­pound sentences

artkpv4 May 2024 12:23 UTC
6 points
0 comments9 min readLW link

Now THIS is fore­cast­ing: un­der­stand­ing Epoch’s Direct Approach

4 May 2024 12:06 UTC
63 points
4 comments19 min readLW link

OHGOOD: A co­or­di­na­tion body for com­pute governance

Adam Jones4 May 2024 12:03 UTC
5 points
2 comments16 min readLW link
(adamjones.me)

My hour of mem­o­ryless lucidity

Eric Neyman4 May 2024 1:40 UTC
375 points
36 comments5 min readLW link
(ericneyman.wordpress.com)

Ex­tra Tall Crib

jefftk4 May 2024 0:00 UTC
5 points
9 comments1 min readLW link
(www.jefftk.com)

Get your tick­ets to Man­i­fest 2024 by May 13th!

Saul Munn3 May 2024 23:57 UTC
18 points
0 comments1 min readLW link

Embodiment

A*3 May 2024 20:06 UTC
4 points
0 comments1 min readLW link

(Geo­met­ri­cally) Max­i­mal Lot­tery-Lot­ter­ies Exist

Lorxus3 May 2024 19:29 UTC
13 points
11 comments26 min readLW link

[Question] Were there any an­cient ra­tio­nal­ists?

OliverHayman3 May 2024 18:26 UTC
12 points
3 comments1 min readLW link

Key take­aways from our EA and al­ign­ment re­search sur­veys

3 May 2024 18:10 UTC
112 points
10 comments21 min readLW link

“AI Safety for Fleshy Hu­mans” an AI Safety ex­plainer by Nicky Case

habryka3 May 2024 18:10 UTC
90 points
11 comments4 min readLW link
(aisafety.dance)

AI Clar­ity: An Ini­tial Re­search Agenda

3 May 2024 13:54 UTC
18 points
1 comment8 min readLW link

Ap­ply to ESPR & PAIR, Ra­tion­al­ity and AI Camps for Ages 16-21

Anna Gajdova3 May 2024 12:36 UTC
58 points
5 comments1 min readLW link

On pre­cise out-of-con­text steering

Olli Järviniemi3 May 2024 9:41 UTC
9 points
6 comments3 min readLW link

LLM+Plan­ners hy­bridi­s­a­tion for friendly AGI

installgentoo3 May 2024 8:40 UTC
7 points
2 comments1 min readLW link

Mechanis­tic In­ter­pretabil­ity Work­shop Hap­pen­ing at ICML 2024!

3 May 2024 1:18 UTC
48 points
6 comments1 min readLW link

Weekly newslet­ter for AI safety events and train­ing programs

Bryce Robertson3 May 2024 0:33 UTC
29 points
0 comments1 min readLW link

CCS: Coun­ter­fac­tual Civ­i­liza­tion Simulation

Morphism2 May 2024 22:54 UTC
3 points
0 comments2 min readLW link

Let’s De­sign A School, Part 2.1 School as Ed­u­ca­tion—Structure

Sable2 May 2024 22:04 UTC
26 points
2 comments10 min readLW link
(affablyevil.substack.com)

Why I’m not do­ing PauseAI

kwiat.dev2 May 2024 22:00 UTC
−8 points
5 comments4 min readLW link