Deep Honesty

AletheophileMay 7, 2024, 8:31 PM
159 points
25 comments9 min readLW link

Let’s De­sign A School, Part 2.2 School as Ed­u­ca­tion—The Cur­ricu­lum (Gen­eral)

SableMay 7, 2024, 7:22 PM
25 points
3 comments12 min readLW link
(affablyevil.substack.com)

De­sign­ing for a sin­gle purpose

Itay DreyfusMay 7, 2024, 2:11 PM
48 points
12 comments10 min readLW link
(productidentity.co)

Re­view­ing the Struc­ture of Cur­rent AI Regulations

May 7, 2024, 12:34 PM
29 points
0 comments13 min readLW link

re­flec­tions on smileys and how to make so­ciety’s in­ter­pre­tive pri­ors more charitable

EmrikMay 7, 2024, 11:20 AM
17 points
0 comments1 min readLW link

Vir­tual Book Club on Nick Bostrom’s “Deep Utopia: Life and Mean­ing in a Solved World”

beatrice@foresight.orgMay 7, 2024, 9:57 AM
5 points
0 comments1 min readLW link

Vir­tual Book Club on Nick Bostrom’s “Deep Utopia: Life and Mean­ing in a Solved World”

beatrice@foresight.orgMay 7, 2024, 9:55 AM
1 point
0 comments1 min readLW link

[Question] What is a com­mu­nity that has changed their be­havi­our with­out strife?

Nathan YoungMay 7, 2024, 9:24 AM
12 points
6 commentsLW link

Men­tal Mas­tur­ba­tion and the In­tel­lec­tual Com­fort Zone

Declan MolonyMay 7, 2024, 5:47 AM
39 points
2 comments2 min readLW link

AXRP Epi­sode 31 - Sin­gu­lar Learn­ing The­ory with Daniel Murfet

DanielFilanMay 7, 2024, 3:50 AM
72 points
4 comments71 min readLW link

How do open AI mod­els af­fect in­cen­tive to race?

jessicataMay 7, 2024, 12:33 AM
60 points
13 comments3 min readLW link
(unstablerontology.substack.com)

Rapid ca­pa­bil­ity gain around su­per­ge­nius level seems prob­a­ble even with­out in­tel­li­gence need­ing to im­prove intelligence

May 6, 2024, 5:09 PM
48 points
17 comments4 min readLW link

Ob­ser­va­tions on Teach­ing for Four Weeks

ClareChiaraVincentMay 6, 2024, 4:55 PM
51 points
14 comments3 min readLW link

[Question] Orthog­o­nal­ity Th­e­sis bur­den of proof

Donatas LučiūnasMay 6, 2024, 4:21 PM
−18 points
4 comments1 min readLW link

GDP per cap­ita in 2050

Hauke HillebrandtMay 6, 2024, 3:14 PM
29 points
8 commentsLW link
(hauke.substack.com)

an effec­tive ai safety initiative

Logan ZoellnerMay 6, 2024, 7:53 AM
1 point
9 comments3 min readLW link

Un­cov­er­ing De­cep­tive Ten­den­cies in Lan­guage Models: A Si­mu­lated Com­pany AI Assistant

May 6, 2024, 7:07 AM
95 points
13 comments1 min readLW link
(arxiv.org)

Biorisk is an Un­helpful Anal­ogy for AI Risk

DavidmanheimMay 6, 2024, 6:20 AM
4 points
17 commentsLW link

Some Prob­lems with Or­di­nal Op­ti­miza­tion Frame

Mateusz BagińskiMay 6, 2024, 5:28 AM
9 points
0 comments7 min readLW link

Ac­ci­den­tal Elec­tronic Instrument

jefftkMay 6, 2024, 2:10 AM
15 points
6 comments2 min readLW link
(www.jefftk.com)

Ex­plain­ing a Math Magic Trick

Robert_AIZIMay 5, 2024, 7:41 PM
99 points
10 comments5 min readLW link

[Question] Does re­duc­ing the amount of RL for a given ca­pa­bil­ity level make AI safer?

Chris_LeongMay 5, 2024, 5:04 PM
43 points
22 comments1 min readLW link

Hay­mar­ket at Clos­ing Time

jefftkMay 5, 2024, 2:40 AM
15 points
2 comments2 min readLW link
(www.jefftk.com)

in­tro­duc­tion to can­cer vaccines

bhauthMay 5, 2024, 1:06 AM
113 points
19 comments5 min readLW link
(www.bhauth.com)

Some Ex­per­i­ments I’d Like Some­one To Try With An Amnestic

johnswentworthMay 4, 2024, 10:04 PM
47 points
33 comments3 min readLW link

In­tro­duc­ing AI-Pow­ered Au­dio­books of Ra­tional Fic­tion Classics

AskwhoMay 4, 2024, 5:32 PM
67 points
14 comments1 min readLW link

S-Risks: Fates Worse Than Ex­tinc­tion

May 4, 2024, 3:30 PM
53 points
2 comments6 min readLW link
(youtu.be)

Shan­non Val­lor’s “tech­nomoral virtues”

David GrossMay 4, 2024, 2:48 PM
15 points
1 comment5 min readLW link

Con­served Quan­tities (Stat Mech Part 2)

J BostockMay 4, 2024, 1:40 PM
13 points
0 comments5 min readLW link

If you are as­sum­ing Soft­ware works well you are dead

Johannes C. MayerMay 4, 2024, 12:54 PM
0 points
12 comments1 min readLW link

CCS on com­pound sentences

Artyom KarpovMay 4, 2024, 12:23 PM
6 points
0 comments9 min readLW link

Now THIS is fore­cast­ing: un­der­stand­ing Epoch’s Direct Approach

May 4, 2024, 12:06 PM
63 points
4 comments19 min readLW link

OHGOOD: A co­or­di­na­tion body for com­pute governance

Adam JonesMay 4, 2024, 12:03 PM
5 points
2 comments16 min readLW link
(adamjones.me)

My hour of mem­o­ryless lucidity

Eric NeymanMay 4, 2024, 1:40 AM
372 points
35 comments5 min readLW link
(ericneyman.wordpress.com)

Ex­tra Tall Crib

jefftkMay 4, 2024, 12:00 AM
5 points
9 comments1 min readLW link
(www.jefftk.com)

Get your tick­ets to Man­i­fest 2024 by May 13th!

Saul MunnMay 3, 2024, 11:57 PM
18 points
0 commentsLW link

Embodiment

A*May 3, 2024, 8:06 PM
4 points
0 comments1 min readLW link

(Geo­met­ri­cally) Max­i­mal Lot­tery-Lot­ter­ies Exist

LorxusMay 3, 2024, 7:29 PM
13 points
11 comments26 min readLW link

[Question] Were there any an­cient ra­tio­nal­ists?

OliverHaymanMay 3, 2024, 6:26 PM
11 points
3 comments1 min readLW link

Key take­aways from our EA and al­ign­ment re­search sur­veys

May 3, 2024, 6:10 PM
112 points
10 comments21 min readLW link

“AI Safety for Fleshy Hu­mans” an AI Safety ex­plainer by Nicky Case

habrykaMay 3, 2024, 6:10 PM
90 points
11 comments4 min readLW link
(aisafety.dance)

AI Clar­ity: An Ini­tial Re­search Agenda

May 3, 2024, 1:54 PM
18 points
1 comment8 min readLW link

Ap­ply to ESPR & PAIR, Ra­tion­al­ity and AI Camps for Ages 16-21

Anna GajdovaMay 3, 2024, 12:36 PM
58 points
5 comments1 min readLW link

On pre­cise out-of-con­text steering

Olli JärviniemiMay 3, 2024, 9:41 AM
9 points
6 comments3 min readLW link

LLM+Plan­ners hy­bridi­s­a­tion for friendly AGI

installgentooMay 3, 2024, 8:40 AM
7 points
2 comments1 min readLW link

Mechanis­tic In­ter­pretabil­ity Work­shop Hap­pen­ing at ICML 2024!

May 3, 2024, 1:18 AM
48 points
6 comments1 min readLW link

Weekly newslet­ter for AI safety events and train­ing programs

Bryce RobertsonMay 3, 2024, 12:33 AM
29 points
0 comments1 min readLW link

CCS: Coun­ter­fac­tual Civ­i­liza­tion Simulation

MorphismMay 2, 2024, 10:54 PM
3 points
0 comments2 min readLW link

Let’s De­sign A School, Part 2.1 School as Ed­u­ca­tion—Structure

SableMay 2, 2024, 10:04 PM
26 points
2 comments10 min readLW link
(affablyevil.substack.com)

Why I’m not do­ing PauseAI

kwiat.devMay 2, 2024, 10:00 PM
−8 points
5 comments4 min readLW link