A com­par­i­son of causal scrub­bing, causal ab­strac­tions, and re­lated methods

Jun 8, 2023, 11:40 PM
73 points
3 comments22 min readLW link

Up­dates and Reflec­tions on Op­ti­mal Ex­er­cise af­ter Nearly a Decade

romeostevensitJun 8, 2023, 11:02 PM
213 points
57 comments2 min readLW link1 review

Take­aways from the Mechanis­tic In­ter­pretabil­ity Challenges

scasperJun 8, 2023, 6:56 PM
94 points
5 comments6 min readLW link

Leave an Emo­tional Line of Retreat

Johannes C. MayerJun 8, 2023, 6:36 PM
23 points
1 comment1 min readLW link

Cur­rent AI harms are also sci-fi

Christopher KingJun 8, 2023, 5:49 PM
26 points
3 comments1 min readLW link

Two Ways To Re­duce Un­hap­piness That Comes From Dis­torted Views of Reality

Anne HsuJun 8, 2023, 5:43 PM
3 points
0 comments7 min readLW link

Col­lab­o­ra­tion in Science: Hap­pier Peo­ple ↔ Bet­ter Research

nadinespyJun 8, 2023, 5:42 PM
3 points
0 comments32 min readLW link

Biomimetic al­ign­ment: Align­ment be­tween an­i­mal genes and an­i­mal brains as a model for al­ign­ment be­tween hu­mans and AI sys­tems

geoffreymillerJun 8, 2023, 4:05 PM
10 points
1 comment16 min readLW link

A po­ten­tially high im­pact differ­en­tial tech­nolog­i­cal de­vel­op­ment area

Noosphere89Jun 8, 2023, 2:33 PM
5 points
2 comments2 min readLW link

[Question] Ques­tion for Pre­dic­tion Mar­ket peo­ple: where is the money sup­posed to come from?

Robert_AIZIJun 8, 2023, 1:58 PM
25 points
26 comments1 min readLW link

AI #15: The Prin­ci­ple of Charity

ZviJun 8, 2023, 12:10 PM
73 points
16 comments44 min readLW link
(thezvi.wordpress.com)

if you’re read­ing this it’s too late (a new the­ory on what is caus­ing the Great Stag­na­tion)

rogersbaconJun 8, 2023, 11:49 AM
−10 points
2 comments13 min readLW link
(www.secretorum.life)

[Linkpost] Scal­ing laws for lan­guage en­cod­ing mod­els in fMRI

Bogdan Ionut CirsteaJun 8, 2023, 10:52 AM
30 points
0 comments1 min readLW link

Trans­for­ma­tive AI is a pro­cess

meijer1973Jun 8, 2023, 8:57 AM
2 points
0 comments5 min readLW link

Cri­sis of Faith case study: be­yond re­duc­tion­ism?

MalcolmOceanJun 8, 2023, 6:11 AM
6 points
9 comments19 min readLW link

I wrote this be­cause of watermelon

ArtiJun 8, 2023, 3:55 AM
4 points
2 comments1 min readLW link

Learn­ing Trans­former Pro­grams [Linkpost]

aogJun 8, 2023, 12:16 AM
7 points
0 comments1 min readLW link
(arxiv.org)

What will GPT-2030 look like?

jsteinhardtJun 7, 2023, 11:40 PM
185 points
43 comments23 min readLW link
(bounded-regret.ghost.io)

Progress links and tweets, 2023-06-07

jasoncrawfordJun 7, 2023, 11:26 PM
11 points
0 comments1 min readLW link
(rootsofprogress.org)

LEAst-squares Con­cept Era­sure (LEACE)

tricky_labyrinthJun 7, 2023, 9:51 PM
68 points
10 comments1 min readLW link
(twitter.com)

Pro­posal: Tune LLMs to Use Cal­ibrated Language

OneManyNoneJun 7, 2023, 9:05 PM
9 points
0 comments5 min readLW link

A moral back­lash against AI will prob­a­bly slow down AGI development

geoffreymillerJun 7, 2023, 8:39 PM
51 points
10 comments14 min readLW link

An Ex­er­cise to Build In­tu­itions on AGI Risk

Lauro LangoscoJun 7, 2023, 6:35 PM
52 points
3 comments8 min readLW link

Elon talked with se­nior Chi­nese lead­er­ship about AI X-risk

ChristianKlJun 7, 2023, 3:02 PM
47 points
2 comments1 min readLW link
(www.youtube.com)

Ar­ti­cle Sum­mary: Cur­rent and Near-Term AI as a Po­ten­tial Ex­is­ten­tial Risk Factor

André FerrettiJun 7, 2023, 1:51 PM
28 points
3 comments1 min readLW link
(dl.acm.org)

gamers be­ware: mod­ded Minecraft has new malware

the gears to ascensionJun 7, 2023, 1:49 PM
14 points
5 comments1 min readLW link
(github.com)

Launch­ing Light­speed Grants (Ap­ply by July 6th)

habrykaJun 7, 2023, 2:53 AM
211 points
42 comments5 min readLW link

Cul­ti­vate an ob­ses­sion with the ob­ject level

Richard_NgoJun 7, 2023, 1:39 AM
77 points
4 comments3 min readLW link

How to Slow AI Development

PeterMcCluskeyJun 7, 2023, 12:29 AM
20 points
0 comments5 min readLW link
(bayesianinvestor.com)

[Question] Killing Re­cur­rent Me­mory Over Self At­ten­tion?

Del NoboloJun 6, 2023, 11:02 PM
3 points
0 comments1 min readLW link

[Job Ad] SERI MATS is (still) hiring for our sum­mer program

Jun 6, 2023, 9:07 PM
12 points
0 comments7 min readLW link

Why I am not a longter­mist (May 2022)

boazbarakJun 6, 2023, 8:36 PM
38 points
19 comments9 min readLW link
(windowsontheory.org)

So­ciety Library seek­ing con­tri­bu­tions for canon­i­cal AI Safety de­bate map

Jarred FilmerJun 6, 2023, 6:15 PM
36 points
0 comments1 min readLW link
(www.societylibrary.org)

A Play­book for AI Risk Re­duc­tion (fo­cused on mis­al­igned AI)

HoldenKarnofskyJun 6, 2023, 6:05 PM
90 points
42 comments14 min readLW link1 review

A “bot­tom-up” ap­proach to AI as a more trans­par­ent al­ter­na­tive to “top-down” LLMs

Paul JorionJun 6, 2023, 6:00 PM
1 point
0 comments1 min readLW link

Why Yud­kowsky Is Wrong And What He Does Can Be More Dangerous

idontagreewiththatJun 6, 2023, 5:59 PM
−38 points
4 comments3 min readLW link

The Base Rate Times, news through pre­dic­tion markets

vandemonianJun 6, 2023, 5:42 PM
268 points
41 comments4 min readLW link1 review

Monthly Roundup #7: June 2023

ZviJun 6, 2023, 5:40 PM
23 points
13 comments43 min readLW link
(thezvi.wordpress.com)

Trans­for­ma­tive AGI by 2043 is <1% likely

Ted SandersJun 6, 2023, 5:36 PM
33 points
117 comments5 min readLW link
(arxiv.org)

AISN #9: State­ment on Ex­tinc­tion Risks, Com­pet­i­tive Pres­sures, and When Will AI Reach Hu­man-Level?

Dan HJun 6, 2023, 4:10 PM
12 points
0 comments7 min readLW link
(newsletter.safe.ai)

An Eter­nal Company

moyamoJun 6, 2023, 3:56 PM
7 points
8 comments4 min readLW link

AISC end of pro­gram presentations

Jun 6, 2023, 3:45 PM
18 points
0 comments1 min readLW link

Why the Solu­tions to AI Align­ment are Likely Out­side the Over­ton Window

williamsaeJun 6, 2023, 2:21 PM
−6 points
0 comments3 min readLW link

Stampy’s AI Safety Info—New Distil­la­tions #3 [May 2023]

markovJun 6, 2023, 2:18 PM
16 points
0 comments2 min readLW link
(aisafety.info)

Agen­tic Mess (A Failure Story)

Jun 6, 2023, 1:09 PM
46 points
5 comments13 min readLW link

Ber­lin AI Align­ment Open Meetup June 2023

GuyPJun 6, 2023, 10:04 AM
5 points
0 comments1 min readLW link

The Sharp Right Turn: sud­den de­cep­tive al­ign­ment as a con­ver­gent goal

avturchinJun 6, 2023, 9:59 AM
38 points
5 comments1 min readLW link

Open Thread: June 2023 (In­line Re­acts!)

RaemonJun 6, 2023, 7:40 AM
19 points
57 comments1 min readLW link

[Linkpost] Given Ex­tinc­tion Wor­ries, Why Don’t AI Re­searchers Quit? Well, Sev­eral Reasons

Daniel_EthJun 6, 2023, 7:31 AM
10 points
0 commentsLW link

Is the 10% Giv­ing What We Can Pledge Core to EA’s Rep­u­ta­tion?

DirectedEvolutionJun 6, 2023, 6:21 AM
10 points
1 commentLW link