[Question] If al­ign­ment prob­lem was un­solv­able, would that avoid doom?

KinranyMay 7, 2023, 10:13 PM
3 points

6 votes

Overall karma indicates overall quality.

3 comments1 min readLW link

An ar­tifi­cially struc­tured ar­gu­ment for ex­pect­ing AGI ruin

Rob BensingerMay 7, 2023, 9:52 PM
91 points

44 votes

Overall karma indicates overall quality.

26 comments19 min readLW link

Where “the Se­quences” Are Wrong

Thoth HermesMay 7, 2023, 8:21 PM
−15 points

14 votes

Overall karma indicates overall quality.

5 comments14 min readLW link
(thothhermes.substack.com)

What’s wrong with be­ing dumb?

Adam ZernerMay 7, 2023, 6:31 PM
14 points

5 votes

Overall karma indicates overall quality.

17 comments2 min readLW link

Cat­e­gories of Ar­gu­ing Style : Why be­ing good among ra­tio­nal­ists isn’t enough to ar­gue with everyone

Camille Berger May 7, 2023, 5:45 PM
16 points

10 votes

Overall karma indicates overall quality.

0 comments23 min readLW link

Self-Ad­ministered Gell-Mann Amnesia

krsMay 7, 2023, 5:44 PM
1 point

1 vote

Overall karma indicates overall quality.

1 comment1 min readLW link

Un­der­stand­ing mesa-op­ti­miza­tion us­ing toy models

May 7, 2023, 5:00 PM
46 points

27 votes

Overall karma indicates overall quality.

6 comments10 min readLW link

How to have Poly­geni­cally Screened Children

GeneSmithMay 7, 2023, 4:01 PM
368 points

160 votes

Overall karma indicates overall quality.

128 comments27 min readLW link1 review

Statis­ti­cal mod­els & the ir­rele­vance of rare exceptions

patrissimoMay 7, 2023, 3:59 PM
36 points

9 votes

Overall karma indicates overall quality.

6 comments2 min readLW link

Let’s look for co­her­ence theorems

ValdesMay 7, 2023, 2:45 PM
25 points

12 votes

Overall karma indicates overall quality.

18 comments6 min readLW link

Graph­i­cal Rep­re­sen­ta­tions of Paul Chris­ti­ano’s Doom Model

Nathan YoungMay 7, 2023, 1:03 PM
9 points

5 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

An an­thro­po­mor­phic AI dilemma

TsviBTMay 7, 2023, 12:44 PM
26 points

13 votes

Overall karma indicates overall quality.

0 comments7 min readLW link

Violin Supports

jefftkMay 7, 2023, 12:10 PM
12 points

5 votes

Overall karma indicates overall quality.

1 comment1 min readLW link
(www.jefftk.com)

Prop­er­ties of Good Textbooks

niplavMay 7, 2023, 8:38 AM
50 points

19 votes

Overall karma indicates overall quality.

11 comments1 min readLW link

Against sac­ri­fic­ing AI trans­parency for gen­er­al­ity gains

Ape in the coatMay 7, 2023, 6:52 AM
4 points

7 votes

Overall karma indicates overall quality.

0 comments2 min readLW link

TED talk by Eliezer Yud­kowsky: Un­leash­ing the Power of Ar­tifi­cial Intelligence

bayesedMay 7, 2023, 5:45 AM
49 points

31 votes

Overall karma indicates overall quality.

36 comments1 min readLW link
(www.youtube.com)

Think­ing of Con­ve­nience as an Eco­nomic Term

ozziegooenMay 7, 2023, 1:21 AM
6 points

3 votes

Overall karma indicates overall quality.

0 comments12 min readLW link
(forum.effectivealtruism.org)

Cor­rigi­bil­ity, Much more de­tail than any­one wants to Read

Logan ZoellnerMay 7, 2023, 1:02 AM
27 points

10 votes

Overall karma indicates overall quality.

3 comments7 min readLW link

Resi­d­ual stream norms grow ex­po­nen­tially over the for­ward pass

May 7, 2023, 12:46 AM
77 points

35 votes

Overall karma indicates overall quality.

24 comments9 min readLW link

On the Loeb­ner Silver Prize (a Tur­ing test)

hold_my_fishMay 7, 2023, 12:39 AM
18 points

9 votes

Overall karma indicates overall quality.

2 comments2 min readLW link

Time and En­ergy Costs to Erase a Bit

DaemonicSigilMay 6, 2023, 11:29 PM
24 points

11 votes

Overall karma indicates overall quality.

32 comments7 min readLW link

How much do you be­lieve your re­sults?

Eric NeymanMay 6, 2023, 8:31 PM
514 points

235 votes

Overall karma indicates overall quality.

18 comments15 min readLW link4 reviews
(ericneyman.wordpress.com)

Long Covid Risks: 2023 Update

ElizabethMay 6, 2023, 6:20 PM
69 points

31 votes

Overall karma indicates overall quality.

11 comments4 min readLW link
(acesounderglass.com)

Is “red” for GPT-4 the same as “red” for you?

Yusuke HayashiMay 6, 2023, 5:55 PM
9 points

8 votes

Overall karma indicates overall quality.

6 comments2 min readLW link

The Broader Fos­sil Fuel Community

Jeffrey HeningerMay 6, 2023, 2:49 PM
16 points

12 votes

Overall karma indicates overall quality.

1 comment3 min readLW link

Es­ti­mat­ing Norovirus Prevalence

jefftkMay 6, 2023, 11:40 AM
16 points

4 votes

Overall karma indicates overall quality.

0 comments2 min readLW link
(www.jefftk.com)

Align­ment as Func­tion Fitting

A.H.May 6, 2023, 11:38 AM
7 points

3 votes

Overall karma indicates overall quality.

0 comments12 min readLW link

My preferred fram­ings for re­ward mis­speci­fi­ca­tion and goal misgeneralisation

Yi-YangMay 6, 2023, 4:48 AM
27 points

10 votes

Overall karma indicates overall quality.

1 comment8 min readLW link

You don’t need to be a ge­nius to be in AI safety research

Claire ShortMay 6, 2023, 2:32 AM
15 points

28 votes

Overall karma indicates overall quality.

1 comment6 min readLW link

Nat­u­ral­ist Collection

LoganStrohlMay 6, 2023, 12:37 AM
71 points

21 votes

Overall karma indicates overall quality.

7 comments15 min readLW link

Do you work at an AI lab? Please quit

Nik SamoylovMay 5, 2023, 11:41 PM
−29 points

13 votes

Overall karma indicates overall quality.

9 comments1 min readLW link

Ex­plain­ing “Hell is Game The­ory Folk The­o­rems”

electroswingMay 5, 2023, 11:33 PM
57 points

31 votes

Overall karma indicates overall quality.

21 comments5 min readLW link

Sleep­ing Beauty – the Death Hypothesis

Guillaume CharrierMay 5, 2023, 11:32 PM
7 points

10 votes

Overall karma indicates overall quality.

8 comments5 min readLW link

Orthog­o­nal’s For­mal-Goal Align­ment the­ory of change

Tamsin LeakeMay 5, 2023, 10:36 PM
69 points

33 votes

Overall karma indicates overall quality.

13 comments4 min readLW link
(carado.moe)

A smart enough LLM might be deadly sim­ply if you run it for long enough

Mikhail SaminMay 5, 2023, 8:49 PM
19 points

17 votes

Overall karma indicates overall quality.

16 comments8 min readLW link

What Ja­son has been read­ing, May 2023: “Pro­topia,” com­plex sys­tems, Daedalus vs. Icarus, and more

jasoncrawfordMay 5, 2023, 7:54 PM
26 points

9 votes

Overall karma indicates overall quality.

2 comments11 min readLW link
(rootsofprogress.org)

CHAT Di­plo­macy: LLMs and Na­tional Security

SebastianG May 5, 2023, 7:45 PM
25 points

10 votes

Overall karma indicates overall quality.

6 comments7 min readLW link

Linkpost for Ac­cursed Farms Dis­cus­sion /​ de­bate with AI ex­pert Eliezer Yudkowsky

gilchMay 5, 2023, 6:20 PM
14 points

9 votes

Overall karma indicates overall quality.

2 comments1 min readLW link
(www.youtube.com)

Reg­u­late or Com­pete? The China Fac­tor in U.S. AI Policy (NAIR #2)

charles_mMay 5, 2023, 5:43 PM
2 points

2 votes

Overall karma indicates overall quality.

1 comment7 min readLW link
(navigatingairisks.substack.com)

Kingfisher Live CD Process

jefftkMay 5, 2023, 5:00 PM
13 points

3 votes

Overall karma indicates overall quality.

0 comments3 min readLW link
(www.jefftk.com)

What can we learn from Bayes about rea­son­ing?

jasoncrawfordMay 5, 2023, 3:52 PM
22 points

7 votes

Overall karma indicates overall quality.

11 comments1 min readLW link

[Question] Why not use ac­tive SETI to pre­vent AI Doom?

RomanSMay 5, 2023, 2:41 PM
13 points

20 votes

Overall karma indicates overall quality.

13 comments1 min readLW link

In­ves­ti­gat­ing Emer­gent Goal-Like Be­hav­ior in Large Lan­guage Models us­ing Ex­per­i­men­tal Economics

phelps-sgMay 5, 2023, 11:15 AM
6 points

4 votes

Overall karma indicates overall quality.

1 comment4 min readLW link

Monthly Shorts 4/​23

CelerMay 5, 2023, 7:20 AM
8 points

4 votes

Overall karma indicates overall quality.

1 comment3 min readLW link
(keller.substack.com)

[Question] What is it like to be a com­pat­i­bil­ist?

tslarmMay 5, 2023, 2:56 AM
8 points

5 votes

Overall karma indicates overall quality.

72 comments1 min readLW link

Tran­script of a pre­sen­ta­tion on catas­trophic risks from AI

RobertMMay 5, 2023, 1:38 AM
6 points

1 vote

Overall karma indicates overall quality.

0 comments8 min readLW link

How to get good at programming

Ulisse MiniMay 5, 2023, 1:14 AM
40 points

25 votes

Overall karma indicates overall quality.

3 comments2 min readLW link

A brief col­lec­tion of Hin­ton’s re­cent com­ments on AGI risk

Kaj_SotalaMay 4, 2023, 11:31 PM
148 points

58 votes

Overall karma indicates overall quality.

9 comments11 min readLW link

Robin Han­son and I talk about AI risk

KatjaGraceMay 4, 2023, 10:20 PM
39 points

11 votes

Overall karma indicates overall quality.

8 comments1 min readLW link
(worldspiritsockpuppet.com)

Who reg­u­lates the reg­u­la­tors? We need to go be­yond the re­view-and-ap­proval paradigm

jasoncrawfordMay 4, 2023, 10:11 PM
122 points

41 votes

Overall karma indicates overall quality.

29 comments13 min readLW link
(rootsofprogress.org)