What will GPT-2030 look like?

jsteinhardt · 7 Jun 2023 23:40 UTC
182 points
42 comments · 23 min read · LW link
(bounded-regret.ghost.io)

Progress links and tweets, 2023-06-07

jasoncrawford · 7 Jun 2023 23:26 UTC
11 points
0 comments · 1 min read · LW link
(rootsofprogress.org)

LEAst-squares Concept Erasure (LEACE)

tricky_labyrinth · 7 Jun 2023 21:51 UTC
68 points
10 comments · 1 min read · LW link
(twitter.com)

Proposal: Tune LLMs to Use Calibrated Language

OneManyNone · 7 Jun 2023 21:05 UTC
9 points
0 comments · 5 min read · LW link

A moral backlash against AI will probably slow down AGI development

geoffreymiller · 7 Jun 2023 20:39 UTC
49 points
10 comments · 14 min read · LW link

RAMP—RoboNet Ar­tifi­cial Me­dia Protocol

antoniomax · 7 Jun 2023 19:01 UTC
−1 points
0 comments · 19 min read · LW link
(antoniomax.substack.com)

An Exercise to Build Intuitions on AGI Risk

Lauro Langosco · 7 Jun 2023 18:35 UTC
52 points
3 comments · 8 min read · LW link

Elon talked with senior Chinese leadership about AI X-risk

ChristianKl · 7 Jun 2023 15:02 UTC
47 points
2 comments · 1 min read · LW link
(www.youtube.com)

Article Summary: Current and Near-Term AI as a Potential Existential Risk Factor

André Ferretti · 7 Jun 2023 13:51 UTC
28 points
3 comments · 1 min read · LW link
(dl.acm.org)

gamers beware: modded Minecraft has new malware

the gears to ascension · 7 Jun 2023 13:49 UTC
14 points
5 comments · 1 min read · LW link
(github.com)

Launching Lightspeed Grants (Apply by July 6th)

habryka · 7 Jun 2023 2:53 UTC
211 points
41 comments · 5 min read · LW link

Cultivate an obsession with the object level

Richard_Ngo · 7 Jun 2023 1:39 UTC
70 points
4 comments · 3 min read · LW link

How to Slow AI Development

PeterMcCluskey · 7 Jun 2023 0:29 UTC
20 points
0 comments · 5 min read · LW link
(bayesianinvestor.com)

[Question] Killing Recurrent Memory Over Self Attention?

Del Nobolo · 6 Jun 2023 23:02 UTC
3 points
0 comments · 1 min read · LW link

[Job Ad] SERI MATS is (still) hiring for our summer program

6 Jun 2023 21:07 UTC
12 points
0 comments · 7 min read · LW link

Why I am not a longtermist (May 2022)

boazbarak · 6 Jun 2023 20:36 UTC
39 points
18 comments · 9 min read · LW link
(windowsontheory.org)

Society Library seeking contributions for canonical AI Safety debate map

Jarred Filmer · 6 Jun 2023 18:15 UTC
36 points
0 comments · 1 min read · LW link
(www.societylibrary.org)

A Playbook for AI Risk Reduction (focused on misaligned AI)

HoldenKarnofsky · 6 Jun 2023 18:05 UTC
90 points
41 comments · 14 min read · LW link

A “bottom-up” approach to AI as a more transparent alternative to “top-down” LLMs

Paul Jorion · 6 Jun 2023 18:00 UTC
1 point
0 comments · 1 min read · LW link

Why Yudkowsky Is Wrong And What He Does Can Be More Dangerous

idontagreewiththat · 6 Jun 2023 17:59 UTC
−40 points
3 comments · 3 min read · LW link

The Base Rate Times, news through prediction markets

vandemonian · 6 Jun 2023 17:42 UTC
267 points
39 comments · 4 min read · LW link

Monthly Roundup #7: June 2023

Zvi · 6 Jun 2023 17:40 UTC
23 points
13 comments · 43 min read · LW link
(thezvi.wordpress.com)

Transformative AGI by 2043 is <1% likely

Ted Sanders · 6 Jun 2023 17:36 UTC
34 points
115 comments · 5 min read · LW link
(arxiv.org)

AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level?

6 Jun 2023 16:10 UTC
12 points
0 comments · 7 min read · LW link
(newsletter.safe.ai)

An Eternal Company

moyamo · 6 Jun 2023 15:56 UTC
7 points
8 comments · 4 min read · LW link

AISC end of program presentations

6 Jun 2023 15:45 UTC
18 points
0 comments · 1 min read · LW link

Why the Solutions to AI Alignment are Likely Outside the Overton Window

williamsae · 6 Jun 2023 14:21 UTC
−6 points
0 comments · 3 min read · LW link

Stampy’s AI Safety Info—New Distil­la­tions #3 [May 2023]

markov · 6 Jun 2023 14:18 UTC
16 points
0 comments · 2 min read · LW link
(aisafety.info)

Agentic Mess (A Failure Story)

6 Jun 2023 13:09 UTC
44 points
5 comments · 13 min read · LW link

Berlin AI Alignment Open Meetup June 2023

GuyP · 6 Jun 2023 10:04 UTC
5 points
0 comments · 1 min read · LW link

The Sharp Right Turn: sudden deceptive alignment as a convergent goal

avturchin · 6 Jun 2023 9:59 UTC
38 points
5 comments · 1 min read · LW link

Open Thread: June 2023 (Inline Reacts!)

Raemon · 6 Jun 2023 7:40 UTC
19 points
57 comments · 1 min read · LW link

[Linkpost] Given Extinction Worries, Why Don’t AI Researchers Quit? Well, Several Reasons

Daniel_Eth · 6 Jun 2023 7:31 UTC
10 points
0 comments · 1 min read · LW link

Is the 10% Giving What We Can Pledge Core to EA’s Reputation?

DirectedEvolution · 6 Jun 2023 6:21 UTC
9 points
1 comment · 1 min read · LW link

Rishi to outline his vision for Britain to take the world lead in policing AI threats when he meets Joe Biden

Mati_Roy · 6 Jun 2023 4:47 UTC
25 points
1 comment · 1 min read · LW link
(www.dailymail.co.uk)

Intelligence Officials Say U.S. Has Retrieved Craft of Non-Human Origin

lc · 6 Jun 2023 3:54 UTC
33 points
149 comments · 1 min read · LW link
(thedebrief.org)

Algorithmic Improvement Is Probably Faster Than Scaling Now

johnswentworth · 6 Jun 2023 2:57 UTC
137 points
23 comments · 2 min read · LW link

Contra Mask Status

jefftk · 6 Jun 2023 2:10 UTC
10 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Andrew Ng wants to have a conversation about extinction risk from AI

Leon Lang · 5 Jun 2023 22:29 UTC
32 points
2 comments · 1 min read · LW link
(twitter.com)

True Rejection Challenges

Screwtape · 5 Jun 2023 22:17 UTC
20 points
11 comments · 5 min read · LW link

AISafety.info “How can I help?” FAQ

5 Jun 2023 22:09 UTC
58 points
0 comments · 2 min read · LW link

Answer to a question: what do I think about God’s communication patterns?

Jim Pivarski · 5 Jun 2023 21:40 UTC
1 point
16 comments · 8 min read · LW link

The Intrinsic Interplay of Human Values and Artificial Intelligence: Navigating the Optimization Challenge

Joe Kwon · 5 Jun 2023 20:41 UTC
2 points
1 comment · 18 min read · LW link

The (local) unit of intelligence is FLOPs

boazbarak · 5 Jun 2023 18:23 UTC
40 points
7 comments · 5 min read · LW link

Tutor-GPT & Pedagogical Reasoning

courtlandleer · 5 Jun 2023 17:53 UTC
26 points
3 comments · 4 min read · LW link

Not another bias!

Lionel · 5 Jun 2023 17:50 UTC
3 points
0 comments · 1 min read · LW link
(lionelpage.substack.com)

What I’ve been reading, June 2023

jasoncrawford · 5 Jun 2023 17:08 UTC
16 points
0 comments · 7 min read · LW link
(rootsofprogress.org)

Humans don’t understand how we do most things

Nathan1123 · 5 Jun 2023 14:35 UTC
2 points
2 comments · 2 min read · LW link

Wildfire of strategicness

TsviBT · 5 Jun 2023 13:59 UTC
36 points
19 comments · 1 min read · LW link

Speaking off-meta

Epirito · 5 Jun 2023 13:56 UTC
4 points
0 comments · 1 min read · LW link