[Question] Forecasting thread: How does AI risk level vary based on timelines?

elifland
14 Sep 2022 23:56 UTC
34 points
7 comments
1 min read
LW link

Coordinate-Free Interpretability Theory

johnswentworth
14 Sep 2022 23:33 UTC
50 points
16 comments
5 min read
LW link

Progress links and tweets, 2022-09-14

jasoncrawford
14 Sep 2022 23:21 UTC
9 points
2 comments
1 min read
LW link
(rootsofprogress.org)

Effective altruism in the garden of ends

Tyler Alterman
14 Sep 2022 22:02 UTC
24 points
1 comment
27 min read
LW link

The problem with the media presentation of “believing in AI”

Roman Leventov
14 Sep 2022 21:05 UTC
3 points
0 comments
1 min read
LW link

Seeing the Schema

vitaliya
14 Sep 2022 20:45 UTC
23 points
6 comments
1 min read
LW link

Responding to ‘Beyond Hyperanthropomorphism’

ukc10014
14 Sep 2022 20:37 UTC
8 points
0 comments
16 min read
LW link

When is intent alignment sufficient or necessary to reduce AGI conflict?

14 Sep 2022 19:39 UTC
40 points
0 comments
9 min read
LW link

When would AGIs engage in conflict?

14 Sep 2022 19:38 UTC
52 points
5 comments
13 min read
LW link

When does technical work to reduce AGI conflict make a difference?: Introduction

14 Sep 2022 19:38 UTC
52 points
3 comments
6 min read
LW link

ACT-1: Transformer for Actions

Daniel Kokotajlo
14 Sep 2022 19:09 UTC
52 points
4 comments
1 min read
LW link
(www.adept.ai)

Renormalization: Why Bigger is Simpler

tailcalled
14 Sep 2022 17:52 UTC
30 points
5 comments
1 min read
LW link
(www.youtube.com)

Guesstimate Algorithm for Medical Research

Elizabeth
14 Sep 2022 17:30 UTC
26 points
0 comments
7 min read
LW link
(acesounderglass.com)

Precise P(doom) isn’t very important for prioritization or strategy

harsimony
14 Sep 2022 17:19 UTC
14 points
6 comments
1 min read
LW link

Transhumanism, genetic engineering, and the biological basis of intelligence.

fowlertm
14 Sep 2022 15:55 UTC
41 points
23 comments
1 min read
LW link

What would happen if we abolished the FDA tomorrow?

Yair Halberstadt
14 Sep 2022 15:22 UTC
19 points
15 comments
4 min read
LW link

Emily Brontë on: Psychology Required for Serious™ AGI Safety Research

robertzk
14 Sep 2022 14:47 UTC
2 points
0 comments
1 min read
LW link

The Defender’s Advantage of Interpretability

Marius Hobbhahn
14 Sep 2022 14:05 UTC
41 points
4 comments
6 min read
LW link

[Question] Why Do People Think Humans Are Stupid?

DragonGod
14 Sep 2022 13:55 UTC
22 points
41 comments
3 min read
LW link

[Question] Are Speed Superintelligences Feasible for Modern ML Techniques?

DragonGod
14 Sep 2022 12:59 UTC
9 points
7 comments
1 min read
LW link

[Question] Would a Misaligned SSI Really Kill Us All?

DragonGod
14 Sep 2022 12:15 UTC
6 points
7 comments
6 min read
LW link

Some ideas for epistles to the AI ethicists

Charlie Steiner
14 Sep 2022 9:07 UTC
19 points
0 comments
4 min read
LW link

Git Re-Basin: Merging Models modulo Permutation Symmetries [Linkpost]

aogara
14 Sep 2022 8:55 UTC
21 points
0 comments
2 min read
LW link
(arxiv.org)

Dan Luu on Futurist Predictions

RobertM
14 Sep 2022 3:01 UTC
50 points
9 comments
5 min read
LW link
(danluu.com)

Simple 5x5 Go

jefftk
14 Sep 2022 2:00 UTC
18 points
3 comments
1 min read
LW link
(www.jefftk.com)

I’m taking a course on game theory and am faced with this question. What’s the rational decision?

Dalton Mabery
14 Sep 2022 0:27 UTC
0 points
12 comments
1 min read
LW link

Twin Cities ACX Meetup—Oct 2022

Timothy M.
13 Sep 2022 22:38 UTC
1 point
2 comments
1 min read
LW link

Trying to find the underlying structure of computational systems

Matthias G. Mayer
13 Sep 2022 21:16 UTC
17 points
9 comments
4 min read
LW link

Risk aversion and GPT-3

hatta_afiq
13 Sep 2022 20:50 UTC
1 point
0 comments
1 min read
LW link

Simple proofs of the age of the universe (or other things)

Astynax
13 Sep 2022 18:20 UTC
16 points
12 comments
1 min read
LW link

New tool for exploring EA Forum, LessWrong and Alignment Forum—Tree of Tags

Filip Sondej
13 Sep 2022 17:33 UTC
31 points
2 comments
1 min read
LW link

An investigation into when agents may be incentivized to manipulate our beliefs.

Felix Hofstätter
13 Sep 2022 17:08 UTC
15 points
0 comments
14 min read
LW link

Deep Q-Networks Explained

Jay Bailey
13 Sep 2022 12:01 UTC
55 points
4 comments
22 min read
LW link

Ideas of the Gaps

Q Home
13 Sep 2022 10:55 UTC
4 points
3 comments
12 min read
LW link

[Question] Which LessWrong content would you like recorded into audio/podcast form?

Ruby
13 Sep 2022 1:20 UTC
29 points
11 comments
1 min read
LW link

How To Actually Succeed

Jordan Arel
13 Sep 2022 0:21 UTC
2 points
1 comment
5 min read
LW link

EA & LW Forums Weekly Summary (5 − 11 Sep 22′)

Zoe Williams
12 Sep 2022 23:24 UTC
24 points
0 comments
13 min read
LW link

Time is not the bottleneck (on making progress thinking about difficult things)

kman
12 Sep 2022 20:45 UTC
26 points
8 comments
1 min read
LW link

[Linkpost] A survey on over 300 works about interpretability in deep networks

scasper
12 Sep 2022 19:07 UTC
97 points
7 comments
2 min read
LW link
(arxiv.org)

Contemporary Linguistics: A Perspective on Research and Information Sharing

Miniman
12 Sep 2022 19:02 UTC
1 point
5 comments
3 min read
LW link

[Question] Why do People Think Intelligence Will be “Easy”?

DragonGod
12 Sep 2022 17:32 UTC
15 points
32 comments
2 min read
LW link

Alignment via prosocial brain algorithms

Cameron Berg
12 Sep 2022 13:48 UTC
42 points
28 comments
6 min read
LW link

I’ve written a Fantasy Novel to Promote Effective Altruism

Timothy Underwood
12 Sep 2022 12:14 UTC
23 points
21 comments
13 min read
LW link

Ideological Inference Engines: Making Deontology Differentiable*

Paul Bricman
12 Sep 2022 12:00 UTC
6 points
0 comments
14 min read
LW link

Freeloading?

jefftk
12 Sep 2022 11:20 UTC
28 points
24 comments
3 min read
LW link
(www.jefftk.com)

Can you force a neural network to keep generalizing?

Q Home
12 Sep 2022 10:14 UTC
2 points
10 comments
5 min read
LW link

Black Box Investigation Research Hackathon

12 Sep 2022 7:20 UTC
9 points
4 comments
2 min read
LW link

Argument against 20% GDP growth from AI within 10 years [Linkpost]

aogara
12 Sep 2022 4:08 UTC
59 points
21 comments
5 min read
LW link
(twitter.com)

AI Safety field-building projects I’d like to see

Akash
11 Sep 2022 23:43 UTC
44 points
7 comments
6 min read
LW link

Fermi Paradox: Iron Age Milky Way

Rofel Wodring
11 Sep 2022 20:32 UTC
−10 points
9 comments
3 min read
LW link