Understanding goals in complex systems

Johannes C. Mayer, 1 Dec 2022 23:49 UTC
9 points
0 comments, 1 min read, LW link
(www.youtube.com)

A challenge for AGI organizations, and a challenge for readers

1 Dec 2022 23:11 UTC
301 points
33 comments, 2 min read, LW link

Playing with Aerial Photos

jefftk, 1 Dec 2022 22:50 UTC
9 points
0 comments, 1 min read, LW link
(www.jefftk.com)

Take 1: We’re not going to reverse-engineer the AI.

Charlie Steiner, 1 Dec 2022 22:41 UTC
38 points
4 comments, 4 min read, LW link

Re-Examining LayerNorm

Eric Winsor, 1 Dec 2022 22:20 UTC
124 points
12 comments, 5 min read, LW link

The LessWrong 2021 Review: Intellectual Circle Expansion

1 Dec 2022 21:17 UTC
95 points
55 comments, 8 min read, LW link

The Plan − 2022 Update

johnswentworth, 1 Dec 2022 20:43 UTC
239 points
37 comments, 8 min read, LW link, 1 review

Finding gliders in the game of life

paulfchristiano, 1 Dec 2022 20:40 UTC
101 points
7 comments, 16 min read, LW link
(ai-alignment.com)

The Machine Stops (Chapter 9)

Justin Bullock, 1 Dec 2022 19:20 UTC
3 points
0 comments, 47 min read, LW link

Covid 12/1/22: China Protests

Zvi, 1 Dec 2022 17:10 UTC
38 points
2 comments, 10 min read, LW link
(thezvi.wordpress.com)

ChatGPT: First Impressions

specbug, 1 Dec 2022 16:36 UTC
18 points
2 comments, 13 min read, LW link
(sixeleven.in)

[LINK] - ChatGPT discussion

JanB, 1 Dec 2022 15:04 UTC
13 points
8 comments, 1 min read, LW link
(openai.com)

Research request (alignment strategy): Deep dive on “making AI solve alignment for us”

JanB, 1 Dec 2022 14:55 UTC
16 points
3 comments, 1 min read, LW link

Theories of impact for Science of Deep Learning

Marius Hobbhahn, 1 Dec 2022 14:39 UTC
21 points
0 comments, 11 min read, LW link

Safe Development of Hacker-AI Countermeasures – What if we are too late?

Erland Wittkotter, 1 Dec 2022 7:59 UTC
3 points
0 comments, 14 min read, LW link

Did ChatGPT just gaslight me?

ThomasW, 1 Dec 2022 5:41 UTC
123 points
45 comments, 9 min read, LW link
(aiwatchtower.substack.com)

SBF’s comments on ethics are no surprise to virtue ethicists

c.trout, 1 Dec 2022 4:18 UTC
36 points
30 comments, 16 min read, LW link

Notes on Caution

David Gross, 1 Dec 2022 3:05 UTC
14 points
0 comments, 19 min read, LW link

Reestablishing Reliable Sources: A System for Tagging URLs

Riley Mueller, 1 Dec 2022 2:27 UTC
7 points
1 comment, 3 min read, LW link

Seeking submissions for short AI-safety course proposals

Sergio, 1 Dec 2022 0:32 UTC
4 points
0 comments, 1 min read, LW link

SBF’s recent live interview at the DealBook Summit

agucova, 30 Nov 2022 23:11 UTC
12 points
0 comments, 1 min read, LW link

Announcing the incoming CEO for The Roots of Progress

jasoncrawford, 30 Nov 2022 23:04 UTC
16 points
0 comments, 1 min read, LW link
(rootsofprogress.org)

Has AI gone too far?

Boston Anderson, 30 Nov 2022 18:49 UTC
−15 points
3 comments, 1 min read, LW link

AGI Impossible due to Energy Constrains

TheKlaus, 30 Nov 2022 18:48 UTC
−11 points
13 comments, 1 min read, LW link

Biases are engines of cognition

30 Nov 2022 16:47 UTC
45 points
7 comments, 1 min read, LW link

[Question] Open phone recommendation for Elon.

YimbyGeorge, 30 Nov 2022 15:20 UTC
−13 points
3 comments, 1 min read, LW link

Be less scared of overconfidence

benkuhn, 30 Nov 2022 15:20 UTC
163 points
22 comments, 9 min read, LW link
(www.benkuhn.net)

LessWrong Lurkshop (apply by Dec 1st)

GradientDissenter, 30 Nov 2022 11:41 UTC
3 points
0 comments, 1 min read, LW link

AI takeover tabletop RPG: “The Treacherous Turn”

Daniel Kokotajlo, 30 Nov 2022 7:16 UTC
53 points
5 comments, 1 min read, LW link

Master plan spec: needs audit (logic and cooperative AI)

Quinn, 30 Nov 2022 6:10 UTC
13 points
5 comments, 7 min read, LW link

Neglected cause: automated fraud detection in academia through image analysis

Lao Mein, 30 Nov 2022 5:52 UTC
11 points
1 comment, 2 min read, LW link

The Good Place has a line that defines the human dilemma

William Gasarch, 30 Nov 2022 5:27 UTC
1 point
1 comment, 1 min read, LW link

Mental Abstractions

30 Nov 2022 5:07 UTC
4 points
1 comment, 5 min read, LW link

Bedbugs are a Solved Problem—DIY Bio-Weapon works.

sapphire, 30 Nov 2022 3:40 UTC
58 points
4 comments, 1 min read, LW link

[Question] Do any of the AI Risk evaluations focus on humans as the risk?

jmh, 30 Nov 2022 3:09 UTC
10 points
8 comments, 1 min read, LW link

Sense-making around the FTX catastrophe: a deep dive podcast episode we just released

spencerg, 30 Nov 2022 1:54 UTC
12 points
1 comment, 1 min read, LW link

Multi-Component Learning and S-Curves

30 Nov 2022 1:37 UTC
61 points
24 comments, 7 min read, LW link

EA & LW Forums Weekly Summary (14th Nov − 27th Nov 22′)

Zoe Williams, 29 Nov 2022 23:00 UTC
21 points
1 comment, 1 min read, LW link

Distinguishing test from training

So8res, 29 Nov 2022 21:41 UTC
68 points
11 comments, 6 min read, LW link

Progress links and tweets, 2022-11-29

jasoncrawford, 29 Nov 2022 20:54 UTC
9 points
0 comments, 1 min read, LW link
(rootsofprogress.org)

Preventing atherosclerosis, the easiest way to improve your life expectancy?

Eli_, 29 Nov 2022 20:05 UTC
24 points
9 comments, 15 min read, LW link

Why Would AI “Aim” To Defeat Humanity?

HoldenKarnofsky, 29 Nov 2022 19:30 UTC
68 points
9 comments, 33 min read, LW link
(www.cold-takes.com)

Why Bet Kelly?

Joe Zimmerman, 29 Nov 2022 18:47 UTC
16 points
4 comments, 4 min read, LW link

SSC/ACX Meetup

svfritz, 29 Nov 2022 16:32 UTC
1 point
1 comment, 1 min read, LW link

Is Constructor Theory a useful tool for AI alignment?

A.H., 29 Nov 2022 12:35 UTC
11 points
8 comments, 26 min read, LW link

Alignment allows “nonrobust” decision-influences and doesn’t require robust grading

TurnTrout, 29 Nov 2022 6:23 UTC
60 points
42 comments, 15 min read, LW link

Cambridge LW Meetup: Lifehacks

29 Nov 2022 5:45 UTC
2 points
0 comments, 1 min read, LW link

[Question] Will chat logs and other records of our lives be maintained indefinitely by the advertising industry?

mako yass, 29 Nov 2022 0:30 UTC
14 points
8 comments, 1 min read, LW link

ACX meetup [December]

sallatik, 28 Nov 2022 22:06 UTC
2 points
0 comments, 1 min read, LW link

Using mechanistic interpretability to find in-distribution failure in toy transformers

Charlie George, 28 Nov 2022 19:39 UTC
6 points
0 comments, 4 min read, LW link