[Question] Where’s the economic incentive for wokism coming from?

Valentine · 8 Dec 2022 23:28 UTC
12 points
105 comments · 1 min read · LW link

I Believe we are in a Hardware Overhang

nem · 8 Dec 2022 23:18 UTC
8 points
0 comments · 1 min read · LW link

Of pumpkins, the Falcon Heavy, and Groucho Marx: High-Level discourse structure in ChatGPT

Bill Benzon · 8 Dec 2022 22:25 UTC
2 points
0 comments · 8 min read · LW link

How Many Lives Does X-Risk Work Save From Nonexistence On Average?

Jordan Arel · 8 Dec 2022 21:57 UTC
4 points
5 comments · 14 min read · LW link

AI Safety Seems Hard to Measure

HoldenKarnofsky · 8 Dec 2022 19:50 UTC
71 points
6 comments · 14 min read · LW link
(www.cold-takes.com)

Playing shell games with definitions

weverka · 8 Dec 2022 19:35 UTC
9 points
3 comments · 1 min read · LW link

Notes on OpenAI’s alignment plan

Alex Flint · 8 Dec 2022 19:13 UTC
40 points
5 comments · 7 min read · LW link

Relevant to natural abstractions: Euclidean Symmetry Equivariant Machine Learning - Overview, Applications, and Open Questions

the gears to ascension · 8 Dec 2022 18:01 UTC
8 points
0 comments · 1 min read · LW link
(youtu.be)

I’ve started publishing the novel I wrote to promote EA

Timothy Underwood · 8 Dec 2022 17:30 UTC
10 points
3 comments · 1 min read · LW link

Neural networks biased towards geometrically simple functions?

DavidHolmes · 8 Dec 2022 16:16 UTC
16 points
2 comments · 3 min read · LW link

If Wentworth is right about natural abstractions, it would be bad for alignment

Wuschel Schulz · 8 Dec 2022 15:19 UTC
28 points
5 comments · 4 min read · LW link

Covid 12/8/22: Another Winter Wave

Zvi · 8 Dec 2022 14:40 UTC
23 points
8 comments · 11 min read · LW link
(thezvi.wordpress.com)

Why I’m Sceptical of Foom

DragonGod · 8 Dec 2022 10:01 UTC
20 points
36 comments · 3 min read · LW link

Take 7: You should talk about “the human’s utility function” less.

Charlie Steiner · 8 Dec 2022 8:14 UTC
50 points
22 comments · 2 min read · LW link

Machine Learning Consent

jefftk · 8 Dec 2022 3:50 UTC
38 points
14 comments · 3 min read · LW link
(www.jefftk.com)

Riffing on the agent type

Quinn · 8 Dec 2022 0:19 UTC
21 points
3 comments · 4 min read · LW link

[Question] Looking for ideas of public assets (stocks, funds, ETFs) that I can invest in to have a chance at profiting from the mass adoption and commercialization of AI technology

Annapurna · 7 Dec 2022 22:35 UTC
15 points
9 comments · 1 min read · LW link

A Fallibilist Wordview

Toni MUENDEL · 7 Dec 2022 20:59 UTC
−13 points
2 comments · 13 min read · LW link

Thoughts on AGI organizations and capabilities work

7 Dec 2022 19:46 UTC
102 points
17 comments · 5 min read · LW link

How to Think About Climate Models and How to Improve Them

clans · 7 Dec 2022 19:37 UTC
7 points
0 comments · 2 min read · LW link
(locationtbd.home.blog)

The novelty quotient

River Lewis · 7 Dec 2022 17:16 UTC
4 points
7 comments · 2 min read · LW link
(heytraveler.substack.com)

ChatGPT: “An error occurred. If this issue persists...”

Bill Benzon · 7 Dec 2022 15:41 UTC
5 points
11 comments · 3 min read · LW link

Take 6: CAIS is actually Orwellian.

Charlie Steiner · 7 Dec 2022 13:50 UTC
14 points
8 comments · 2 min read · LW link

Peter Thiel on Technological Stagnation and Out of Touch Rationalists

Matt Goldenberg · 7 Dec 2022 13:15 UTC
9 points
26 comments · 1 min read · LW link
(youtu.be)

[Link] Wavefunctions: from Linear Algebra to Spinors

sen · 7 Dec 2022 12:44 UTC
11 points
12 comments · 1 min read · LW link
(paperclip.substack.com)

Why I like Zulip instead of Slack or Discord

Alok Singh · 7 Dec 2022 9:28 UTC
31 points
10 comments · 1 min read · LW link

Bioweapons, and ChatGPT (another vulnerability story)

joshuatanderson · 7 Dec 2022 7:27 UTC
−5 points
0 comments · 2 min read · LW link

Where to be an AI Safety Professor

scasper · 7 Dec 2022 7:09 UTC
30 points
12 comments · 2 min read · LW link

[Question] Are there any tools to convert LW sequences to PDF or any other file format?

quetzal_rainbow · 7 Dec 2022 5:28 UTC
2 points
2 comments · 1 min read · LW link

Manifold Markets community meetup

Sinclair Chen · 7 Dec 2022 3:25 UTC
4 points
0 comments · 1 min read · LW link

“Attention Passengers”: not for Signs

jefftk · 7 Dec 2022 2:00 UTC
27 points
10 comments · 1 min read · LW link
(www.jefftk.com)

[ASoT] Probability Infects Concepts it Touches

Ulisse Mini · 7 Dec 2022 1:48 UTC
10 points
4 comments · 1 min read · LW link

Simple Way to Prevent Power-Seeking AI

research_prime_space · 7 Dec 2022 0:26 UTC
12 points
1 comment · 1 min read · LW link

In defense of probably wrong mechanistic models

evhub · 6 Dec 2022 23:24 UTC
53 points
10 comments · 2 min read · LW link

AI Safety in a Vulnerable World: Requesting Feedback on Preliminary Thoughts

Jordan Arel · 6 Dec 2022 22:35 UTC
4 points
2 comments · 3 min read · LW link

ChatGPT and the Human Race

Ben Reilly · 6 Dec 2022 21:38 UTC
6 points
1 comment · 3 min read · LW link

[Question] How do finite factored sets compare with phase space?

Alex_Altair · 6 Dec 2022 20:05 UTC
15 points
1 comment · 1 min read · LW link

Mesa-Optimizers via Grokking

orthonormal · 6 Dec 2022 20:05 UTC
36 points
4 comments · 6 min read · LW link

Using GPT-Eliezer against ChatGPT Jailbreaking

6 Dec 2022 19:54 UTC
170 points
85 comments · 9 min read · LW link

The Parable of the Crimp

Phosphorous · 6 Dec 2022 18:41 UTC
11 points
3 comments · 3 min read · LW link

The Categorical Imperative Obscures

Gordon Seidoh Worley · 6 Dec 2022 17:48 UTC
17 points
17 comments · 2 min read · LW link

MIRI’s “Death with Dignity” in 60 seconds.

Cleo Nardo · 6 Dec 2022 17:18 UTC
55 points
4 comments · 1 min read · LW link

Things roll downhill

awenonian · 6 Dec 2022 15:27 UTC
19 points
0 comments · 1 min read · LW link

EA & LW Forums Weekly Summary (28th Nov - 4th Dec 22′)

Zoe Williams · 6 Dec 2022 9:38 UTC
10 points
1 comment · 1 min read · LW link

Free Will is [REDACTED]

lsusr · 6 Dec 2022 8:14 UTC
−5 points
21 comments · 1 min read · LW link

Take 5: Another problem for natural abstractions is laziness.

Charlie Steiner · 6 Dec 2022 7:00 UTC
30 points
4 comments · 3 min read · LW link

Verification Is Not Easier Than Generation In General

johnswentworth · 6 Dec 2022 5:20 UTC
60 points
27 comments · 1 min read · LW link

Shh, don’t tell the AI it’s likely to be evil

naterush · 6 Dec 2022 3:35 UTC
19 points
9 comments · 1 min read · LW link

[Question] What are the major underlying divisions in AI safety?

Chris_Leong · 6 Dec 2022 3:28 UTC
5 points
2 comments · 1 min read · LW link

[Link] Why I’m optimistic about OpenAI’s alignment approach

janleike · 5 Dec 2022 22:51 UTC
98 points
15 comments · 1 min read · LW link
(aligned.substack.com)