[Question] Posts with click­able sec­tions of images?

NoBadCake23 Sep 2022 23:19 UTC
1 point
5 comments1 min readLW link

Un­der what cir­cum­stances have gov­ern­ments can­cel­led AI-type sys­tems?

David Gross23 Sep 2022 21:11 UTC
7 points
1 comment1 min readLW link
(www.carnegieuktrust.org.uk)

There are no rules

unoptimal23 Sep 2022 20:47 UTC
34 points
2 comments5 min readLW link

In­ter­pret­ing Neu­ral Net­works through the Poly­tope Lens

23 Sep 2022 17:58 UTC
136 points
29 comments33 min readLW link

The het­ero­gene­ity of hu­man value types: Im­pli­ca­tions for AI alignment

geoffreymiller23 Sep 2022 17:03 UTC
10 points
2 comments10 min readLW link

How to use DMT with­out go­ing in­sane: On nav­i­gat­ing epistemic un­cer­tainty in the DMT memeplex

cube_flipper23 Sep 2022 16:32 UTC
1 point
4 comments8 min readLW link
(smoothbrains.net)

Sha­har Avin On How To Reg­u­late Ad­vanced AI Systems

Michaël Trazzi23 Sep 2022 15:46 UTC
31 points
0 comments4 min readLW link
(theinsideview.ai)

In­ter­lude: But Who Op­ti­mizes The Op­ti­mizer?

Paul Bricman23 Sep 2022 15:30 UTC
15 points
0 comments10 min readLW link

Why do so many things break in a 2 el­e­ment set?

Alok Singh23 Sep 2022 6:30 UTC
6 points
3 comments1 min readLW link
(alok.github.io)

In­tel­li­gence as a Platform

Robert Kennedy23 Sep 2022 5:51 UTC
10 points
5 comments3 min readLW link

Public-fac­ing Cen­sor­ship Is Safety Theater, Caus­ing Rep­u­ta­tional Da­m­age

Yitz23 Sep 2022 5:08 UTC
149 points
42 comments6 min readLW link

A game of mattering

KatjaGrace23 Sep 2022 2:30 UTC
64 points
7 comments5 min readLW link
(worldspiritsockpuppet.com)

Mak­ing Prunes

jefftk23 Sep 2022 2:13 UTC
10 points
0 comments1 min readLW link
(www.jefftk.com)

Fund­ing is All You Need: Get­ting into Grad School by Hack­ing the NSF GRFP Fellowship

hapanin22 Sep 2022 21:39 UTC
103 points
9 comments12 min readLW link

[Question] What Do AI Safety Pitches Not Get About Your Field?

Aris22 Sep 2022 21:27 UTC
28 points
3 comments1 min readLW link

“Free Will” in a Com­pu­ta­tional Universe

DragonGod22 Sep 2022 21:25 UTC
5 points
6 comments14 min readLW link

Ini­tial Thoughts on Dis­solv­ing “Could­ness”

DragonGod22 Sep 2022 21:23 UTC
6 points
1 comment3 min readLW link

Let’s Com­pare Notes

Shoshannah Tekofsky22 Sep 2022 20:47 UTC
17 points
3 comments6 min readLW link

Method­olog­i­cal Ther­apy: An Agenda For Tack­ling Re­search Bottlenecks

22 Sep 2022 18:41 UTC
54 points
6 comments9 min readLW link

Berkeley group house, spots open

Jack R22 Sep 2022 17:13 UTC
4 points
1 comment1 min readLW link

Fake qual­ities of mind

Kaj_Sotala22 Sep 2022 16:40 UTC
58 points
2 comments2 min readLW link
(kajsotala.fi)

Dath Ilan’s Views on Stop­gap Corrigibility

David Udell22 Sep 2022 16:16 UTC
77 points
19 comments13 min readLW link
(www.glowfic.com)

Ukraine Post #12

Zvi22 Sep 2022 14:40 UTC
104 points
3 comments16 min readLW link
(thezvi.wordpress.com)

Covid 9/​22/​22: The Joe Bi­den Sings

Zvi22 Sep 2022 14:40 UTC
15 points
17 comments24 min readLW link
(thezvi.wordpress.com)

AI Risk In­tro 2: Solv­ing The Problem

22 Sep 2022 13:55 UTC
22 points
0 comments27 min readLW link

Un­der­stand­ing In­fra-Bayesi­anism: A Begin­ner-Friendly Video Series

22 Sep 2022 13:25 UTC
140 points
6 comments2 min readLW link

[Question] Is the game de­sign/​art maxim more gen­er­al­iz­able to crit­i­cism/​praise it­self?

Noosphere8922 Sep 2022 13:19 UTC
4 points
1 comment1 min readLW link

[Question] AI career

ondragon22 Sep 2022 3:48 UTC
2 points
0 comments1 min readLW link

Math­e­mat­i­cal Cir­cuits in Neu­ral Networks

Sean Osier22 Sep 2022 3:48 UTC
34 points
4 comments1 min readLW link
(www.youtube.com)

LW Petrov Day 2022 (Mon­day, 9/​26)

Ruby22 Sep 2022 2:56 UTC
121 points
111 comments5 min readLW link

Tues­day Fam­ily Dinner

jefftk22 Sep 2022 2:40 UTC
18 points
0 comments1 min readLW link
(www.jefftk.com)

Toy Models of Superposition

evhub21 Sep 2022 23:48 UTC
68 points
4 comments5 min readLW link1 review
(transformer-circuits.pub)

How to Train Your AGI Dragon

Oren Montano21 Sep 2022 22:28 UTC
−1 points
3 comments5 min readLW link

An is­sue with MacAskill’s Ev­i­den­tial­ist’s Wager

Martín Soto21 Sep 2022 22:02 UTC
1 point
9 comments4 min readLW link

An­nounc­ing AISIC 2022 - the AI Safety Is­rael Con­fer­ence, Oc­to­ber 19-20

Davidmanheim21 Sep 2022 19:32 UTC
13 points
0 comments1 min readLW link

Nearcast-based “de­ploy­ment prob­lem” analysis

HoldenKarnofsky21 Sep 2022 18:52 UTC
85 points
2 comments26 min readLW link

Scrap­ing train­ing data for your mind

Henrik Karlsson21 Sep 2022 16:27 UTC
47 points
4 comments8 min readLW link
(escapingflatland.substack.com)

Trends in Train­ing Dataset Sizes

Pablo Villalobos21 Sep 2022 15:47 UTC
25 points
2 comments5 min readLW link
(epochai.org)

[Question] Can you define “util­ity” in util­i­tar­i­anism with­out us­ing words for spe­cific hu­man emo­tions?

SurvivalBias21 Sep 2022 3:29 UTC
13 points
46 comments1 min readLW link

“In­fo­haz­ards” The ML Field’s Great­est Ex­cuse.

Puffy Bird21 Sep 2022 3:19 UTC
−3 points
1 comment3 min readLW link

Case Rates to Se­quenc­ing Reads

jefftk21 Sep 2022 2:00 UTC
15 points
4 comments4 min readLW link
(www.jefftk.com)

Towards de­con­fus­ing wire­head­ing and re­ward maximization

leogao21 Sep 2022 0:36 UTC
81 points
7 comments4 min readLW link

[Question] What key nu­tri­ents are re­quired for daily en­ergy?

trevor20 Sep 2022 23:30 UTC
6 points
4 comments1 min readLW link

Quan­tified In­tu­itions: An epistemics train­ing web­site in­clud­ing a new EA-themed cal­ibra­tion app

20 Sep 2022 22:25 UTC
28 points
2 comments2 min readLW link

The Redac­tion Machine

Ben20 Sep 2022 22:03 UTC
495 points
46 comments27 min readLW link1 review

You Are Not Mea­sur­ing What You Think You Are Measuring

johnswentworth20 Sep 2022 20:04 UTC
369 points
44 comments8 min readLW link2 reviews

What hap­pened to the idea of progress?

jasoncrawford20 Sep 2022 19:56 UTC
8 points
2 comments1 min readLW link
(bigthink.com)

Fea­tures and Antifeatures

Davis_Kingsley20 Sep 2022 17:54 UTC
23 points
8 comments1 min readLW link

Cryp­tocur­rency Ex­ploits Show the Im­por­tance of Proac­tive Poli­cies for AI X-Risk

eSpencer20 Sep 2022 17:53 UTC
1 point
0 comments4 min readLW link

Align­ment Org Cheat Sheet

20 Sep 2022 17:36 UTC
69 points
8 comments4 min readLW link