THE 3 WILLPOWER KEYS

GregorDeVillain4 Sep 2022 22:57 UTC
−11 points
0 comments4 min readLW link

What’s your Mis­sion?

GregorDeVillain4 Sep 2022 18:52 UTC
−4 points
1 comment6 min readLW link

EA, Ve­ganism and Nega­tive An­i­mal Utilitarianism

Yair Halberstadt4 Sep 2022 18:30 UTC
9 points
12 comments1 min readLW link

The ethics of re­clin­ing air­plane seats

braces4 Sep 2022 17:59 UTC
92 points
70 comments1 min readLW link

Rus­sian Food for Petrov Day

weft4 Sep 2022 17:57 UTC
17 points
5 comments1 min readLW link

Pro­to­typ­ing in C

jefftk4 Sep 2022 17:50 UTC
19 points
11 comments2 min readLW link
(www.jefftk.com)

Turn your flash­cards into Art

Heye Groß4 Sep 2022 17:31 UTC
16 points
2 comments1 min readLW link

Let’s Ter­raform West Texas

blackstampede4 Sep 2022 16:24 UTC
87 points
33 comments5 min readLW link

[Question] Help me find a good Hackathon sub­ject

Charbel-Raphaël4 Sep 2022 8:40 UTC
6 points
18 comments1 min readLW link

Bay Sols­tice 2022 Call For Volunteers

Scott Alexander4 Sep 2022 6:44 UTC
43 points
2 comments1 min readLW link

The shard the­ory of hu­man values

4 Sep 2022 4:28 UTC
235 points
66 comments24 min readLW link2 reviews

Break­ing New­comb’s Prob­lem with Non-Halt­ing states

Slimepriestess4 Sep 2022 4:01 UTC
18 points
9 comments5 min readLW link

Monthly Shorts 8/​22

Celer4 Sep 2022 2:30 UTC
3 points
0 comments7 min readLW link
(keller.substack.com)

Fully Live Elec­tronic Contra

jefftk4 Sep 2022 1:30 UTC
9 points
0 comments1 min readLW link
(www.jefftk.com)

How To Know What the AI Knows—An ELK Distillation

Fabien Roger4 Sep 2022 0:46 UTC
7 points
0 comments5 min readLW link

Pri­vate al­ign­ment re­search shar­ing and coordination

porby4 Sep 2022 0:01 UTC
62 points
13 comments5 min readLW link

AXRP Epi­sode 18 - Con­cept Ex­trap­o­la­tion with Stu­art Armstrong

DanielFilan3 Sep 2022 23:12 UTC
12 points
1 comment39 min readLW link

An Up­date on Academia vs. In­dus­try (one year into my fac­ulty job)

David Scott Krueger (formerly: capybaralet)3 Sep 2022 20:43 UTC
121 points
18 comments4 min readLW link

[Question] Re­quest for Align­ment Re­search Pro­ject Recommendations

Rauno Arike3 Sep 2022 15:29 UTC
10 points
2 comments1 min readLW link

Three sce­nar­ios of pseudo-al­ign­ment

Eleni Angelou3 Sep 2022 12:47 UTC
9 points
0 comments3 min readLW link

Bugs or Fea­tures?

qbolec3 Sep 2022 7:04 UTC
72 points
9 comments2 min readLW link

[Ex­plo­ra­tory] Seper­ate ex­plo­ra­tory writ­ing from pub­lic writing

Johannes C. Mayer3 Sep 2022 2:57 UTC
6 points
2 comments1 min readLW link

We may be able to see sharp left turns coming

3 Sep 2022 2:55 UTC
53 points
29 comments2 min readLW link

[Ex­plo­ra­tory] Ex­plo­ra­tory Writ­ing Info

Johannes C. Mayer3 Sep 2022 2:50 UTC
3 points
3 comments1 min readLW link

[Question] Can some­one ex­plain to me why most re­searchers think al­ign­ment is prob­a­bly some­thing that is hu­manly tractable?

iamthouthouarti3 Sep 2022 1:12 UTC
32 points
11 comments1 min readLW link

Be­havi­our Man­i­folds and the Hes­sian of the To­tal Loss—Notes and Criticism

Spencer Becker-Kahn3 Sep 2022 0:15 UTC
35 points
5 comments6 min readLW link

Sticky goals: a con­crete ex­per­i­ment for un­der­stand­ing de­cep­tive alignment

evhub2 Sep 2022 21:57 UTC
39 points
13 comments3 min readLW link

Agency en­g­ineer­ing: is AI-al­ign­ment “to hu­man in­tent” enough?

catubc2 Sep 2022 18:14 UTC
9 points
10 comments6 min readLW link

Hanover, Ger­many—ACX Mee­tups Every­where 2022

eikowagenknecht2 Sep 2022 17:31 UTC
2 points
0 comments1 min readLW link

Laz­i­ness in AI

Richard Henage2 Sep 2022 17:04 UTC
13 points
5 comments1 min readLW link

Ex­port­ing Han­gouts History

jefftk2 Sep 2022 15:00 UTC
20 points
0 comments2 min readLW link
(www.jefftk.com)

Simulators

janus2 Sep 2022 12:45 UTC
594 points
161 comments41 min readLW link8 reviews
(generative.ink)

Lev­el­ling Up in AI Safety Re­search Engineering

Gabriel Mukobi2 Sep 2022 4:59 UTC
57 points
9 comments17 min readLW link

Stop Dis­cour­ag­ing Microwave For­mula Preparation

jefftk2 Sep 2022 2:10 UTC
68 points
12 comments2 min readLW link
(www.jefftk.com)

A Richly In­ter­ac­tive AGI Align­ment Chart

lisperati2 Sep 2022 0:44 UTC
14 points
6 comments1 min readLW link

Ap­pendix: How to run a suc­cess­ful Ham­ming circle

CFAR!Duncan2 Sep 2022 0:22 UTC
35 points
6 comments7 min readLW link

Re­place­ment for PONR concept

Daniel Kokotajlo2 Sep 2022 0:09 UTC
58 points
6 comments2 min readLW link

AI co­or­di­na­tion needs clear wins

evhub1 Sep 2022 23:41 UTC
146 points
16 comments2 min readLW link1 review

Short story spec­u­lat­ing on pos­si­ble ram­ifi­ca­tions of AI on the art world

Yitz1 Sep 2022 21:15 UTC
30 points
8 comments3 min readLW link
(archiveofourown.org)

Why was progress so slow in the past?

jasoncrawford1 Sep 2022 20:26 UTC
54 points
31 comments6 min readLW link
(rootsofprogress.org)

AI Safety and Neigh­bor­ing Com­mu­ni­ties: A Quick-Start Guide, as of Sum­mer 2022

Sam Bowman1 Sep 2022 19:15 UTC
76 points
2 comments7 min readLW link

Gra­di­ent Hacker De­sign Prin­ci­ples From Biology

johnswentworth1 Sep 2022 19:03 UTC
60 points
13 comments3 min readLW link

Book re­view: Put Your Ass Where Your Heart Wants to Be

Ruhul1 Sep 2022 18:21 UTC
1 point
2 comments10 min readLW link

A Sur­vey of Foun­da­tional Meth­ods in In­verse Re­in­force­ment Learning

adamk1 Sep 2022 18:21 UTC
19 points
0 comments12 min readLW link

I Tripped and Be­came GPT! (And How This Up­dated My Timelines)

Frankophone1 Sep 2022 17:56 UTC
31 points
0 comments4 min readLW link

[Question] Fixed point the­ory (lo­cally (α,β,ψ) dom­i­nated con­trac­tive con­di­tion)

muzammil1 Sep 2022 17:56 UTC
0 points
3 comments1 min readLW link

Align­ment is hard. Com­mu­ni­cat­ing that, might be harder

Eleni Angelou1 Sep 2022 16:57 UTC
7 points
8 comments3 min readLW link

Covid 9/​1/​22: Meet the New Booster

Zvi1 Sep 2022 14:00 UTC
41 points
6 comments14 min readLW link
(thezvi.wordpress.com)

A Starter-kit for Ra­tion­al­ity Space

Jesse Hoogland1 Sep 2022 13:04 UTC
41 points
0 comments1 min readLW link
(github.com)

Pon­der­ing the paucity of vol­canic pro­fan­ity post Pom­peii perusal

CraigMichael1 Sep 2022 9:29 UTC
21 points
2 comments15 min readLW link