AI Gover­nance Needs Tech­ni­cal Work

Mau5 Sep 2022 22:28 UTC
41 points
1 comment8 min readLW link

pro­gram searches

Tamsin Leake5 Sep 2022 20:04 UTC
21 points
2 comments2 min readLW link
(carado.moe)

Over­ton Gym­nas­tics: An Ex­er­cise in Discomfort

5 Sep 2022 19:20 UTC
40 points
15 comments4 min readLW link

The Good King

GregorDeVillain5 Sep 2022 19:17 UTC
−6 points
0 comments13 min readLW link

Beta Read­ers are Great

HoldenKarnofsky5 Sep 2022 19:10 UTC
28 points
0 comments1 min readLW link
(www.cold-takes.com)

Im­pact Shares For Spec­u­la­tive Projects

Elizabeth5 Sep 2022 18:00 UTC
30 points
8 comments7 min readLW link
(acesounderglass.com)

An un­offi­cial “High­lights from the Se­quences” tier list

Akash5 Sep 2022 14:07 UTC
29 points
1 comment5 min readLW link

A Game About AI Align­ment (& Meta-Ethics): What Are the Must Haves?

JonathanErhardt5 Sep 2022 7:55 UTC
18 points
15 comments2 min readLW link

[Ex­plo­ra­tory] What does it mean that an ex­per­i­ment is high bit?

Johannes C. Mayer5 Sep 2022 3:13 UTC
5 points
0 comments2 min readLW link

(Link) I’m Miss­ing a Chunk of My Brain

mukashi5 Sep 2022 2:10 UTC
13 points
2 comments1 min readLW link
(www.nytimes.com)

THE 3 WILLPOWER KEYS

GregorDeVillain4 Sep 2022 22:57 UTC
−11 points
0 comments4 min readLW link

What’s your Mis­sion?

GregorDeVillain4 Sep 2022 18:52 UTC
−4 points
1 comment6 min readLW link

EA, Ve­ganism and Nega­tive An­i­mal Utilitarianism

Yair Halberstadt4 Sep 2022 18:30 UTC
9 points
12 comments1 min readLW link

The ethics of re­clin­ing air­plane seats

braces4 Sep 2022 17:59 UTC
92 points
70 comments1 min readLW link

Rus­sian Food for Petrov Day

weft4 Sep 2022 17:57 UTC
17 points
5 comments1 min readLW link

Pro­to­typ­ing in C

jefftk4 Sep 2022 17:50 UTC
19 points
11 comments2 min readLW link
(www.jefftk.com)

Turn your flash­cards into Art

Heye Groß4 Sep 2022 17:31 UTC
16 points
2 comments1 min readLW link

Let’s Ter­raform West Texas

blackstampede4 Sep 2022 16:24 UTC
87 points
33 comments5 min readLW link

[Question] Help me find a good Hackathon sub­ject

Charbel-Raphaël4 Sep 2022 8:40 UTC
6 points
18 comments1 min readLW link

Bay Sols­tice 2022 Call For Volunteers

Scott Alexander4 Sep 2022 6:44 UTC
43 points
2 comments1 min readLW link

The shard the­ory of hu­man values

4 Sep 2022 4:28 UTC
235 points
66 comments24 min readLW link2 reviews

Break­ing New­comb’s Prob­lem with Non-Halt­ing states

Slimepriestess4 Sep 2022 4:01 UTC
18 points
9 comments5 min readLW link

Monthly Shorts 8/​22

Celer4 Sep 2022 2:30 UTC
3 points
0 comments7 min readLW link
(keller.substack.com)

Fully Live Elec­tronic Contra

jefftk4 Sep 2022 1:30 UTC
9 points
0 comments1 min readLW link
(www.jefftk.com)

How To Know What the AI Knows—An ELK Distillation

Fabien Roger4 Sep 2022 0:46 UTC
7 points
0 comments5 min readLW link

Pri­vate al­ign­ment re­search shar­ing and coordination

porby4 Sep 2022 0:01 UTC
62 points
13 comments5 min readLW link

AXRP Epi­sode 18 - Con­cept Ex­trap­o­la­tion with Stu­art Armstrong

DanielFilan3 Sep 2022 23:12 UTC
12 points
1 comment39 min readLW link

An Up­date on Academia vs. In­dus­try (one year into my fac­ulty job)

David Scott Krueger (formerly: capybaralet)3 Sep 2022 20:43 UTC
121 points
18 comments4 min readLW link

[Question] Re­quest for Align­ment Re­search Pro­ject Recommendations

Rauno Arike3 Sep 2022 15:29 UTC
10 points
2 comments1 min readLW link

Three sce­nar­ios of pseudo-al­ign­ment

Eleni Angelou3 Sep 2022 12:47 UTC
9 points
0 comments3 min readLW link

Bugs or Fea­tures?

qbolec3 Sep 2022 7:04 UTC
72 points
9 comments2 min readLW link

[Ex­plo­ra­tory] Seper­ate ex­plo­ra­tory writ­ing from pub­lic writing

Johannes C. Mayer3 Sep 2022 2:57 UTC
6 points
2 comments1 min readLW link

We may be able to see sharp left turns coming

3 Sep 2022 2:55 UTC
53 points
29 comments2 min readLW link

[Ex­plo­ra­tory] Ex­plo­ra­tory Writ­ing Info

Johannes C. Mayer3 Sep 2022 2:50 UTC
3 points
3 comments1 min readLW link

[Question] Can some­one ex­plain to me why most re­searchers think al­ign­ment is prob­a­bly some­thing that is hu­manly tractable?

iamthouthouarti3 Sep 2022 1:12 UTC
32 points
11 comments1 min readLW link

Be­havi­our Man­i­folds and the Hes­sian of the To­tal Loss—Notes and Criticism

Spencer Becker-Kahn3 Sep 2022 0:15 UTC
35 points
5 comments6 min readLW link

Sticky goals: a con­crete ex­per­i­ment for un­der­stand­ing de­cep­tive alignment

evhub2 Sep 2022 21:57 UTC
39 points
13 comments3 min readLW link

Agency en­g­ineer­ing: is AI-al­ign­ment “to hu­man in­tent” enough?

catubc2 Sep 2022 18:14 UTC
9 points
10 comments6 min readLW link

Hanover, Ger­many—ACX Mee­tups Every­where 2022

eikowagenknecht2 Sep 2022 17:31 UTC
2 points
0 comments1 min readLW link

Laz­i­ness in AI

Richard Henage2 Sep 2022 17:04 UTC
13 points
5 comments1 min readLW link

Ex­port­ing Han­gouts History

jefftk2 Sep 2022 15:00 UTC
20 points
0 comments2 min readLW link
(www.jefftk.com)

Simulators

janus2 Sep 2022 12:45 UTC
594 points
161 comments41 min readLW link8 reviews
(generative.ink)

Lev­el­ling Up in AI Safety Re­search Engineering

Gabriel Mukobi2 Sep 2022 4:59 UTC
57 points
9 comments17 min readLW link

Stop Dis­cour­ag­ing Microwave For­mula Preparation

jefftk2 Sep 2022 2:10 UTC
68 points
12 comments2 min readLW link
(www.jefftk.com)

A Richly In­ter­ac­tive AGI Align­ment Chart

lisperati2 Sep 2022 0:44 UTC
14 points
6 comments1 min readLW link

Ap­pendix: How to run a suc­cess­ful Ham­ming circle

CFAR!Duncan2 Sep 2022 0:22 UTC
35 points
6 comments7 min readLW link

Re­place­ment for PONR concept

Daniel Kokotajlo2 Sep 2022 0:09 UTC
58 points
6 comments2 min readLW link

AI co­or­di­na­tion needs clear wins

evhub1 Sep 2022 23:41 UTC
146 points
16 comments2 min readLW link1 review

Short story spec­u­lat­ing on pos­si­ble ram­ifi­ca­tions of AI on the art world

Yitz1 Sep 2022 21:15 UTC
30 points
8 comments3 min readLW link
(archiveofourown.org)

Why was progress so slow in the past?

jasoncrawford1 Sep 2022 20:26 UTC
54 points
31 comments6 min readLW link
(rootsofprogress.org)