Seek­ing beta read­ers who are ig­no­rant of biol­ogy but knowl­edge­able about AI safety

Holly_Elmore27 Jul 2022 23:02 UTC
11 points
6 comments1 min readLW link

Prin­ci­ples of Pri­vacy for Align­ment Research

johnswentworth27 Jul 2022 19:53 UTC
72 points
30 comments7 min readLW link

Mo­ral strate­gies at differ­ent ca­pa­bil­ity levels

Richard_Ngo27 Jul 2022 18:50 UTC
112 points
14 comments5 min readLW link
(thinkingcomplete.blogspot.com)

Progress links and tweets, 2022-07-27

jasoncrawford27 Jul 2022 17:20 UTC
18 points
0 comments1 min readLW link
(rootsofprogress.org)

Quan­tum Ad­van­tage in Learn­ing from Experiments

Dennis Towne27 Jul 2022 15:49 UTC
5 points
5 comments1 min readLW link
(ai.googleblog.com)

Levels of Pluralism

adamShimi27 Jul 2022 9:35 UTC
34 points
0 comments14 min readLW link

Hu­man tri­als for the Mar­burg vac­cine: fund­ing op­por­tu­nity?

americanwalrus27 Jul 2022 5:53 UTC
3 points
0 comments1 min readLW link
(www.independent.co.uk)

[Question] “Fa­nat­i­cal” Longter­mists: Why is Pas­cal’s Wager wrong?

Yitz27 Jul 2022 4:16 UTC
3 points
7 comments1 min readLW link

Unify­ing Bar­gain­ing No­tions (2/​2)

Diffractor27 Jul 2022 3:40 UTC
116 points
19 comments21 min readLW link

AGI ruin sce­nar­ios are likely (and dis­junc­tive)

So8res27 Jul 2022 3:21 UTC
174 points
38 comments6 min readLW link

Tech­noc­racy and the Space Age

jasoncrawford26 Jul 2022 23:14 UTC
25 points
5 comments2 min readLW link
(rootsofprogress.org)

«Boundaries», Part 1: a key miss­ing con­cept from util­ity theory

Andrew_Critch26 Jul 2022 23:03 UTC
158 points
32 comments7 min readLW link

In­co­her­ence of un­bounded selfishness

emmab26 Jul 2022 22:27 UTC
−6 points
2 comments1 min readLW link

«Boundaries» Se­quence (In­dex Post)

Andrew_Critch26 Jul 2022 19:12 UTC
25 points
1 comment1 min readLW link

Ac­tive In­fer­ence as a for­mal­i­sa­tion of in­stru­men­tal convergence

Roman Leventov26 Jul 2022 17:55 UTC
12 points
2 comments3 min readLW link
(direct.mit.edu)

NeurIPS ML Safety Work­shop 2022

Dan H26 Jul 2022 15:28 UTC
72 points
2 comments1 min readLW link
(neurips2022.mlsafety.org)

AI ethics vs AI alignment

Wei Dai26 Jul 2022 13:08 UTC
5 points
1 comment1 min readLW link

Utility func­tions and prob­a­bil­ities are entangled

Thomas Kwa26 Jul 2022 5:36 UTC
15 points
5 comments1 min readLW link

How Promis­ing is The­o­ret­i­cal Re­search on Ra­tion­al­ity? Seek­ing Ca­reer Advice

Aspirant22326 Jul 2022 1:08 UTC
3 points
3 comments3 min readLW link

Pre­dic­tion mar­kets meetup/​cowork­ing (hosted by Man­i­fold Mar­kets)

26 Jul 2022 0:14 UTC
2 points
0 comments1 min readLW link

Align­ment be­ing im­pos­si­ble might be bet­ter than it be­ing re­ally difficult

Martín Soto25 Jul 2022 23:57 UTC
13 points
2 comments2 min readLW link

[Question] How op­ti­mistic should we be about AI figur­ing out how to in­ter­pret it­self?

oh5432125 Jul 2022 22:09 UTC
3 points
1 comment1 min readLW link

Pro­tec­tion­ism in One Coun­try: How In­dus­trial Policy Worked in Canada

Davis Kedrosky25 Jul 2022 22:08 UTC
5 points
0 comments16 min readLW link
(daviskedrosky.substack.com)

Mis­takes as agency

pchvykov25 Jul 2022 16:17 UTC
12 points
8 comments4 min readLW link

My Bit­coin Th­e­sis @2022 - Part 1

aysajan25 Jul 2022 15:49 UTC
6 points
6 comments13 min readLW link

The Reader’s Guide to Op­ti­mal Mone­tary Policy

Ege Erdil25 Jul 2022 15:10 UTC
56 points
10 comments14 min readLW link

AGI Safety Needs Peo­ple With All Skil­lsets!

Severin T. Seehrich25 Jul 2022 13:32 UTC
28 points
0 comments2 min readLW link

[Question] Is there any ev­i­dence that hand­wash­ing does any­thing to pre­vent COVID?

mukashi25 Jul 2022 7:34 UTC
4 points
3 comments1 min readLW link

Open­ing Ses­sion Tips & Advice

CFAR!Duncan25 Jul 2022 3:57 UTC
81 points
3 comments14 min readLW link1 review

How much should we worry about mesa-op­ti­miza­tion challenges?

sudo25 Jul 2022 3:56 UTC
4 points
13 comments2 min readLW link

[Question] Does agent foun­da­tions cover all fu­ture ML sys­tems?

Jonas Hallgren25 Jul 2022 1:17 UTC
2 points
0 comments1 min readLW link

Unify­ing Bar­gain­ing No­tions (1/​2)

Diffractor25 Jul 2022 0:28 UTC
204 points
41 comments16 min readLW link

Re­ward is not the op­ti­miza­tion target

TurnTrout25 Jul 2022 0:03 UTC
348 points
123 comments10 min readLW link3 reviews

Brain­storm of things that could force an AI team to burn their lead

So8res24 Jul 2022 23:58 UTC
134 points
8 comments13 min readLW link

Find­ing Skele­tons on Rashomon Ridge

24 Jul 2022 22:31 UTC
30 points
2 comments7 min readLW link

Gather­ing In­for­ma­tion you won’t use di­rectly is of­ten useful

Johannes C. Mayer24 Jul 2022 21:21 UTC
6 points
1 comment1 min readLW link

[Question] Im­pact of ” ‘Let’s think step by step’ is all you need”?

yrimon24 Jul 2022 20:59 UTC
20 points
2 comments1 min readLW link

The Most Im­por­tant Cen­tury: The Animation

24 Jul 2022 20:58 UTC
46 points
2 comments20 min readLW link
(youtu.be)

Hiring Pro­gram­mers in Academia

jefftk24 Jul 2022 20:20 UTC
36 points
19 comments2 min readLW link
(www.jefftk.com)

Less Wrong Bu­dapest July 30th Meetup

Richard Horvath24 Jul 2022 19:07 UTC
2 points
0 comments1 min readLW link

Re­la­tion­ship be­tween sub­jec­tive ex­pe­rience and in­tel­li­gence?

Q Home24 Jul 2022 9:10 UTC
5 points
4 comments9 min readLW link

Dou­ble Crux

CFAR!Duncan24 Jul 2022 6:34 UTC
56 points
9 comments11 min readLW link

Ex­am­ple Meetup Description

Julius24 Jul 2022 5:38 UTC
6 points
0 comments2 min readLW link

Eaves­drop­ping on Aliens: A Data De­cod­ing Challenge

anonymousaisafety24 Jul 2022 4:35 UTC
44 points
9 comments4 min readLW link

In­for­ma­tion the­o­retic model anal­y­sis may not lend much in­sight, but we may have been do­ing them wrong!

Garrett Baker24 Jul 2022 0:42 UTC
7 points
0 comments10 min readLW link

What’s next for in­stru­men­tal ra­tio­nal­ity?

Andrew_Critch23 Jul 2022 22:55 UTC
63 points
7 comments1 min readLW link

Easy guide for run­ning a lo­cal Ra­tion­al­ity meetup

Nikita Sokolsky23 Jul 2022 22:52 UTC
13 points
1 comment6 min readLW link

Cu­rat­ing “The Epistemic Se­quences” (list v.0.1)

Andrew_Critch23 Jul 2022 22:17 UTC
65 points
12 comments7 min readLW link

Room Opening

jefftk23 Jul 2022 21:00 UTC
8 points
3 comments4 min readLW link
(www.jefftk.com)

A Bias Against Altruism

Lone Pine23 Jul 2022 20:44 UTC
58 points
30 comments2 min readLW link