Leav­ing Orbit

Rob Bensinger6 Dec 2021 21:48 UTC
50 points
17 comments1 min readLW link

Declus­ter­ing, reclus­ter­ing, and filling in thingspace

Stuart_Armstrong6 Dec 2021 20:53 UTC
16 points
6 comments3 min readLW link

More Chris­ti­ano, Co­tra, and Yud­kowsky on AI progress

6 Dec 2021 20:33 UTC
91 points
28 comments40 min readLW link

Are there al­ter­na­tive to solv­ing value trans­fer and ex­trap­o­la­tion?

Stuart_Armstrong6 Dec 2021 18:53 UTC
20 points
8 comments5 min readLW link

Im­pli­ca­tions of the Grabby Aliens Model

harsimony6 Dec 2021 18:34 UTC
2 points
3 comments2 min readLW link
(harsimony.wordpress.com)

A Pos­si­ble Re­s­olu­tion To Spu­ri­ous Counterfactuals

JoshuaOSHickman6 Dec 2021 18:26 UTC
15 points
5 comments4 min readLW link

In­for­ma­tion bot­tle­neck for coun­ter­fac­tual corrigibility

tailcalled6 Dec 2021 17:11 UTC
8 points
1 comment7 min readLW link

Omicron Post #4

Zvi6 Dec 2021 17:00 UTC
153 points
66 comments15 min readLW link
(thezvi.wordpress.com)

Life, strug­gle, and the psy­cholog­i­cal fal­lout from COVID

Alex Flint6 Dec 2021 16:59 UTC
15 points
1 comment8 min readLW link

Model­ing Failure Modes of High-Level Ma­chine Intelligence

6 Dec 2021 13:54 UTC
54 points
1 comment12 min readLW link

A Frame­work to Ex­plain Bayesian Models

Jsevillamol6 Dec 2021 10:38 UTC
24 points
1 comment8 min readLW link

Anti-cor­re­lated causation

DirectedEvolution6 Dec 2021 4:36 UTC
18 points
2 comments3 min readLW link

ML Align­ment The­ory Pro­gram un­der Evan Hubinger

6 Dec 2021 0:03 UTC
82 points
3 comments2 min readLW link

[Question] Are limited-hori­zon agents a good heuris­tic for the off-switch prob­lem?

Yonadav Shavit5 Dec 2021 19:27 UTC
6 points
19 comments1 min readLW link

Ex­plicit model visualizations

Dorian Stern vukotic5 Dec 2021 18:37 UTC
12 points
0 comments12 min readLW link

[Question] What would be the pros and cons of a gov­ern­ment-backed, prop­erty-based cryp­tocur­rency?

Asgård5 Dec 2021 17:57 UTC
−3 points
6 comments2 min readLW link

In­ter­pret­ing Yud­kowsky on Deep vs Shal­low Knowledge

adamShimi5 Dec 2021 17:32 UTC
100 points
32 comments24 min readLW link

Scott Alexan­der’s “Iver­mectin: Much More Than You Wanted To Know”

Raemon5 Dec 2021 5:43 UTC
13 points
2 comments1 min readLW link2 reviews
(astralcodexten.substack.com)

[Question] What have your ro­man­tic ex­pe­riences with non-EAs/​non-Ra­tion­al­ists been like?

Randomized, Controlled5 Dec 2021 4:41 UTC
46 points
24 comments1 min readLW link

Be­hav­ior Clon­ing is Miscalibrated

leogao5 Dec 2021 1:36 UTC
77 points
3 comments3 min readLW link

Pri­vacy and Manipulation

Raemon5 Dec 2021 0:39 UTC
78 points
41 comments8 min readLW link

Covid Christmas

jefftk4 Dec 2021 22:00 UTC
20 points
6 comments2 min readLW link
(www.jefftk.com)

A Gen­er­al­iza­tion of ROC AUC for Bi­nary Classifiers

Adam Scherlis4 Dec 2021 21:47 UTC
10 points
0 comments2 min readLW link
(adam.scherlis.com)

Agents as P₂B Chain Reactions

Daniel Kokotajlo4 Dec 2021 21:35 UTC
18 points
0 comments2 min readLW link

Agency: What it is and why it matters

Daniel Kokotajlo4 Dec 2021 21:32 UTC
25 points
2 comments2 min readLW link

Ad­vanc­ing Math­e­mat­ics By Guid­ing Hu­man In­tu­ition With AI

interstice4 Dec 2021 20:00 UTC
5 points
0 comments1 min readLW link
(www.nature.com)

[Question] Misc. ques­tions about EfficientZero

Daniel Kokotajlo4 Dec 2021 19:45 UTC
51 points
17 comments1 min readLW link

Lars Doucet’s Ge­or­gism se­ries on As­tral Codex Ten

Sune4 Dec 2021 19:43 UTC
13 points
2 comments1 min readLW link1 review
(astralcodexten.substack.com)

[Question] What are the limi­ta­tions on poli­ti­cally mo­ti­vated re­lo­ca­tion?

Asgård4 Dec 2021 15:16 UTC
9 points
2 comments1 min readLW link

[Question] Should we post­pone get­ting a booster due to Omicron, till there are Omicron-spe­cific boost­ers?

ChristianKl4 Dec 2021 12:46 UTC
33 points
18 comments1 min readLW link

Can solip­sism be dis­proven?

nx20594 Dec 2021 8:24 UTC
−2 points
6 comments2 min readLW link

Shul­man and Yud­kowsky on AI progress

3 Dec 2021 20:05 UTC
90 points
16 comments20 min readLW link

[Linkpost] A Gen­eral Lan­guage As­sis­tant as a Lab­o­ra­tory for Alignment

Quintin Pope3 Dec 2021 19:42 UTC
37 points
2 comments2 min readLW link

Browser Engines

jefftk3 Dec 2021 19:30 UTC
23 points
0 comments2 min readLW link
(www.jefftk.com)

The Learn­ing System

Henrik Karlsson3 Dec 2021 19:08 UTC
18 points
12 comments10 min readLW link
(escapingflatland.substack.com)

[Question] Does the Struc­ture of an al­gorithm mat­ter for AI Risk and/​or con­scious­ness?

Logan Zoellner3 Dec 2021 18:31 UTC
7 points
4 comments1 min readLW link

A Med­i­ta­tive Experience

Yonatan Cale3 Dec 2021 17:58 UTC
4 points
1 comment4 min readLW link

$100/​$50 re­wards for good references

Stuart_Armstrong3 Dec 2021 16:55 UTC
20 points
5 comments1 min readLW link

[Question] Where in the world will a UBI de­velop first?

Asgård3 Dec 2021 15:54 UTC
4 points
10 comments1 min readLW link

Rus­sian x-risks newslet­ter fall 2021

avturchin3 Dec 2021 13:06 UTC
29 points
2 comments1 min readLW link

Se­cond-or­der se­lec­tion against the immortal

Malmesbury3 Dec 2021 5:01 UTC
44 points
47 comments6 min readLW link

For­mal­iz­ing Policy-Mod­ifi­ca­tion Corrigibility

TurnTrout3 Dec 2021 1:31 UTC
25 points
6 comments6 min readLW link

Fore­cast­ing Newslet­ter: Novem­ber 2021

NunoSempere2 Dec 2021 21:44 UTC
18 points
2 comments6 min readLW link

“In­fo­haz­ard” is a pre­dom­i­nantly con­flict-the­o­retic concept

jessicata2 Dec 2021 17:54 UTC
45 points
17 comments14 min readLW link
(unstableontology.com)

Covid 12/​2: But Aside From That

Zvi2 Dec 2021 16:20 UTC
36 points
11 comments9 min readLW link
(thezvi.wordpress.com)

Omicron Post #3

Zvi2 Dec 2021 15:10 UTC
57 points
15 comments13 min readLW link
(thezvi.wordpress.com)

Built-in Mea­sur­ing Spoons

jefftk2 Dec 2021 13:20 UTC
10 points
0 comments1 min readLW link
(www.jefftk.com)

Covid Pre­dic­tion Mar­kets at Polymarket

Zvi2 Dec 2021 12:50 UTC
39 points
10 comments7 min readLW link
(thezvi.wordpress.com)

Syd­ney AI Safety Fellowship

Chris_Leong2 Dec 2021 7:34 UTC
22 points
0 comments2 min readLW link

Mo­ral­ity is Scary

Wei Dai2 Dec 2021 6:35 UTC
193 points
116 comments4 min readLW link1 review