Feel­ing Old: Leav­ing your 20s in the 2020s

squidious22 Nov 2022 22:50 UTC
37 points
3 comments1 min readLW link
(opalsandbonobos.blogspot.com)

Brute-forc­ing the uni­verse: a non-stan­dard shot at di­a­mond alignment

Martín Soto22 Nov 2022 22:36 UTC
9 points
2 comments20 min readLW link

An­nounc­ing AI Align­ment Awards: $100k re­search con­tests about goal mis­gen­er­al­iza­tion & corrigibility

22 Nov 2022 22:19 UTC
73 points
20 comments4 min readLW link

ACX Zurich Novem­ber Meetup

MB22 Nov 2022 21:41 UTC
1 point
0 comments1 min readLW link

Hu­man-level Full-Press Di­plo­macy (some bare facts).

Cleo Nardo22 Nov 2022 20:59 UTC
50 points
7 comments3 min readLW link

[Question] How does late-2022 COVID trans­mis­si­bil­ity drop over time?

Daniel Dewey22 Nov 2022 19:54 UTC
8 points
2 comments1 min readLW link

AI will change the world, but won’t take it over by play­ing “3-di­men­sional chess”.

22 Nov 2022 18:57 UTC
133 points
98 comments24 min readLW link

Progress links and tweets, 2022-11-22

jasoncrawford22 Nov 2022 17:39 UTC
17 points
0 comments1 min readLW link
(rootsofprogress.org)

Tyranny of the Epistemic Majority

Scott Garrabrant22 Nov 2022 17:19 UTC
188 points
13 comments9 min readLW link1 review

A Walk­through of In-Con­text Learn­ing and In­duc­tion Heads (w/​ Charles Frye) Part 1 of 2

Neel Nanda22 Nov 2022 17:12 UTC
20 points
0 comments1 min readLW link
(www.youtube.com)

Sim­ple Im­prove­ment to Col­lege Foot­ball Over­time Rules

Zvi22 Nov 2022 17:00 UTC
10 points
0 comments1 min readLW link
(thezvi.wordpress.com)

Meta AI an­nounces Cicero: Hu­man-Level Di­plo­macy play (with di­alogue)

Jacy Reese Anthis22 Nov 2022 16:50 UTC
93 points
64 comments1 min readLW link
(www.science.org)

Austin LW meetup notes: The FTX Affair

jchan22 Nov 2022 14:01 UTC
20 points
3 comments16 min readLW link

Mo­ti­vated Cog­ni­tion and the Mul­ti­verse of Truth

Q Home22 Nov 2022 12:51 UTC
8 points
16 comments24 min readLW link

LessWrong read­ers are in­vited to ap­ply to the Lurkshop

22 Nov 2022 9:19 UTC
101 points
41 comments3 min readLW link

Gaox­ing Guy

Alok Singh22 Nov 2022 1:50 UTC
3 points
1 comment1 min readLW link
(alok.github.io)

Mis­cel­la­neous First-Pass Align­ment Thoughts

NickGabs21 Nov 2022 21:23 UTC
12 points
4 comments10 min readLW link

[Heb­bian Nat­u­ral Ab­strac­tions] Introduction

21 Nov 2022 20:34 UTC
34 points
3 comments4 min readLW link
(www.snellessen.com)

Utili­tar­i­anism Meets Egalitarianism

Scott Garrabrant21 Nov 2022 19:00 UTC
116 points
16 comments6 min readLW link1 review

In­ter­view with Matt Freeman

Evenflair21 Nov 2022 18:17 UTC
15 points
0 comments1 min readLW link
(overcast.fm)

Here’s the exit.

Valentine21 Nov 2022 18:07 UTC
61 points
178 comments10 min readLW link5 reviews

Benefits/​Risks of Scott Aaron­son’s Ortho­dox/​Re­form Fram­ing for AI Alignment

Jeremyy21 Nov 2022 17:54 UTC
2 points
1 comment1 min readLW link

[ASoT] Reflec­tivity in Nar­row AI

Ulisse Mini21 Nov 2022 0:51 UTC
6 points
1 comment1 min readLW link

Scott Aaron­son on “Re­form AI Align­ment”

shminux20 Nov 2022 22:20 UTC
39 points
17 comments1 min readLW link
(scottaaronson.blog)

On Mo­ral­ity, Ethics, and all that Jazz

Delen Heisman20 Nov 2022 20:00 UTC
4 points
4 comments2 min readLW link
(delen.substack.com)

Limits to the Con­trol­la­bil­ity of AGI

20 Nov 2022 19:18 UTC
11 points
2 comments9 min readLW link

Ca­reer Scout­ing: Dentistry

koratkar20 Nov 2022 15:55 UTC
67 points
5 comments5 min readLW link
(careerscouting.substack.com)

Re­fac­tor­ing My­self: 4 Years Later

Pausecafe20 Nov 2022 14:45 UTC
16 points
2 comments10 min readLW link

De­ci­sion The­ory but also Ghosts

eva_20 Nov 2022 13:24 UTC
17 points
21 comments10 min readLW link

ARC pa­per: For­mal­iz­ing the pre­sump­tion of independence

Erik Jenner20 Nov 2022 1:22 UTC
97 points
2 comments2 min readLW link
(arxiv.org)

Up­date to Mys­ter­ies of mode col­lapse: text-davinci-002 not RLHF

janus19 Nov 2022 23:51 UTC
71 points
8 comments2 min readLW link

Make the Drought Eva­po­rate!

AnthonyRepetto19 Nov 2022 23:41 UTC
32 points
25 comments3 min readLW link

Elas­tic Pro­duc­tivity Tools

Simon Berens19 Nov 2022 21:59 UTC
74 points
8 comments2 min readLW link
(simonberens.me)

A Short Dialogue on the Mean­ing of Re­ward Functions

19 Nov 2022 21:04 UTC
45 points
0 comments3 min readLW link

By De­fault, GPTs Think In Plain Sight

Fabien Roger19 Nov 2022 19:15 UTC
85 points
33 comments9 min readLW link

Re­view: Bayesian Statis­tics the Fun Way by Will Kurt

matto19 Nov 2022 18:52 UTC
4 points
2 comments2 min readLW link

log­i­cal vs in­dex­i­cal dignity

Tamsin Leake19 Nov 2022 12:43 UTC
27 points
2 comments2 min readLW link
(carado.moe)

[Question] How does acausal trade work in a de­ter­minis­tic mul­ti­verse?

sisyphus19 Nov 2022 1:50 UTC
2 points
13 comments1 min readLW link

Choos­ing the right dish

Adam Zerner19 Nov 2022 1:38 UTC
38 points
7 comments8 min readLW link

Reflec­tive Consequentialism

Adam Zerner18 Nov 2022 23:56 UTC
21 points
14 comments4 min readLW link

Value Created vs. Value Extracted

Sable18 Nov 2022 21:34 UTC
8 points
6 comments6 min readLW link
(affablyevil.substack.com)

gen­er­al­ized wireheading

Tamsin Leake18 Nov 2022 20:18 UTC
25 points
7 comments2 min readLW link
(carado.moe)

The Disas­trously Con­fi­dent And Inac­cu­rate AI

Sharat Jacob Jacob18 Nov 2022 19:06 UTC
13 points
0 comments13 min readLW link

How AI Fails Us: A non-tech­ni­cal view of the Align­ment Problem

testingthewaters18 Nov 2022 19:02 UTC
7 points
0 comments2 min readLW link
(ethics.harvard.edu)

[Question] Is there any policy for a fair treat­ment of AIs whose friendli­ness is in doubt?

nahoj18 Nov 2022 19:01 UTC
15 points
10 comments1 min readLW link

SBF, Pas­cal’s Mug­ging, and a Pro­posed Solution

Cole Killian18 Nov 2022 18:39 UTC
−1 points
5 comments5 min readLW link
(colekillian.com)

Distil­la­tion of “How Likely Is De­cep­tive Align­ment?”

NickGabs18 Nov 2022 16:31 UTC
24 points
4 comments10 min readLW link

Con­tra Chords

jefftk18 Nov 2022 16:20 UTC
12 points
1 comment7 min readLW link
(www.jefftk.com)

[Question] Up­dates on scal­ing laws for foun­da­tion mod­els from ′ Tran­scend­ing Scal­ing Laws with 0.1% Ex­tra Com­pute’

Nick_Greig18 Nov 2022 12:46 UTC
15 points
2 comments1 min readLW link

Hal­i­fax, NS – Monthly Ra­tion­al­ist, EA, and ACX Meetup

Ideopunk18 Nov 2022 11:45 UTC
10 points
0 comments1 min readLW link