Pascal: The Greatness and Littleness of Man, A Thinking Reed

NoBadCake · 10 Sep 2022 20:05 UTC
9 points
0 comments · 1 min read · LW link

[Job] Project Manager: Community Health (CEA)

Xodarap · 10 Sep 2022 18:40 UTC
3 points
0 comments · 1 min read · LW link
(www.centreforeffectivealtruism.org)

Unbounded utility functions and precommitment

MichaelStJules · 10 Sep 2022 16:16 UTC
4 points
3 comments · 1 min read · LW link

[Question] What is the “Less Wrong” approved acronym for 1984-risk?

Logan Zoellner · 10 Sep 2022 14:38 UTC
5 points
8 comments · 1 min read · LW link

Find out how utilitarian you are—a mega thread of philosophy polls

spencerg · 10 Sep 2022 14:05 UTC
8 points
3 comments · 1 min read · LW link
(twitter.com)

Put Dirty Dishes in the Dishwasher

jefftk · 10 Sep 2022 13:10 UTC
37 points
16 comments · 1 min read · LW link
(www.jefftk.com)

Join ASAP! (AI Safety Accountability Programme) 🚀

CallumMcDougall · 10 Sep 2022 11:15 UTC
19 points
0 comments · 3 min read · LW link

Quintin’s alignment papers roundup—week 1

Quintin Pope · 10 Sep 2022 6:39 UTC
120 points
6 comments · 9 min read · LW link

Path dependence in ML inductive biases

10 Sep 2022 1:38 UTC
68 points
13 comments · 10 min read · LW link

Keeping Time in Epoch Seconds

Gordon Seidoh Worley · 10 Sep 2022 0:28 UTC
11 points
2 comments · 2 min read · LW link

Ought will host a factored cognition “Lab Meeting”

9 Sep 2022 23:46 UTC
35 points
1 comment · 1 min read · LW link

Web4/Heaven—The Simulation

Dunning K. · 9 Sep 2022 22:58 UTC
10 points
2 comments · 1 min read · LW link

Evaluations project @ ARC is hiring a researcher and a webdev/engineer

Beth Barnes · 9 Sep 2022 22:46 UTC
99 points
7 comments · 10 min read · LW link

Swap and Scale

Stephen Fowler · 9 Sep 2022 22:41 UTC
17 points
3 comments · 1 min read · LW link

My emotional reaction to the current funding situation

Sam F. Brown · 9 Sep 2022 22:02 UTC
105 points
36 comments · 5 min read · LW link
(sambrown.eu)

AlexaTM − 20 Billion Parameter Model With Impressive Performance

MrThink · 9 Sep 2022 21:46 UTC
5 points
0 comments · 1 min read · LW link

[Fun][Link] Alignment SMBC Comic

Gunnar_Zarncke · 9 Sep 2022 21:38 UTC
7 points
2 comments · 1 min read · LW link
(www.smbc-comics.com)

Gatekeeper Victory: AI Box Reflection

9 Sep 2022 21:38 UTC
6 points
6 comments · 9 min read · LW link

Interpreting Affordable Housing

jefftk · 9 Sep 2022 19:40 UTC
16 points
0 comments · 1 min read · LW link
(www.jefftk.com)

London Rationalish Meetup 2022-09-11

calmiguana · 9 Sep 2022 18:39 UTC
1 point
0 comments · 1 min read · LW link

AI alignment with humans… but with which humans?

geoffreymiller · 9 Sep 2022 18:21 UTC
12 points
33 comments · 3 min read · LW link

[Question] Should you refrain from having children because of the risk posed by artificial intelligence?

Mientras · 9 Sep 2022 17:39 UTC
17 points
31 comments · 1 min read · LW link

Notes on Resolve

David Gross · 9 Sep 2022 16:42 UTC
9 points
1 comment · 31 min read · LW link

ethics and anthropics of homomorphically encrypted computations

Tamsin Leake · 9 Sep 2022 10:49 UTC
47 points
49 comments · 3 min read · LW link
(carado.moe)

Oversight Leagues: The Training Game as a Feature

Paul Bricman · 9 Sep 2022 10:08 UTC
20 points
6 comments · 10 min read · LW link

Understanding and avoiding value drift

TurnTrout · 9 Sep 2022 4:16 UTC
43 points
9 comments · 6 min read · LW link

Samotsvety’s AI risk forecasts

elifland · 9 Sep 2022 4:01 UTC
44 points
0 comments · 4 min read · LW link

Most People Start With The Same Few Bad Ideas

johnswentworth · 9 Sep 2022 0:29 UTC
162 points
30 comments · 3 min read · LW link

Monitoring for deceptive alignment

evhub · 8 Sep 2022 23:07 UTC
135 points
8 comments · 9 min read · LW link

[An email with a bunch of links I sent an experienced ML researcher interested in learning about Alignment / x-safety.]

David Scott Krueger (formerly: capybaralet) · 8 Sep 2022 22:28 UTC
47 points
1 comment · 5 min read · LW link

Progress links & tweets, 2022-09-08

jasoncrawford · 8 Sep 2022 20:43 UTC
13 points
3 comments · 1 min read · LW link
(rootsofprogress.org)

Turning WhatsApp Chat Data into Prompt-Response Form for Fine-Tuning

hatta_afiq · 8 Sep 2022 20:05 UTC
1 point
0 comments · 1 min read · LW link

Postmortem: Trying out for Manifold Markets

8 Sep 2022 17:54 UTC
24 points
0 comments · 3 min read · LW link

Thoughts on AGI consciousness / sentience

Steven Byrnes · 8 Sep 2022 16:40 UTC
38 points
37 comments · 6 min read · LW link

A rough idea for solving ELK: An approach for training generalist agents like GATO to make plans and describe them to humans clearly and honestly.

Michael Soareverix · 8 Sep 2022 15:20 UTC
2 points
2 comments · 2 min read · LW link

What Should AI Owe To Us? Accountable and Aligned AI Systems via Contractualist AI Alignment

xuan · 8 Sep 2022 15:04 UTC
32 points
15 comments · 25 min read · LW link

ACX Book Review Discussion

Screwtape · 8 Sep 2022 14:22 UTC
5 points
0 comments · 1 min read · LW link

Covid 9/8/22: Booster Boosting

Zvi · 8 Sep 2022 13:50 UTC
34 points
9 comments · 24 min read · LW link
(thezvi.wordpress.com)

Solar Blackout Resistance

jefftk · 8 Sep 2022 13:30 UTC
69 points
32 comments · 3 min read · LW link
(www.jefftk.com)

All AGI safety questions welcome (especially basic ones) [Sept 2022]

plex · 8 Sep 2022 11:56 UTC
22 points
48 comments · 2 min read · LW link

[Question] Sequences/Eliezer essays beyond those in AI to Zombies?

Domenic · 8 Sep 2022 5:05 UTC
4 points
4 comments · 1 min read · LW link

Linkpost: Github Copilot productivity experiment

Daniel Kokotajlo · 8 Sep 2022 4:41 UTC
88 points
4 comments · 1 min read · LW link
(github.blog)

OpenPrinciples Bootcamp (Free) -- Reflect & Act on your Rationality Principles.

ti_guo · 8 Sep 2022 3:06 UTC
6 points
3 comments · 4 min read · LW link

Searching for Modularity in Large Language Models

8 Sep 2022 2:25 UTC
44 points
3 comments · 14 min read · LW link

90% of anything should be bad (& the precision-recall tradeoff)

cartografie · 8 Sep 2022 1:20 UTC
33 points
22 comments · 6 min read · LW link

How to Do Research. v1

Pablo Repetto · 8 Sep 2022 1:08 UTC
29 points
4 comments · 41 min read · LW link
(pabloernesto.github.io)

Galaxy Trucker Needs a New Second Half

jefftk · 7 Sep 2022 20:10 UTC
13 points
7 comments · 1 min read · LW link
(www.jefftk.com)

[Question] In a lack of data, how should you weigh credences in theoretical physics’s Theories of Everything, or TOEs?

Noosphere89 · 7 Sep 2022 18:25 UTC
7 points
11 comments · 1 min read · LW link

Generators Of Disagreement With AI Alignment

George3d6 · 7 Sep 2022 18:15 UTC
27 points
9 comments · 9 min read · LW link
(www.epistem.ink)

Shrödinger’s lottery or: Why you are going to live forever

Chase Dowdell · 7 Sep 2022 18:13 UTC
1 point
2 comments · 4 min read · LW link